Related questions and solutions (0)

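Opening a 25GB text file for processing

I have a 25GB file that I need to process. This is what I'm currently doing, but it takes a very long time to open: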

import os

collection_pricing = os.path.join(pricing_directory, 'collection_price')
with open(collection_pricing, 'r') as f:
    collection_contents = f.readlines()

length_of_file = len(collection_contents)

for num, line in enumerate(collection_contents):
    print('%s / %s' % (num+1, length_of_file))
    cursor.execute(...)


How can I improve this?
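One common way to avoid the long startup is to iterate over the file object directly instead of calling readlines(), which builds a list of every line in memory before the loop starts. The following is a minimal sketch under that assumption, not a tested answer for this exact workload. The names process_lines, handle_line, and report_every are hypothetical; handle_line stands in for whatever the cursor.execute(...) call does in the snippet above.

```python
import os


def process_lines(path, handle_line, report_every=1000000):
    """Stream a text file one line at a time and return the number of lines seen.

    handle_line is a hypothetical callback standing in for the per-line work
    (for example, the database insert in the question).
    """
    count = 0
    with open(path, "r") as f:
        # Iterating over the file object reads lazily, so memory use stays
        # roughly constant regardless of file size.
        for count, line in enumerate(f, start=1):
            handle_line(line)
            if count % report_every == 0:
                print("%d lines processed" % count)
    return count
```

Because the file is never fully loaded, the total line count is not known up front, so this sketch reports a running count rather than "num / total". If a percentage is needed, comparing bytes read so far against os.path.getsize(path) is one possible substitute.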

python performance

Dav*_*542

2014 09-17

4
Score

1
Answers

960
Views

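How do I loop until the end of a file in Python without checking for an empty line?

I'm writing an assignment that counts the number of vowels in a file. So far in my class we have only used code like this to check for the end of the file: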

vowel=0
f=open("filename.txt","r",encoding="utf-8" )
line=f.readline().strip()
while line!="":
    for j in range (len(line)):
        if line[j].lower() in "aeiou":  # str has no isvowel() method, so test membership instead
            vowel+=1

    line=f.readline().strip()


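This time, however, the input file our professor gave us for the assignment is an entire essay, so there are several blank lines throughout the text separating paragraphs and the like. That means my current code only counts up to the first blank line.

Is there a way to check whether my file has reached its end other than checking whether the line is empty? Preferably in a way similar to how my code works now, where it checks something on each iteration of the while loop.

Thanks in advance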
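For reference, a blank line inside a file is read as "\n", while readline() returns the empty string only at end of file. Stripping before the comparison is what makes the two indistinguishable. The sketch below keeps the same while-loop shape and simply compares the unstripped line; count_vowels and VOWELS are hypothetical names, and treating only a, e, i, o, u as vowels is an assumption.

```python
VOWELS = set("aeiouAEIOU")  # assumption: only these letters count as vowels


def count_vowels(path):
    """Count vowels in a text file, reading past blank lines to the real end."""
    vowel = 0
    with open(path, "r", encoding="utf-8") as f:
        line = f.readline()
        # readline() returns "" only at end of file; a blank line is "\n",
        # so the loop does not stop at paragraph breaks.
        while line != "":
            vowel += sum(1 for ch in line if ch in VOWELS)
            line = f.readline()
    return vowel
```

Writing the loop as "for line in f:" would behave the same way and is the more usual idiom, but the while form matches the structure asked for in the question.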

python file-io loops python-3.x

ay *_*mao

lucky-day

4
Score

1
Answers

90k
Views