wis*_*shi 27 python csv sorting parsing
I want to sort a CSV table by date. It started out as a simple task:
import sys
import csv

reader = csv.reader(open("files.csv"), delimiter=";")
for id, path, title, date, author, platform, type, port in reader:
    print(date)
I am using Python's csv module to read a file with this structure:
id;file;description;date;author;platform;type;port
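The ideal solution would be a dedicated CSV client that handles the file for me. I have not found anything like that.
I hope someone here knows some nice sorting magic ;)
Thanks,
Marius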
Ign*_*ams 66
import operator
sortedlist = sorted(reader, key=operator.itemgetter(3), reverse=True)
Or use a lambda:
sortedlist = sorted(reader, key=lambda row: row[3], reverse=True)
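Both versions compare the date column as plain strings, which only yields chronological order when the dates are written in a sortable form such as ISO 8601, and they sort a header line in with the data if the file has one. A minimal sketch that skips the header and parses the dates first, assuming a `%Y-%m-%d` date format (the question does not state the format); `sort_rows_by_date` and the sample rows are hypothetical:

```python
import csv
import io
from datetime import datetime

# Assumed date format; the question does not say how dates are written.
DATE_FORMAT = "%Y-%m-%d"

def sort_rows_by_date(lines, date_index=3, newest_first=True):
    """Return (header, rows) with rows sorted by the parsed date column."""
    reader = csv.reader(lines, delimiter=";")
    header = next(reader)  # keep the header line out of the sort
    rows = sorted(
        reader,
        key=lambda row: datetime.strptime(row[date_index], DATE_FORMAT),
        reverse=newest_first,
    )
    return header, rows

# Hypothetical sample data in the question's column layout.
sample = io.StringIO(
    "id;file;description;date;author;platform;type;port\n"
    "1;a.txt;first;2009-03-01;x;linux;local;0\n"
    "2;b.txt;second;2010-01-15;y;windows;remote;80\n"
    "3;c.txt;third;2008-12-24;z;linux;dos;0\n"
)
header, rows = sort_rows_by_date(sample)
print([row[0] for row in rows])
```

If the real file uses another date format, only `DATE_FORMAT` should need to change.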
tel*_*t99 12
The reader behaves like a generator. On a file with some dummy data:
>>> import sys, csv
>>> data = csv.reader(open('data.csv'),delimiter=';')
>>> data
<_csv.reader object at 0x1004a11a0>
>>> next(data)
['a', ' b', ' c']
>>> next(data)
['x', ' y', ' z']
>>> next(data)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
StopIteration
Using operator.itemgetter as Ignacio suggests:
>>> data = csv.reader(open('data.csv'),delimiter=';')
>>> import operator
>>> sortedlist = sorted(data, key=operator.itemgetter(2), reverse=True)
>>> sortedlist
[['x', ' y', ' z'], ['a', ' b', ' c']]
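Because the reader is consumed as it is iterated, a second pass over the same reader yields nothing. A small sketch of that behaviour, using an in-memory `io.StringIO` as a hypothetical stand-in for data.csv; copying the rows into a list once makes it possible to sort them several ways:

```python
import csv
import io

# Hypothetical in-memory stand-in for data.csv.
source = io.StringIO("a; b; c\nx; y; z\n")
data = csv.reader(source, delimiter=";")

rows = list(data)      # consumes the reader
leftover = list(data)  # the reader is exhausted now, so this is empty

# The list can be sorted as often as needed.
by_last = sorted(rows, key=lambda row: row[2], reverse=True)
by_first = sorted(rows, key=lambda row: row[0])

print(len(rows), len(leftover))
print(by_last[0], by_first[0])
```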
Sorting by multiple columns (sort by column_1, then by column_2):
import csv

# Read every row while the input file is still open.
with open('unsorted.csv', newline='') as csvfile:
    spamreader = csv.DictReader(csvfile, delimiter=";")
    sortedlist = sorted(spamreader, key=lambda row: (row['column_1'], row['column_2']), reverse=False)

with open('sorted.csv', 'w', newline='') as f:
    fieldnames = ['column_1', 'column_2', 'column_3']
    writer = csv.DictWriter(f, fieldnames=fieldnames)
    writer.writeheader()
    for row in sortedlist:
        writer.writerow(row)
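A tuple key sorts every column in the same direction. If the columns need different directions, one option is to rely on `sorted()` being stable and sort in two passes, secondary key first. This is only a sketch under assumptions: the sample data is hypothetical, values are compared as strings, and the output is written with the same `;` delimiter as the input.

```python
import csv
import io
from operator import itemgetter

# Hypothetical data; the column names follow the example above.
source = io.StringIO(
    "column_1;column_2;column_3\n"
    "b;1;x\n"
    "a;2;y\n"
    "a;3;z\n"
    "b;4;w\n"
)
rows = list(csv.DictReader(source, delimiter=";"))

# sorted() is stable: sort by the secondary key first, then by the primary key.
rows = sorted(rows, key=itemgetter("column_2"), reverse=True)  # column_2 descending (as strings)
rows = sorted(rows, key=itemgetter("column_1"))                # column_1 ascending

# Write the result to an in-memory buffer, keeping ";" as the delimiter.
out = io.StringIO()
writer = csv.DictWriter(
    out, fieldnames=["column_1", "column_2", "column_3"], delimiter=";"
)
writer.writeheader()
writer.writerows(rows)
print(out.getvalue())
```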