目标
我从hotmail下载了一个CSV文件,但它有很多重复项.这些副本是完整的副本,我不知道为什么我的手机创建它们.
我想摆脱重复.
途径
编写一个python脚本来删除重复项.
技术规格
Windows XP SP 3 Python 2.7 CSV file with 400 contacts
嗨,我一直在试验Winston Ewert的代码示例.
但我无法关闭输入和输出文件.我究竟做错了什么?
write_outfile.close()
write_infile.close()
回溯(最近一次调用最后一次):文件"Duplicates_01.py",第26行,在write_outfile.close()中AttributeError:'_ csv.writer'对象没有属性'close'
import csv
write_infile = csv.reader(open('File1.csv', 'r'))
write_outfile = csv.writer(open('File2.csv', 'w'))
#write_infile = open('File1.csv', 'r')
#f1 = csv.reader(write_infile)
#f1 = csv.reader(write_infile, delimiter=' ')
#write_outfile = open('File2.csv', 'w')
#f2 = csv.writer(write_outfile)
#f2 = csv.writer(write_outfile, delimiter=' ')
phone_numbers = set()
for row in write_infile:
if row[1] not in phone_numbers:
write_outfile.writerow(row)
# f2.writerow(row)
phone_numbers.add(row[1])
# write_outfile.close()
# write_infile.close()
Run Code Online (Sandbox Code Playgroud)
File1.csv
user, phone, email
joe, 123, joe@x.com
mary, 456, mary@x.com
ed, 123, ed@x.com
Run Code Online (Sandbox Code Playgroud)