我对python很新.我需要创建一个将csv数据加载到字典中的类.
我希望能够控制键和值所以让我们说下面的代码,我可以随时拉出worker1.name或worker1.age.
class ageName(object):
'''class to represent a person'''
def __init__(self, name, age):
self.name = name
self.age = age
worker1 = ageName('jon', 40)
worker2 = ageName('lise', 22)
#Now if we print this you see that it`s stored in a dictionary
print worker1.__dict__
print worker2.__dict__
#
'''
{'age': 40, 'name': 'jon'}
#
{'age': 22, 'name': 'lise'}
#
'''
#
#when we call (key)worker1.name we are getting the (value)
print worker1.name
#
'''
#
jon
#
'''
Run Code Online (Sandbox Code Playgroud)
但我坚持将我的csv数据加载到键和值.
[1]我想创建自己的密钥worker1 = ageName([name],[age],[id],[gender])
[2]每个[name],[age],[id]和[gender]来自csv数据文件中的特定列
我真的不知道如何解决这个问题.我尝试了很多方法但是我失败了.我需要一些帮助来开始这个.
----编辑这是我的原始代码
import csv
# let us first make student an object
class Student():
def __init__(self):
self.fname = []
self.lname = []
self.ID = []
self.sport = []
# let us read this file
for row in list(csv.reader(open("copy-john.csv", "rb")))[1:]:
self.fname.append(row[0])
self.lname.append(row[1])
self.ID.append(row[2])
self.sport.append(row[3])
def Tableformat(self):
print "%-14s|%-10s|%-5s|%-11s" %('First Name','Last Name','ID','Favorite Sport')
print "-" * 45
for (i, fname) in enumerate(self.fname):
print "%-14s|%-10s|%-5s|%3s" %(fname,self.lname[i],self.ID[i],self.sport[i])
def Table(self):
print self.lname
class Database(Student):
def __init__(self):
g = 0
choice = ['Basketball','Football','Other','Baseball','Handball','Soccer','Volleyball','I do not like sport']
data = student.sport
k = len(student.fname)
print k
freq = {}
for i in data:
freq[i] = freq.get(i, 0) + 1
for i in choice:
if i not in freq:
freq[i] = 0
print i, freq[i]
student = Student()
database = Database()
Run Code Online (Sandbox Code Playgroud)
这是我目前的代码(不完整)
import csv
class Student(object):
'''class to represent a person'''
def __init__(self, lname, fname, ID, sport):
self.lname = lname
self.fname = fname
self.ID = ID
self.sport = sport
reader = csv.reader(open('copy-john.csv'), delimiter=',', quotechar='"')
student = [Student(row[0], row[1], row[2], row[3]) for row in reader][1::]
print "%-14s|%-10s|%-5s|%-11s" %('First Name','Last Name','ID','Favorite Sport')
print "-" * 45
for i in range(len(student)):
print "%-14s|%-10s|%-5s|%3s" %(student[i].lname,student[i].fname,student[i].ID,student[i].sport)
choice = ['Basketball','Football','Other','Baseball','Handball','Soccer','Volleyball','I do not like sport']
lst = []
h = 0
k = len(student)
# 23
for i in range(len(student)):
lst.append(student[i].sport) # merge together
for a in set(lst):
print a, lst.count(a)
for i in set(choice):
if i not in set(lst):
lst.append(i)
lst.count(i) = 0
print lst.count(i)
Run Code Online (Sandbox Code Playgroud)
Tor*_*amo 12
import csv
reader = csv.reader(open('workers.csv', newline=''), delimiter=',', quotechar='"')
workers = [ageName(row[0], row[1]) for row in reader]
Run Code Online (Sandbox Code Playgroud)
工人现在有一份所有工人的清单
>>> workers[0].name
'jon'
Run Code Online (Sandbox Code Playgroud)
问题被修改后添加了编辑
你有没有理由使用旧式课程?我在这里使用新款式.
class Student:
sports = []
def __init__(self, row):
self.lname, self.fname, self.ID, self.sport = row
self.sports.append(self.sport)
def get(self):
return (self.lname, self.fname, self.ID, self.sport)
reader = csv.reader(open('copy-john.csv'), delimiter=',', quotechar='"')
print "%-14s|%-10s|%-5s|%-11s" % tuple(reader.next()) # read header line from csv
print "-" * 45
students = list(map(Student, reader)) # read all remaining lines
for student in students:
print "%-14s|%-10s|%-5s|%3s" % student.get()
# Printing all sports that are specified by students
for s in set(Student.sports): # class attribute
print s, Student.sports.count(s)
# Printing sports that are not picked
allsports = ['Basketball','Football','Other','Baseball','Handball','Soccer','Volleyball','I do not like sport']
for s in set(allsports) - set(Student.sports):
print s, 0
Run Code Online (Sandbox Code Playgroud)
希望这能为您提供有关python序列功能的一些想法.;)
编辑2,尽可能缩短......只是为了炫耀:P
女士们,先生们,7(.5)行.
allsports = ['Basketball','Football','Other','Baseball','Handball',
'Soccer','Volleyball','I do not like sport']
sports = []
reader = csv.reader(open('copy-john.csv'))
for row in reader:
if reader.line_num: sports.append(s[3])
print "%-14s|%-10s|%-5s|%-11s" % tuple(s)
for s in allsports: print s, sports.count(s)
Run Code Online (Sandbox Code Playgroud)
我知道这是一个非常古老的问题,但是不可能阅读这个,而不是想到令人惊讶的新(ish)Python库pandas
.它的主要分析单元是一个名为DataFrame的思想,它以R处理数据的方式建模.
假设你有一个(非常愚蠢的)csv文件example.csv
,它看起来像这样:
day,fruit,sales
Monday,Banana,10
Monday,Orange,20
Tuesday,Banana,12
Tuesday,Orange,22
Run Code Online (Sandbox Code Playgroud)
如果你想在双倍快速的时间内阅读csv,并且对它进行"填充",那么为了简洁或易于使用,你将很难击败以下代码:
>>> import pandas as pd
>>> csv = pd.read_csv('example.csv')
>>> csv
day fruit sales
0 Monday Banana 10
1 Monday Orange 20
2 Tuesday Banana 12
3 Tuesday Orange 22
>>> csv[csv.fruit=='Banana']
day fruit sales
0 Monday Banana 10
2 Tuesday Banana 12
>>> csv[(csv.fruit=='Banana') & (csv.day=='Monday')]
day fruit sales
0 Monday Banana 10
Run Code Online (Sandbox Code Playgroud)
在我看来,这真是太棒了.永远不要再次遍历csv.reader对象!
我是马克的第二个建议.特别是,从csv模块查看DictReader,它允许将逗号分隔(或一般分隔)文件作为字典读取.
查看PyMotW对csv模块的介绍,以获得快速参考和DictReader,DictWriter的使用示例