为什么 csv.DictReader 会跳过空行?

3 python csv file

即使已设置,似乎也会csv.DictReader跳过空行。restval使用以下命令,将跳过输入文件中的空行:

import csv
CSV_FIELDS = ("field1", "field2", "field3")
for row in csv.DictReader(open("f"), fieldnames=CSV_FIELDS, restval=""):
    if not row or not row[CSV_FIELDS[0]]:
        sys.exit("never reached, why?")
Run Code Online (Sandbox Code Playgroud)

文件所在位置f

1,2,3


a,b,c
Run Code Online (Sandbox Code Playgroud)

unu*_*tbu 6

在 csv.DictReader 类内部

    # unlike the basic reader, we prefer not to return blanks,
    # because we will typically wind up with a dict full of None
    # values
    while row == []:
        row = self.reader.next()
Run Code Online (Sandbox Code Playgroud)

因此会跳过空行。如果您不想跳过空行,则可以使用csv.reader.

另一种选择是子类化csv.DictReader

import csv
CSV_FIELDS = ("field1", "field2", "field3")

class MyDictReader(csv.DictReader):
    def next(self):
        if self.line_num == 0:
            # Used only for its side effect.
            self.fieldnames
        row = self.reader.next()
        self.line_num = self.reader.line_num

        d = dict(zip(self.fieldnames, row))
        lf = len(self.fieldnames)
        lr = len(row)
        if lf < lr:
            d[self.restkey] = row[lf:]
        elif lf > lr:
            for key in self.fieldnames[lr:]:
                d[key] = self.restval
        return d

for row in MyDictReader(open("f", 'rb'), fieldnames=CSV_FIELDS, restval=""):
    print(row)
Run Code Online (Sandbox Code Playgroud)

产量

{'field2': '2', 'field3': '3', 'field1': '1'}
{'field2': '', 'field3': '', 'field1': ''}
{'field2': '', 'field3': '', 'field1': ''}
{'field2': 'b', 'field3': 'c', 'field1': 'a'}
Run Code Online (Sandbox Code Playgroud)