fre*_*ish 12 python csv quoting
我正在寻找一种方式来定义自定义quoting用csv.writerPython编写的.有4种内置的方法来测量值:
csv.QUOTE_ALL, csv.QUOTE_MINIMAL, csv.QUOTE_NONNUMERIC, csv.QUOTE_NONE
Run Code Online (Sandbox Code Playgroud)
但是我需要一个能模仿Postgres的引用机制FORCE QUOTE *,即它会引用所有非None值.使用csv.QUOTE_ALLPython会将None转为''但我希望有空字符串.
是否有可能使用内置csv模块(我对hacks不感兴趣,我已经这样做了:P)?或者我被迫写/获得一些自定义的csv解析器?
一般来说:是否可以为csv模块编写自定义引用机制?
Mar*_*ers 11
禁用csv引用并自行添加引号:
def quote(col):
if col is None:
return ''
# uses double-quoting style to escape existing quotes
return '"{}"'.format(str(col).replace('"', '""'))
writer = csv.writer(fileobj, quoting=csv.QUOTE_NONE, escapechar='', quotechar='')
for row in rows:
writer.writerow(map(quote, row))
Run Code Online (Sandbox Code Playgroud)
通过设置escapechar和quotechar清空字符串,您可以避免模块引用已经引用的值.
只要您不在csv值中使用分隔符,上述工作就可以了.
请注意,到这时候,自己编写逗号分隔的行会更容易:
with open(filename, 'w'), fd:
for row in rows:
fd.write(','.join(map(quote, row)) + '\r\n')
Run Code Online (Sandbox Code Playgroud)
我写了自己的csv编写器,它完全符合我的要求:
class PostgresCSVWriter(object):
def __init__(self, stream, quotechar="\"", delimiter=",", escapechar="\\"):
self.stream = stream
self.quotechar = quotechar
self.delimiter = delimiter
self.escapechar = escapechar
self.buffer_size = 16384
def _convert_value(self, obj):
if obj is None:
return ""
value = str(obj)
value = value.replace(self.quotechar, self.quotechar+self.quotechar)
value = value.replace(self.delimiter, self.escapechar+self.delimiter)
return self.quotechar+value+self.quotechar
def _convert_row(self, row):
return self.delimiter.join(self._convert_value(v) for v in row) + "\r\n"
def writerow(self, row):
self.stream.write(self._convert_row(row))
def writerows(self, rows):
data = ""
counter = 0
for row in rows:
buf = self._convert_row(row)
data += buf
counter += len(buf)
if counter >= self.buffer_size:
self.stream.write(data)
data = ""
counter = 0
if data:
self.stream.write(data)
Run Code Online (Sandbox Code Playgroud)
如果有人发现任何问题,请告诉我.我仍在寻找带csv模块的解决方案.