使用 csv.DictWriter 输出内存中的 gzipped csv 文件？

Question

使用 csv.DictWriter 输出内存中的 gzipped csv 文件？

fel*_*onc 4 python csv io gzip python-3.x

我想使用DictWriterPython 的csv模块生成一个使用 GZip 压缩的 .csv 文件。我需要在内存中完成这一切，因此不可能使用本地文件。

但是，我在处理 Python 3 中每个模块的类型要求时遇到了麻烦。假设我得到了正确的一般结构，我不能让两个模块一起工作，因为DictWriter需要写入io.StringIO缓冲区，而GZip需要一个io.BytesIO对象。

所以，当我尝试做：

buffer = io.BytesIO()
compressed = gzip.GzipFile(fileobj=buffer, mode='wb')
dict_writer = csv.DictWriter(buffer, ["a", "b"], extrasaction="ignore")

Run Code Online (Sandbox Code Playgroud)

我得到：

TypeError: a bytes-like object is required, not 'str'

尝试使用io.StringIOwithGZip也不起作用。我该怎么办？

Answer 1

blh*_*ing 9

您可以使用io.TextIOWrapper将文本流无缝转换为二进制流：

import io
import gzip
import csv
buffer = io.BytesIO()
with gzip.GzipFile(fileobj=buffer, mode='wb') as compressed:
    with io.TextIOWrapper(compressed, encoding='utf-8') as wrapper:
        dict_writer = csv.DictWriter(wrapper, ["a", "b"], extrasaction="ignore")
        dict_writer.writeheader()
        dict_writer.writerows([{'a': 1, 'b': 2}, {'a': 4, 'b': 3}])
print(buffer.getvalue()) # dump the compressed binary data
buffer.seek(0)
dict_reader = csv.DictReader(io.TextIOWrapper(gzip.GzipFile(fileobj=buffer, mode='rb'), encoding='utf-8'))
print(list(dict_reader)) # see if uncompressing the compressed data gets us back what we wrote

Run Code Online (Sandbox Code Playgroud)

这输出：

b'\x1f\x8b\x08\x00\x9c6[\\\x02\xffJ\xd4I\xe2\xe5\xe52\xd41\x02\x92&:\xc6@\x12\x00\x00\x00\xff\xff\x03\x00\x85k\xa2\x9e\x12\x00\x00\x00'
[OrderedDict([('a', '1'), ('b', '2')]), OrderedDict([('a', '4'), ('b', '3')])]

Run Code Online (Sandbox Code Playgroud)

归档时间：	7 年前
查看次数：	1677 次
最近记录：	7 年前