Python gzip:有没有办法从字符串解压缩?

jld*_*ont 29 python gzip

我已经阅读了这篇关于这个问题的SO帖子无济于事.

我正在尝试解压缩来自URL的.gz文件.

url_file_handle=StringIO( gz_data )
gzip_file_handle=gzip.open(url_file_handle,"r")
decompressed_data = gzip_file_handle.read()
gzip_file_handle.close()
Run Code Online (Sandbox Code Playgroud)

...但我得到TypeError:强制转换为Unicode:需要字符串或缓冲区,找到cStringIO.StringI

这是怎么回事?

Traceback (most recent call last):  
  File "/opt/google/google_appengine-1.2.5/google/appengine/tools/dev_appserver.py", line 2974, in _HandleRequest
    base_env_dict=env_dict)
  File "/opt/google/google_appengine-1.2.5/google/appengine/tools/dev_appserver.py", line 411, in Dispatch
    base_env_dict=base_env_dict)
  File "/opt/google/google_appengine-1.2.5/google/appengine/tools/dev_appserver.py", line 2243, in Dispatch
    self._module_dict)
  File "/opt/google/google_appengine-1.2.5/google/appengine/tools/dev_appserver.py", line 2161, in ExecuteCGI
    reset_modules = exec_script(handler_path, cgi_path, hook)
  File "/opt/google/google_appengine-1.2.5/google/appengine/tools/dev_appserver.py", line 2057, in ExecuteOrImportScript
    exec module_code in script_module.__dict__
  File "/home/jldupont/workspace/jldupont/trunk/site/app/server/tasks/debian/repo_fetcher.py", line 36, in <module>
    main()
  File "/home/jldupont/workspace/jldupont/trunk/site/app/server/tasks/debian/repo_fetcher.py", line 30, in main
    gziph=gzip.open(fh,'r')
  File "/usr/lib/python2.5/gzip.py", line 49, in open
    return GzipFile(filename, mode, compresslevel)
  File "/usr/lib/python2.5/gzip.py", line 95, in __init__
    fileobj = self.myfileobj = __builtin__.open(filename, mode or 'rb')
TypeError: coercing to Unicode: need string or buffer, cStringIO.StringI found
Run Code Online (Sandbox Code Playgroud)

小智 46

如果您的数据已经在字符串中,请尝试zlib,它声称完全兼容gzip:

import zlib
decompressed_data = zlib.decompress(gz_data, 16+zlib.MAX_WBITS)
Run Code Online (Sandbox Code Playgroud)

了解更多:http://docs.python.org/library/zlib.html

  • 这也是我一开始就遇到的.但它很快证明它不起作用:"zlib.error:解压缩数据时出错-3:错误的标题检查" (8认同)
  • 添加参数使调用看起来像`decompressed_data = zlib.decompress(gz_data,16 + zlib.MAX_WBITS)`,对我来说就像一个魅力.感谢http://stackoverflow.com/a/2695575/3635816 (6认同)

Jeh*_*iah 36

gzip.open是打开文件的简写,你想要的是gzip.GzipFile你可以传递fileobj

open(filename, mode='rb', compresslevel=9)
    #Shorthand for GzipFile(filename, mode, compresslevel).
Run Code Online (Sandbox Code Playgroud)

VS

class GzipFile
   __init__(self, filename=None, mode=None, compresslevel=9, fileobj=None)
   #    At least one of fileobj and filename must be given a non-trivial value.
Run Code Online (Sandbox Code Playgroud)

所以这对你有用

gzip_file_handle = gzip.GzipFile(fileobj=url_file_handle)
Run Code Online (Sandbox Code Playgroud)


Mon*_*oya 8

您可以使用gzip.decompressgzip 内置 Python 库(适用于 Python 3.2+)。

有关如何解压缩字节的示例:

import gzip
gzip.decompress(gzip_data)
Run Code Online (Sandbox Code Playgroud)

文档

https://docs.python.org/3.5/library/gzip.html#gzip.decompress