lxml中的POST方法表单使用submit_form引发TypeError

Mor*_*hit 10 python lxml python-3.x

我正在尝试使用lxml提交POST方法表单,我得到一个TypeError.这是引发此错误的最小示例:

>>> import lxml.html
>>> page = lxml.html.parse("http://www.webcom.com/html/tutor/forms/start.shtml")
>>> form = page.getroot().forms[0]
>>> form.fields['your_name'] = 'Morphit'
>>> result = lxml.html.parse(lxml.html.submit_form(form))
    Traceback (most recent call last):
          File "<stdin>", line 1, in <module>
            File "/usr/lib/python3.3/site-packages/lxml/html/__init__.py", line 887, in submit_form
              return open_http(form.method, url, values)
            File "/usr/lib/python3.3/site-packages/lxml/html/__init__.py", line 907, in open_http_urllib
              return urlopen(url, data)
            File "/usr/lib/python3.3/urllib/request.py", line 160, in urlopen
              return opener.open(url, data, timeout)
            File "/usr/lib/python3.3/urllib/request.py", line 471, in open
              req = meth(req)
            File "/usr/lib/python3.3/urllib/request.py", line 1183, in do_request_
              raise TypeError(msg)
          TypeError: POST data should be bytes or an iterable of bytes. It cannot be of type str.
Run Code Online (Sandbox Code Playgroud)

我在网上其他地方找到了确切的错误,但我还没有看到它像lxml这样生成.有谁知道这是一个错误,或预期的行为以及如何解决它?

小智 1

来自https://github.com/lxml/lxml/pull/122/files

“在 python3 中,urlopen 需要 POST 数据的字节流。此补丁在传输之前将数据编码为 utf-8。” 在 src/lxml/html/__init__.py 中,更改第 918 行,

data = urlencode(values)
Run Code Online (Sandbox Code Playgroud)

data = urlencode(values).encode('utf-8')
Run Code Online (Sandbox Code Playgroud)