Finding the value of an input field in an HTML doc using Python

Vla*_*lad 4 python

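I am trying to get input values from an HTML document and want to parse the values of hidden input fields. For example, how can I use Python to extract only the values from the snippet below?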

    <input type="hidden" autocomplete="off" id="post_form_id" name="post_form_id" value="d619a1eb3becdc05a3ebea530396782f" />
    <input type="hidden" name="fb_dtsg" value="AQCYsohu" autocomplete="off" />

The Python function should return output like this:

post_form_id : d619a1eb3becdc05a3ebea530396782f
fb_dtsg : AQCYsohu

jte*_*ace 7

You can use BeautifulSoup:

>>> htmlstr = """    <input type="hidden" autocomplete="off" id="post_form_id" name="post_form_id" value="d619a1eb3becdc05a3ebea530396782f" />
...     <input type="hidden" name="fb_dtsg" value="AQCYsohu" autocomplete="off" />"""
>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup(htmlstr, "html.parser")
>>> [(n['name'], n['value']) for n in soup.find_all('input')]
[('post_form_id', 'd619a1eb3becdc05a3ebea530396782f'), ('fb_dtsg', 'AQCYsohu')]
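
If installing a third-party package is not an option, the same extraction can probably be done with the standard library's `html.parser` module. The following is a minimal sketch, not part of the original answer: the class name `HiddenInputParser` and the helper `hidden_inputs` are hypothetical names chosen for illustration. It assumes Python 3.7+ (where dicts keep insertion order) and only collects `<input>` tags whose `type` is `hidden`, as the question asks.

```python
from html.parser import HTMLParser


class HiddenInputParser(HTMLParser):
    """Collect name/value pairs from <input type="hidden"> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <input ... /> also reach this method,
        # because the default handle_startendtag() delegates to it.
        if tag != "input":
            return
        attr = dict(attrs)
        is_hidden = (attr.get("type") or "").lower() == "hidden"
        if is_hidden and attr.get("name"):
            # A missing value attribute is stored as an empty string.
            self.fields[attr["name"]] = attr.get("value") or ""


def hidden_inputs(html):
    """Return a dict mapping each hidden input's name to its value."""
    parser = HiddenInputParser()
    parser.feed(html)
    parser.close()
    return parser.fields


htmlstr = """
<input type="hidden" autocomplete="off" id="post_form_id" name="post_form_id" value="d619a1eb3becdc05a3ebea530396782f" />
<input type="hidden" name="fb_dtsg" value="AQCYsohu" autocomplete="off" />
"""

# Print in the "name : value" format requested in the question.
for name, value in hidden_inputs(htmlstr).items():
    print(f"{name} : {value}")
```

For the two inputs from the question this prints `post_form_id : d619a1eb3becdc05a3ebea530396782f` and `fb_dtsg : AQCYsohu`. Visible inputs (for example `type="text"`) are skipped, which differs from the list comprehension above that returns every `<input>` tag.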