sam*_*sam 0 python iframe lxml
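I am fetching a page that contains an <iframe>, and I want to get the data inside that <iframe> using lxml.
I have not found any resource that explains how to get at the contents of an <iframe> with lxml.
Can anyone tell me how to do this?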
You can extract the src attribute from the iframe, fetch the data at that URL with urllib, and then parse that data with lxml again.
Sample code (untested):
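import urllib.request
from urllib.parse import urljoin
import lxml.html

# url is assumed to hold the address of the page that contains the iframe
data = urllib.request.urlopen(url)  # fetch the page that contains the iframe
tree = lxml.html.fromstring(data.read())  # parse as HTML to reach the src attribute
src_url = urljoin(url, tree.xpath("//iframe/@src")[0])  # first iframe; resolve a relative src
data = urllib.request.urlopen(src_url)  # open the iframe's src URL
tree = lxml.html.fromstring(data.read())  # parse the iframe document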
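The src-extraction step can also be tried offline, without any network access. The following is only a minimal sketch: the helper name `iframe_sources`, the sample HTML, and the example.com URLs are made up for illustration and are not part of the original answer. It assumes Python 3 with lxml installed, and it resolves each src against the page URL because an iframe's src is often relative.

```python
from urllib.parse import urljoin

import lxml.html


def iframe_sources(page_html, base_url):
    """Return the absolute URL of every <iframe src=...> found in page_html."""
    tree = lxml.html.fromstring(page_html)
    # "//iframe/@src" yields the src value of each iframe that has one
    return [urljoin(base_url, src) for src in tree.xpath("//iframe/@src")]


# Hypothetical page content standing in for a downloaded document
sample = """
<html><body>
  <p>Outer page</p>
  <iframe src="/embedded/frame.html"></iframe>
  <iframe src="http://other.example/widget"></iframe>
</body></html>
"""

sources = iframe_sources(sample, "http://example.com/articles/page.html")
print(sources)
```

Each URL in `sources` could then be fetched and passed to `lxml.html.fromstring` in the same way as the outer page. Note that the iframe's document is never part of the outer page's tree, so it always needs its own request.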