from lxml import html
import requests
url = 'https://www.data.gov/'
r = requests.get(url)
doc = html.fromstring(r.content)
link = doc.xpath('/html/body/header/div[4]/div/div/h4/label/small/a')
print(link)
Run Code Online (Sandbox Code Playgroud)
这不断给我:
[Element a at 0x1c64c963f48]
Run Code Online (Sandbox Code Playgroud)
响应而不是我在页面中寻找的实际数字?知道为什么吗?
另外,为什么我无法获取type(link)值来查看类型?
尝试下面的代码来获取"192,322"输出:
from lxml import html
import requests
url = 'https://www.data.gov/'
r = requests.get(url)
doc = html.fromstring(r.content)
try:
link = doc.xpath('//a[@href="/metrics"]/text()')[0]
print(link.split()[0])
except IndexError:
print("No link found")
Run Code Online (Sandbox Code Playgroud)