用美丽的汤提取href

Question

用美丽的汤提取href

我使用此代码来访问我的链接:

links = soup.find("span", { "class" : "hsmall" })
links.findNextSiblings('a')
for link in links:
  print link['href']
  print link.string

Run Code Online (Sandbox Code Playgroud)

链接没有ID或类或其他什么,它只是一个带有href属性的经典链接.

我的脚本的响应是:

print link['href']
TypeError: string indices must be integers

Run Code Online (Sandbox Code Playgroud)

你能帮助我获得href价值吗？谢谢 !

Answer 1

Chr*_*ett 9

链接仍指你的汤.发现.所以你可以这样做:

links = soup.find("span", { "class" : "hsmall" }).findNextSiblings('a')
for link in links:
    print link['href']
    print link.string

Run Code Online (Sandbox Code Playgroud)

Answer 2

Koo*_*len 4

好的，现在可以使用以下代码：

linkSpan = soup.find("span", { "class" : "hsmall" })
link = [tag.attrMap['href'] for tag in linkSpan.findAll('a', {'href': True})]
for lien in link:
  print "LINK = " + lien`

Run Code Online (Sandbox Code Playgroud)

归档时间：	14 年，3 月前
查看次数：	9963 次
最近记录：	14 年，3 月前