现在尝试使用 BeautifulSoup 和 Python 3 从类中提取“href”。

Question

现在尝试使用 BeautifulSoup 和 Python 3 从类中提取“href”。

我似乎无法让它发挥作用。我有我的脚本转到一个站点并将数据抓取到我的变量中，但是当我尝试从我得到的特定类中info提取数据时，或者当我尝试各种不同的组合时它不起作用。我哪里搞砸了？当我将它刮到我的信息变量中时，它的内部有一个和。hrefNoneclass='business-name'href

import requests
from bs4 import BeautifulSoup

count = 0
search_terms = "Bars"
location = "New Orleans, LA"
url = "https://www.yellowpages.com/search"
q = {'search_terms': search_terms, 'geo_location_terms': location}
page = requests.get(url, params=q)
url_link = page.url
page_num = str(count)
searched_page = url_link + '&page=' + str(count)
page = requests.get(searched_page)
soup = BeautifulSoup(page.text, 'html.parser')
info = soup.findAll('div', {'class': 'info'})
for each_business in info:
    # This is the spot that is broken. I can't make it work! 
    yp_bus_url = each_business.get('class_','business-name')['href']
    print(yp_bus_url)

Run Code Online (Sandbox Code Playgroud)

Answer 1

von*_*Oak 2

我认为这就是您所需要的：

for each_business in info:
    yp_bus_url = each_business.find('a', {'class': 'business-name'}).get('href')
    print(yp_bus_url)

Run Code Online (Sandbox Code Playgroud)

归档时间：	7 年，9 月前
查看次数：	4880 次
最近记录：	7 年，9 月前