使用 BeautifulSoup 获取 span 标签的值

new*_*111 10 html python parsing beautifulsoup python-3.x

我有许多 facebook 群组,我想了解其成员的数量。一个例子是这个组:https : //www.facebook.com/groups/347805588637627/ 我看过页面上的检查元素,它是这样存储的:

<span id="count_text">9,413 members</span>
Run Code Online (Sandbox Code Playgroud)

我试图从页面中删除“9,413 名成员”。我曾尝试使用 BeautifulSoup,但无法解决。

谢谢

编辑:

from bs4 import BeautifulSoup
import requests

url = "https://www.facebook.com/groups/347805588637627/"
r  = requests.get(url)
data = r.text
soup = BeautifulSoup(data, "html.parser")
span = soup.find("span", id="count_text")
print(span.text)
Run Code Online (Sandbox Code Playgroud)

Hen*_*rik 8

如果页面中有多个 span 标签:

from bs4 import BeautifulSoup
soup = BeautifulSoup(your_html_input, 'html.parser')
span = soup.find("span", id="count_text")
span.text
Run Code Online (Sandbox Code Playgroud)


Bal*_*rol 3

您可以使用text解析的范围的属性:

>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup('<span id="count_text">9,413 members</span>',   'html.parser')
>>> soup.span
<span id="count_text">9,413 members</span> 
>>> soup.span.text
'9,413 members'
Run Code Online (Sandbox Code Playgroud)