new*_*111 10 html python parsing beautifulsoup python-3.x
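I have a number of Facebook groups and I would like to find out how many members each one has. One example is this group: https://www.facebook.com/groups/347805588637627/ I looked at the page with Inspect Element, and the count is stored like this: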
<span id="count_text">9,413 members</span>
I am trying to scrape the "9,413 members" text from the page. I have tried using BeautifulSoup but could not work it out.
Thanks
Edit:
from bs4 import BeautifulSoup
import requests
url = "https://www.facebook.com/groups/347805588637627/"
r = requests.get(url)
data = r.text
soup = BeautifulSoup(data, "html.parser")
span = soup.find("span", id="count_text")
print(span.text)
If there is more than one span tag on the page, select the one you want by its id:
from bs4 import BeautifulSoup
soup = BeautifulSoup(your_html_input, 'html.parser')
span = soup.find("span", id="count_text")
span.text
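One thing worth guarding against: `soup.find` returns `None` when no matching element exists, so `span.text` raises `AttributeError` in that case. That can happen if the HTML the server returns differs from what Inspect Element shows in the browser, which is an assumption about this page rather than something confirmed here. A minimal sketch, where `extract_count_text` is a hypothetical helper name and the HTML strings are made-up samples:

```python
from bs4 import BeautifulSoup


def extract_count_text(html):
    """Return the text of <span id="count_text">, or None if it is absent."""
    soup = BeautifulSoup(html, "html.parser")
    span = soup.find("span", id="count_text")
    if span is None:
        # The element is not in this HTML, so there is nothing to read.
        return None
    return span.get_text(strip=True)


# Sample inputs (invented for illustration, not fetched from Facebook).
with_count = '<div><span>other</span><span id="count_text">9,413 members</span></div>'
without_count = '<div><span>no counter here</span></div>'

print(extract_count_text(with_count))
print(extract_count_text(without_count))
```

If the second call's result is what you get for the real page, printing part of `r.text` should show whether the span is present in the response at all.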
You can use the text attribute of the parsed span:
>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup('<span id="count_text">9,413 members</span>', 'html.parser')
>>> soup.span
<span id="count_text">9,413 members</span>
>>> soup.span.text
'9,413 members'
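Since the goal is the number of members, the extracted string may still need converting to an integer. A small sketch using only the standard library; `parse_member_count` is a hypothetical helper, and it assumes the count is written with comma thousands separators as in the example above:

```python
import re


def parse_member_count(text):
    """Pull the first number out of a string like '9,413 members'."""
    match = re.search(r"\d[\d,]*", text)
    if match is None:
        # No digits in the text at all.
        return None
    # Drop the thousands separators before converting.
    return int(match.group().replace(",", ""))


print(parse_member_count("9,413 members"))  # 9413
```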