Nab*_*ikh 5 html beautifulsoup dataframe web-scraping python-3.x
在这里,我试图从 Python 代码中指定的网站中提取表格。我能够获得 HTML 表,而且我无法使用 Python 转换为数据框。这是代码
# import libraries
import requests
from bs4 import BeautifulSoup
# specify url
url = 'http://my-trade.in/'
# request html
page = requests.get(url)
# Parse html using BeautifulSoup, you can use a different parser like lxml if present
soup = BeautifulSoup(page.content, 'html.parser')
tbl =soup.find("table",{"id":"MainContent_dataGridView1"})Run Code Online (Sandbox Code Playgroud)
小智 10
您可以为此使用 pandas read_html 函数,并记住将您获得的 html 转换为字符串,否则您会遇到一些解析错误。
import requests
from bs4 import BeautifulSoup
import pandas as pd
url = 'http://my-trade.in/'
page = requests.get(url)
soup = BeautifulSoup(page.content, 'html.parser')
tbl = soup.find("table",{"id":"MainContent_dataGridView1"})
data_frame = pd.read_html(str(tbl))[0]
Run Code Online (Sandbox Code Playgroud)