与其直接解析JSON API调用的结果,不如使用python包装器:
import wikipedia
page = wikipedia.page("List_of_works_by_Vincent_van_Gogh")
print page.links
Run Code Online (Sandbox Code Playgroud)
还有其他客户和包装器。
另外,这是使用BeautifulSoupHTML解析器的选项:
>>> from bs4 import BeautifulSoup
>>> url = "http://en.wikipedia.org/wiki/List_of_works_by_Vincent_van_Gogh"
>>> soup = BeautifulSoup(urlopen(url))
>>> table = soup.find('table', class_="wikitable")
>>> for row in table.find_all('tr')[1:]:
... print(row.find_all('td')[1].text)
...
Still Life with Cabbage and Clogs
Crouching Boy with Sickle, Black chalk and watercolor
Woman Sewing, Watercolor
Woman with White Shawl
...
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
2683 次 |
| 最近记录: |