我打算做网络抓取,但我似乎陷入了第一步.
import urllib.request
from bs4 import beautifulSoup
wiki = "https://en.wikipedia.org/wiki/List_of_state_and_union_territory_capitals_in_India"
page = urllib.urlopen(wiki)
soup = BeautifulSoup(page)
print(soup.prettify())
Run Code Online (Sandbox Code Playgroud)
我写这些行只是为了测试,但它显示了一个错误
Traceback (most recent call last):
File "C:/python programs/Web Scraping/wiki.py", line 3, in <module>
from bs4 import beautifulSoup
ModuleNotFoundError: No module named 'bs4'
Process finished with exit code 1
Run Code Online (Sandbox Code Playgroud)
我试图清除它的事情.
1)pip install beautifulsoup4(尝试使用easy_install)
2)检查环境变量中的python路径.我在路径中包含了C:\ python和C:\ python\Scripts.
3)尝试从crummy.com下载Beautiful Soup,然后从`python setup.py install命令安装.
我花了将近一整天的时间来清理它,尝试了几乎所有的解决方案,现在它确实令人沮丧.但如果有人仍想将其标记为重复,您可以自由地进行复制.
有什么我错过了吗?