Reh*_* Yz 3 python web-scraping
I get an error when I run the following code, which I use for web scraping.
from requests_html import HTMLSession
url = 'https://www.beerwulf.com/en-gb/c/beers?segment=Beers&catalogCode=Beer_1'
s = HTMLSession()
r = s.get(url)
r.html.render(sleep=1)
print(r.status_code)
I get the following error:
Traceback (most recent call last):
  File "c:/Python/Python385/web_scraping/task1.py", line 8, in <module>
    r.html.render(sleep=1)
  File "C:\Python\Python385\lib\site-packages\requests_html.py", line 598, in render
    content, result, page = self.session.loop.run_until_complete(self._async_render(url=self.url,
    script=script, sleep=sleep, wait=wait, content=self.html, reload=reload, scrolldown=scrolldown,
    timeout=timeout, keep_page=keep_page))
  File "C:\Python\Python385\lib\asyncio\base_events.py", line 616, in run_until_complete
    return future.result()
  File "C:\Python\Python385\lib\site-packages\requests_html.py", line 512, in _async_render
    await page.goto(url, options={'timeout': int(timeout * 1000)})
  File "C:\Python\Python385\lib\site-packages\pyppeteer\page.py", line 885, in goto
    raise error
pyppeteer.errors.TimeoutError: Navigation Timeout Exceeded: 8000 ms exceeded.
Pra*_*tha 13
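The default timeout for r.html.render() is 8 seconds. If the page has not finished loading its JavaScript within 8 seconds, you get this error. You can fix it by raising the timeout above the default, for example to 20 seconds, with the following code: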
r.html.render(timeout=20)
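If a single fixed value feels like guesswork, one option is to retry the render with progressively longer timeouts. The following is only a minimal sketch: render_with_retry and fetch_rendered are hypothetical helper names, the timeout values (8, 20, 40) are arbitrary, and it assumes the timeout surfaces as pyppeteer.errors.TimeoutError, as in the traceback from the question. Other versions of requests_html may raise a different exception, so adjust retry_on accordingly.

```python
def render_with_retry(render, timeouts=(8, 20, 40), retry_on=(Exception,), **kwargs):
    """Call render(timeout=t, **kwargs) for each t until one call succeeds.

    Hypothetical helper, not part of requests_html.
    Returns the timeout (in seconds) that worked.
    """
    if not timeouts:
        raise ValueError("timeouts must contain at least one value")
    last_error = None
    for t in timeouts:
        try:
            render(timeout=t, **kwargs)
            return t
        except retry_on as exc:
            # Remember the failure and try again with the next, longer timeout.
            last_error = exc
    # Every attempt failed: re-raise the most recent error.
    raise last_error


def fetch_rendered(url):
    """Fetch a page and render its JavaScript, retrying on timeout."""
    # Imported here so the helper above stays usable without these packages.
    from requests_html import HTMLSession
    from pyppeteer.errors import TimeoutError as RenderTimeout  # assumed exception type

    session = HTMLSession()
    r = session.get(url)
    render_with_retry(r.html.render, retry_on=(RenderTimeout,), sleep=1)
    return r
```

Calling fetch_rendered(url) with the URL from the question would then try 8, 20 and 40 seconds in turn before giving up.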