Mat*_*ias 3 proxy scrapy deferred scrapyd proxytunnel
I am rotating proxies in Scrapy, and sometimes a proxy fails with an error, which stops my spider.
This is the code that would need to be modified: https://github.com/scrapy/scrapy/blob/master/scrapy/core/downloader/handlers/http11.py
小智 5
Like this:
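# Scrapy 1.0 and later: scrapy.downloadermiddlewares.retry
from scrapy.contrib.downloadermiddleware.retry import RetryMiddleware as BaseRetryMiddleware
from scrapy.core.downloader.handlers.http11 import TunnelError

class RetryMiddleware(BaseRetryMiddleware):
    def process_exception(self, request, exception, spider):
        # Retry on TunnelError (failed proxy tunnel) as well as the default exceptions.
        retryable = (isinstance(exception, self.EXCEPTIONS_TO_RETRY)
                     or isinstance(exception, TunnelError))
        if retryable and 'dont_retry' not in request.meta:
            return self._retry(request, exception, spider)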
In settings.py:
DOWNLOADER_MIDDLEWARES = {
    'myproject.myretry.RetryMiddleware': 200,
    'scrapy.contrib.downloadermiddleware.retry.RetryMiddleware': None,
}
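To see the retry condition on its own, here is a minimal sketch that runs without Scrapy. The exception classes and the `should_retry` helper are hypothetical stand-ins I made up for illustration; they are not part of Scrapy. The check mirrors the middleware above: retry when the exception is one of the retryable types (including the tunnel error) and the request meta does not contain 'dont_retry'.

```python
# Hypothetical stand-ins for illustration only; not Scrapy classes.
class FakeTimeoutError(Exception):
    """Stands in for one of the default retryable exceptions."""


class FakeTunnelError(Exception):
    """Stands in for TunnelError from scrapy.core.downloader.handlers.http11."""


# Assumed default set, playing the role of RetryMiddleware.EXCEPTIONS_TO_RETRY.
EXCEPTIONS_TO_RETRY = (FakeTimeoutError,)


def should_retry(exception, meta):
    """Return True if the exception is retryable and the request did not opt out."""
    retryable = (isinstance(exception, EXCEPTIONS_TO_RETRY)
                 or isinstance(exception, FakeTunnelError))
    return retryable and 'dont_retry' not in meta


if __name__ == '__main__':
    print(should_retry(FakeTunnelError(), {}))
    print(should_retry(FakeTunnelError(), {'dont_retry': True}))
    print(should_retry(ValueError(), {}))
```

Note that, as in the middleware above, the mere presence of the 'dont_retry' key disables the retry, whatever its value.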