Twisted CRITICAL "Unhandled error in Deferred" in the Scrapy tutorial

Vin*_*tro 5 scrapy python-2.7 scrapy-spider

I'm new to programming and I'm trying to learn Scrapy by following the Scrapy tutorial: http://doc.scrapy.org/en/latest/intro/tutorial.html

I ran the "scrapy crawl dmoz" command and got this error:

2015-07-14 16:11:02 [scrapy] INFO: Scrapy 1.0.1 started (bot: tutorial)
2015-07-14 16:11:02 [scrapy] INFO: Optional features available: ssl, http11
2015-07-14 16:11:02 [scrapy] INFO: Overridden settings: {'NEWSPIDER_MODULE': 'tutorial.spiders', 'SPIDER_MODULES': ['tutorial.spiders'], 'BOT_NAME': 'tutorial'}
2015-07-14 16:11:05 [scrapy] INFO: Enabled extensions: CloseSpider, TelnetConsole, LogStats, CoreStats, SpiderState
Unhandled error in Deferred:
2015-07-14 16:11:06 [twisted] CRITICAL: Unhandled error in Deferred:
2015-07-14 16:11:07 [twisted] CRITICAL:

I'm using Windows 7 and Python 2.7. Does anyone know what the problem is, and how I can fix it?

Edit: my spider file code is:

# This package will contain the spiders of your Scrapy project
#
# Please refer to the documentation for information on how to create and manage
# your spiders.
import scrapy


class DmozSpider(scrapy.Spider):
    name = "dmoz"
    allowed_domains = ["dmoz.org"]
    start_urls = [
        "http://www.dmoz.org/computers/programming/languages/python/books/",
        "http://www.dmoz.org/computer/programming/languages/python/resources/"
    ]

    def parse(self, response):
        filename = response.url.split("/")[-2] + '.html'
        with open(filename,'wb') as f:
            f.write(response.body)

items.py code:

import scrapy

class DmozItem(scrapy.Item):
    title = scrapy.Field()
    link = scrapy.Field()
    desc = scrapy.Field()

pip list:

  • bootstrap-admin(0.3.3)
  • cffi(1.1.2)
  • characteristic(14.3.0)
  • cryptography(0.9.3)
  • cssselect(0.9.1)
  • Django(1.7.7)
  • django-auth-ldap(1.2.4)
  • django-debug-toolbar(1.3.0)
  • django-mssql(1.6.2)
  • django-pyodbc(0.2.6)
  • django-pyodbc-azure(1.2.2)
  • django-redator(0.2.3)
  • django-reversion(1.8.5)
  • django-summernote(0.6.0)
  • django-windows-tools(0.1.1)
  • django-wysiwyg-redactor(0.4.3.2)
  • enum34(1.0.4)
  • ez-setup(0.9)
  • flup(1.0.2)
  • idna(2.0)
  • ipaddress(1.0.13)
  • iso8601(0.1.4)
  • logging(0.4.9.6)
  • lxml(3.4.4)
  • mechanize(0.2.5)
  • MySQL-python(1.2.4)
  • pbr(0.10.8)
  • Pillow(2.7.0)
  • pip(7.1.0)
  • pyasn1(0.1.8)
  • pyasn1-modules(0.0.6)
  • pycparser(2.14)
  • pymongo(2.6)
  • pyodbc(3.0.7)
  • pyOpenSSL(0.15.1)
  • pypm(1.4.3)
  • python-ldap(2.4.18)
  • pythonselect(1.3)
  • pywin32(218.3)
  • queuelib(1.2.2)
  • Scrapy(1.0.1)
  • selenium(2.44.0)
  • service-identity(14.0.0)
  • setuptools(18.0.1)
  • six(1.9.0)
  • sqlparse(0.1.15)
  • stevedore(1.3.0)
  • Twisted(15.2.1)
  • virtualenv(1.11.6)
  • virtualenv-clone(0.2.5)
  • virtualenvwrapper(4.3.2)
  • virtualenvwrapper-powershell(12.7.8)
  • w3lib(1.11.0)
  • xlrd(0.9.2)
  • zope.interface(4.1.2)

Thanks for your attention, and sorry for my poor English; it is not my native language.

小智 2

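I'm also just starting to learn Scrapy, and I ran into the same problem. After struggling with it for an afternoon, I finally found that the pywin32 module had only been downloaded, not actually installed. You can try running the following command in cmd to finish installing pywin32, then try the crawl again: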

python python27\scripts\pywin32_postinstall.py -install
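To check whether the post-install step actually worked, a quick import test can help. This is only a minimal sketch: `check_modules` is a hypothetical helper name, and the choice of `win32api` and `pywintypes` as the modules to probe is my assumption (both ship with pywin32). It should run on Python 2.7 as well as Python 3.

```python
import importlib


def check_modules(names):
    """Try to import each module; map each name to None (OK) or the error text."""
    results = {}
    for name in names:
        try:
            importlib.import_module(name)
            results[name] = None
        except ImportError as exc:
            # A missing package and a failed DLL load both surface as ImportError.
            results[name] = str(exc)
    return results


if __name__ == "__main__":
    # Assumed probe list: two pywin32 modules plus Twisted itself.
    report = check_modules(["win32api", "pywintypes", "twisted"])
    for name, error in sorted(report.items()):
        status = "OK" if error is None else "FAILED: " + error
        print("%-12s %s" % (name, status))
```

If `win32api` or `pywintypes` shows FAILED, pywin32 is probably still not installed correctly for the Python interpreter you are using to run Scrapy.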

I hope it helps!