公司电脑,加域,win10系统,当采集过程中重试次数多时,采集一部分数据后会一直重试,无法继续,原因不明。
与代理可用性无关,相同脚本在centos7下运行无此问题。
例如:
2018-04-25 08:44:42 [scrapy.downloadermiddlewares.retry] DEBUG: Retrying <GET https://www.autozi.com/goods/search.html?carModelId=1711281226033381&_=1524472973719&categoryId=148000000000000&categoryLevel=1> (failed 3 times): User timeout caused connection failure: Getting https://www.autozi.com/goods/search.html?carModelId=1711281226033381&_=1524472973719&categoryId=148000000000000&categoryLevel=1 took longer than 20.0 seconds..
2018-04-25 08:44:42 [scrapy.downloadermiddlewares.retry] DEBUG: Retrying <GET https://www.autozi.com/goods/search.html?carModelId=1711281226033381&_=1524472973719&categoryId=144000000000000&categoryLevel=1> (failed 3 times): User timeout caused connection failure: Getting https://www.autozi.com/goods/search.html?carModelId=1711281226033381&_=1524472973719&categoryId=144000000000000&categoryLevel=1 took longer than 20.0 seconds..
你的IP应该被封了吧