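The response is 200, there is no redirect, and the log looks completely normal, but a different page was scraped.
What is going on? The crawl speed is already set very low. I am using scrapy-redis; could that be related?
Here is the log: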
2020-06-01 17:09:52 [scrapy.core.engine] DEBUG: Crawled (200) <GET https://www.lagou.com/utrack/trackMid.html?f=https%3A%2F%2Fwww.lagou.com%2Fjobs%2F7226747.html%3Fshow%3Dc05f75f0d8e64ff5b4d05d9f2d8e3989&t=1591002590&_ti=1> (referer: https://www.lagou.com/zhaopin/PHP/)
Here is the scraped content:
b'<html><head><meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"><meta name="renderer" content="webkit"><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body><script src="/utrack/track.js?version=1.0.1.0" type="text/javascript"></script><script type="text/javascript" src="https://www.lagou.com/utrack/trackMid.js?version=1.0.0.3&t=1591002594"></script><input type="hidden" id="KEY" value="dR0fWCWdq7YXHQnturqsROc3FhxTTSNIe1HCX5cSXP"/><script type="text/javascript">HWPKEQnw();</script>\xe9\xa1\xb5\xe9\x9d\xa2\xe5\x8a\xa0\xe8\xbd\xbd\xe4\xb8\xad...<script type="text/javascript" crossorigin="anonymous" src="https://www.lagou.com/upload/oss.js?v=1010"></script></body></html>\n'
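For reference, the URL in the log line is /utrack/trackMid.html with the job page URL carried in the `f` query parameter, and the body is a short JavaScript page whose only visible text is "page loading...". Below is a minimal sketch, not a confirmed fix, for recognizing this intermediate page in a spider callback and recovering the job URL it points to. The helper names `is_track_interstitial` and `original_target` are hypothetical, and the checks assume the page always looks like the one captured above.

```python
from urllib.parse import urlparse, parse_qs


def is_track_interstitial(url: str, body: bytes) -> bool:
    """Return True if the response looks like the trackMid.html page above.

    Assumption: the intermediate page is identified either by its path or by
    the trackMid.js script tag seen in the captured body.
    """
    path = urlparse(url).path
    return path.endswith("/utrack/trackMid.html") or b"/utrack/trackMid.js" in body


def original_target(url: str):
    """Return the URL carried in the `f` query parameter, or None if absent."""
    values = parse_qs(urlparse(url).query).get("f")
    return values[0] if values else None


if __name__ == "__main__":
    # URL copied from the log line in this post.
    logged_url = (
        "https://www.lagou.com/utrack/trackMid.html"
        "?f=https%3A%2F%2Fwww.lagou.com%2Fjobs%2F7226747.html"
        "%3Fshow%3Dc05f75f0d8e64ff5b4d05d9f2d8e3989&t=1591002590&_ti=1"
    )
    print(is_track_interstitial(logged_url, b""))
    print(original_target(logged_url))
```

In a Scrapy callback this could be called with `response.url` and `response.body` to log or skip such responses instead of parsing them as job pages. Whether re-requesting the recovered URL returns the real page is not something the log above can confirm.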