DEBUG: Filtered offsite request to ‘account.cnblogs.com’: <GET https://account.cnblogs.com:443/NewsAjax/GetAjaxNewsInfo?contentId=443>
老师
爬取博客园解析域名出错了 account.cnbolgs.com是登录页面吧 不知道为什么
name = "jobbole"
allowed_domains = ['news.cnblogs.com']
start_urls = ['https://news.cnblogs.com/']
源码写的跟您一样的
带你彻底掌握Scrapy,用Django+Elasticsearch搭建搜索引擎
了解课程