step3: 创建jobbole爬虫

scrapy startproject Redbacktest
cd Redbacktest

创建jobbole爬虫

scrapy genspider jobbole2 blog.jobbole.com

从pycharm中导入后创建main文件

from scrapy.cmdline import execute

import sys
sys.path.append("D:\PycharmProjects\Redbacktest")
execute(['scrapy','crawl','jobbole2'])

调试前修改“君子协议”

ROBOTSTXT_OBEY = False

 

断点调试response是否获取到值

 

 

posted @ 2017-08-29 13:48  daiwenxugo  阅读(155)  评论(0编辑  收藏  举报