Summary: import scrapy # Fetching multi-page data class XihSpider(scrapy.Spider): name = 'xih' # allowed_domains = ['www.xxx.com'] start_urls = ... Read more
posted @ 2021-04-24 22:25 mjth Views (158) Comments (0) Recommended (0)
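The excerpt above is cut off before the multi-page logic appears, so the following is only a minimal sketch of one common approach, not the post's actual code: build each page URL from a template and visit them in order. The URL template and the page range are hypothetical placeholders; the excerpt does not show the real `start_urls`.

```python
from typing import Iterator

# Hypothetical URL template (assumption): the real site and its paging
# scheme are not visible in the truncated excerpt.
PAGE_URL_TEMPLATE = "https://www.xxx.com/list/page_{page}.html"


def page_urls(first_page: int, last_page: int,
              template: str = PAGE_URL_TEMPLATE) -> Iterator[str]:
    """Yield one URL per page, from first_page to last_page inclusive."""
    for page in range(first_page, last_page + 1):
        yield template.format(page=page)


if __name__ == "__main__":
    # Print the URLs that a crawler would request, one per page.
    for url in page_urls(1, 3):
        print(url)
```

Inside a Scrapy spider, each generated URL would typically be handed to `scrapy.Request(url, callback=self.parse)` and yielded from the `parse` method, so the same callback processes every page.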
Summary: import re g='jQuery112409728083357185207_1619268467001({"rc":0,"rt":11,"svr":182993152,"lt":1,"full":1,"data":{"total":1,"diff":[{"f1":2,"f2":3159,"f3" ... Read more
posted @ 2021-04-24 22:18 mjth Views (81) Comments (0) Recommended (0)
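The string in the excerpt above looks like a JSONP response: a JSON object wrapped in a `jQuery...(` callback call. As a hedged sketch of how such a string might be unwrapped with `re` and `json` (the post's own code is truncated, so this is not necessarily the author's method), the helper `strip_jsonp` below is a hypothetical name, and the sample payload is a shortened, artificially closed version of the excerpt rather than the real data.

```python
import json
import re


def strip_jsonp(text: str) -> dict:
    """Return the JSON payload wrapped inside a JSONP callback call."""
    # Capture everything between the first "(" and the final ")",
    # allowing an optional trailing semicolon.
    match = re.search(r'^[^(]*\((.*)\)\s*;?\s*$', text, re.S)
    if match is None:
        raise ValueError("text does not look like a JSONP response")
    return json.loads(match.group(1))


# Hypothetical sample (assumption): only the fields visible in the excerpt,
# closed by hand so that it parses; the real response is longer.
sample = ('jQuery112409728083357185207_1619268467001('
          '{"rc":0,"data":{"total":1,"diff":[{"f1":2,"f2":3159}]}});')

parsed = strip_jsonp(sample)

if __name__ == "__main__":
    print(parsed["data"]["diff"][0]["f2"])
```

Parsing the payload with `json.loads` after removing the wrapper tends to be more robust than pulling individual fields out with regular expressions, since nested objects and arrays are handled by the JSON parser.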
Summary: from urllib import request import urllib.request import json import re, uuid from lxml import etree class peihuaImg(object): def __init__(self): super(peihu ... Read more
posted @ 2021-04-24 17:19 mjth Views (211) Comments (0) Recommended (0)
Summary: Install: pip install scrapy. Create a project: scrapy startproject xxPro, then cd xxPro. Create a spider file in the spiders directory: scrapy genspider spiderName www.xxx.com. Run the project: scrapy crawl spid ... Read more
posted @ 2021-04-24 14:34 mjth Views (439) Comments (0) Recommended (0)