Post archive "February 2019" - oooooolr

https://scrapingclub.com/exercise/detail_cookie/

Summary: settings.py Read more

posted @ 2019-02-27 14:47 oooooolr Views (438) Comments (0) Recommendations (0)

Summary: b'token=P85TRJJ1C7; expires=Wed, 27-Feb-2019 01:54:18 GMT; Max-Age=30; Path=/'] Read more

posted @ 2019-02-27 10:04 oooooolr Views (917) Comments (0) Recommendations (0)
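
The Set-Cookie value quoted in the summary above can be taken apart with the standard library. This is a minimal sketch, assuming the bytes shown are one complete header value; the variable names are illustrative and not taken from the original post.

```python
from http.cookies import SimpleCookie

# Header value copied from the summary above (assumed to be complete).
raw = b"token=P85TRJJ1C7; expires=Wed, 27-Feb-2019 01:54:18 GMT; Max-Age=30; Path=/"

cookie = SimpleCookie()
cookie.load(raw.decode("latin-1"))  # SimpleCookie expects str, not bytes

token = cookie["token"]        # a Morsel: the value plus its attributes
print(token.value)             # the cookie value itself
print(token["max-age"])        # lifetime in seconds, stored as a string
print(token["path"])           # URL path the cookie applies to
```

A Max-Age of 30 means the token is only valid for about half a minute, so a scraper would probably need to send the follow-up request promptly after receiving it.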

Summary: Sauce Labs uses cookies to give you the best online experience. If you continue to use this site, you agree to the use of cookies. Please see our priv Read more

posted @ 2019-02-24 18:30 oooooolr Views (189) Comments (0) Recommendations (0)

CSS selection issues

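Summary: Short Dress $24.99. css('div .col') matches nothing and returns an empty list; css('div[class*=col]') does return the elements, as does xpath('//div//a[contains(@href,"list")]'). Note the double quotes there: it must be "list", not 'list', because the expression is already wrapped in single quotes. response.css('.pagination li:nth-chil... Read more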

posted @ 2019-02-23 19:32 oooooolr Views (182) Comments (0) Recommendations (0)
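
One likely reason for the difference noted in the summary above: `.col` matches only a whole class name, while `[class*=col]` matches any class attribute that contains the substring "col". The sketch below emulates those two matching rules in plain Python rather than running a real CSS engine. The HTML fragment and its class names (`col-lg-4`, `col-md-6`) are assumptions, not the actual markup of the page, and the descendant combinator in `div .col` is ignored for simplicity.

```python
from html.parser import HTMLParser

# Hypothetical markup; the real page's class names may differ.
HTML = '<div class="row"><div class="col-lg-4 col-md-6">Short Dress $24.99</div></div>'


class DivClassCollector(HTMLParser):
    """Collect the raw class attribute of every <div> start tag."""

    def __init__(self):
        super().__init__()
        self.class_values = []

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            self.class_values.append(dict(attrs).get("class") or "")


def matches_class_token(class_value, token):
    # Rule behind `.col`: one whitespace-separated class must equal the token.
    return token in class_value.split()


def matches_class_substring(class_value, fragment):
    # Rule behind `[class*=col]`: the attribute text contains the fragment.
    return fragment in class_value


collector = DivClassCollector()
collector.feed(HTML)

token_hits = [c for c in collector.class_values if matches_class_token(c, "col")]
substring_hits = [c for c in collector.class_values if matches_class_substring(c, "col")]
print(token_hits)
print(substring_hits)
```

Under these assumptions the token rule finds nothing, which would explain the empty list, while the substring rule finds the grid column.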

Extracting data from JS

Summary: You can use the Selector's built-in support for regular expressions through re(): Demo (using scrapy shell): https://stackoverflow.com/questions/29163 Read more

posted @ 2019-02-21 12:35 oooooolr Views (757) Comments (0) Recommendations (0)
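
The regular-expression approach mentioned in the summary above can be tried outside Scrapy with the standard `re` module. This is a sketch under stated assumptions: the script text, the variable name `product`, and its fields are made up for illustration. In a Scrapy shell, a pattern of the same kind would be passed to the selector's `re()` or `re_first()` method instead of `re.search`.

```python
import json
import re

# Hypothetical inline script, standing in for the text of a <script> tag.
script_text = 'var product = {"title": "Short Dress", "price": "$24.99"};'

# Capture the object literal assigned to `product`, up to the closing "};".
pattern = re.compile(r"var\s+product\s*=\s*(\{.*?\});", re.DOTALL)

match = pattern.search(script_text)
data = json.loads(match.group(1)) if match else {}

print(data.get("title"))
print(data.get("price"))
```

This only works when the captured literal happens to be valid JSON; a JavaScript object with unquoted keys or trailing commas would need a different parser.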

Searching for JSON content with F12

Summary: Read more

posted @ 2019-02-21 11:19 oooooolr Views (1158) Comments (0) Recommendations (0)

materials

Summary: http://interactivepython.org/runestone/static/pythonds/index.html https://blog.michaelyin.info/scrapy-exercises-make-you-prepared-for-web-scraping-cha Read more

posted @ 2019-02-21 09:00 oooooolr Views (302) Comments (0) Recommendations (0)

XPath, CSS

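Summary: https://docs.scrapy.org/en/latest/intro/tutorial.html XPath: @ selects an attribute; . selects relative to the current node; // selects at any depth; /bookstore/book[position()<3] selects the first two book elements that are children of the bookstore element. Read more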

posted @ 2019-02-13 20:32 oooooolr Views (120) Comments (0) Recommendations (0)
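
The XPath notes in the summary above can be tried with the standard library's `xml.etree.ElementTree`, which implements only a subset of XPath. In particular it does not accept `position()<3`, so the sketch below emulates that predicate with a list slice. Scrapy selectors use a full XPath engine and can run the original expression directly. The sample document and its `category` attribute are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document.
XML = """
<bookstore>
  <book category="web"><title>A</title></book>
  <book category="cooking"><title>B</title></book>
  <book category="web"><title>C</title></book>
</bookstore>
""".strip()

root = ET.fromstring(XML)  # root is the <bookstore> element

# `.` is the current node and `//` searches at any depth below it.
all_titles = [t.text for t in root.findall(".//title")]

# `@` refers to an attribute, here inside a predicate.
web_books = root.findall("./book[@category='web']")

# Stand-in for /bookstore/book[position()<3]: the first two <book> children.
first_two = root.findall("./book")[:2]
first_two_titles = [b.findtext("title") for b in first_two]

print(all_titles)
print(len(web_books))
print(first_two_titles)
```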