Summary: Note: pip install pycryptodome Read more
posted @ 2020-02-24 11:01 jswf Views (439) Comments (0) Recommendations (0)
Summary: from lxml import etree import requests from urllib import request import time import os from queue import Queue import threading import re from multip Read more
posted @ 2020-02-23 15:43 jswf Views (659) Comments (0) Recommendations (0)
Summary: Reference: https://blog.csdn.net/weixin_43430036/article/details/84871624 # -*- coding: utf-8 -*- from urllib import request import scrapy import json from sel Read more
posted @ 2020-02-23 12:31 jswf Views (444) Comments (0) Recommendations (0)
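Summary: chrome.exe --remote-debugging-port=9222 --user-data-dir="C:\selenum\AutomationProfile" Paste this command into the command line to open a browser listening on port 9222, and do not close it. (Add Chrome's install directory to the PATH environment variable first; otherwise the chrome.exe command will not be found.) chr Read more
posted @ 2020-02-21 17:00 jswf Views (446) Comments (0) Recommendations (0)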
Summary: from scrapy import signals import random class Test001UseragentMiddleware(object): USER_AGENT=[ "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.1 Read more
posted @ 2020-02-21 11:23 jswf Views (363) Comments (0) Recommendations (0)
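The excerpt above is cut off, so the full middleware is not shown here. As a rough sketch of the general idea (a Scrapy downloader middleware that picks a random User-Agent for each request), something like the following could work. The class name `RandomUserAgentMiddleware`, the placeholder User-Agent strings, and the `FakeRequest` stand-in are all assumptions made for illustration and are not taken from the original post.

```python
import random


class RandomUserAgentMiddleware:
    """Sketch of a downloader middleware that rotates the User-Agent header."""

    # Placeholder strings (assumption): replace with real browser User-Agents.
    USER_AGENTS = [
        "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0",
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) ExampleBrowser/2.0",
        "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15) ExampleBrowser/3.0",
    ]

    def process_request(self, request, spider):
        # Overwrite the header before the request is downloaded.
        request.headers["User-Agent"] = random.choice(self.USER_AGENTS)
        # Returning None tells Scrapy to keep processing the request normally.
        return None


class FakeRequest:
    """Hypothetical stand-in for scrapy.Request so the sketch runs without Scrapy."""

    def __init__(self):
        self.headers = {}


if __name__ == "__main__":
    req = FakeRequest()
    RandomUserAgentMiddleware().process_request(req, spider=None)
    print(req.headers["User-Agent"])
```

In a real project a class like this would be enabled through the `DOWNLOADER_MIDDLEWARES` setting in `settings.py`, keyed by the class's import path.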
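Summary: Set and read a key: set [key] [value] get [key] List operations: lpush [key] [value] pushes a value onto the left end of the list; rpush [key] [value] pushes a value onto the right end of the list; lrange [key] [start] [end] traverses the list from the left, from the start position to the end position Read more
posted @ 2020-02-19 20:32 jswf Views (185) Comments (0) Recommendations (0)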
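To make the semantics of the commands listed above concrete, here is a small in-memory model written in plain Python. It is only an illustration and does not talk to a Redis server; the class name `ToyStore` is hypothetical. It mirrors two details of Redis behaviour: `lpush` puts the newest value at the head of the list, and `lrange` treats both indices as inclusive, with negative indices counting from the tail.

```python
from collections import deque


class ToyStore:
    """Hypothetical in-memory model of a few Redis commands (not a Redis client)."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        # A missing key yields None, similar to Redis returning nil.
        return self._data.get(key)

    def lpush(self, key, value):
        # Insert at the head (left side); return the new list length.
        items = self._data.setdefault(key, deque())
        items.appendleft(value)
        return len(items)

    def rpush(self, key, value):
        # Insert at the tail (right side); return the new list length.
        items = self._data.setdefault(key, deque())
        items.append(value)
        return len(items)

    def lrange(self, key, start, end):
        # Both indices are inclusive; negative indices count from the tail.
        items = list(self._data.get(key, ()))
        n = len(items)
        if start < 0:
            start = max(n + start, 0)
        if end < 0:
            end = n + end
        if end < 0:
            return []
        return items[start:end + 1]


if __name__ == "__main__":
    store = ToyStore()
    store.lpush("queue", "a")
    store.lpush("queue", "b")
    store.rpush("queue", "c")
    print(store.lrange("queue", 0, -1))
```

Using `0` and `-1` as the range, as in the last line, is the usual way to read a whole list with `lrange`.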
Summary: Reference: https://blog.csdn.net/weixin_43527241/article/details/88723408 Common commands: redis-server redis.windows.conf starts the server; redis-server --service-install redis.windo Read more
posted @ 2020-02-19 18:11 jswf Views (136) Comments (0) Recommendations (0)
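Summary: Commands. Create a project: scrapy startproject [project name] You can start your first spider with: cd jxnsh scrapy genspider example example.com Generate the spider file: first change to the project directory, then, under normal circumstances, run scrapy Read more
posted @ 2020-02-18 22:03 jswf Views (716) Comments (0) Recommendations (0)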
Summary: First version # -*- coding: utf-8 -*- import scrapy import requests from lxml import etree from selenium import webdriver from scrapy.http.response.html import H Read more
posted @ 2020-02-17 20:57 jswf Views (214) Comments (0) Recommendations (0)
Summary: # -*- coding: utf-8 -*- import scrapy import requests from lxml import etree from selenium import webdriver from scrapy.http.response.html Read more
posted @ 2020-02-17 16:33 jswf Views (298) Comments (0) Recommendations (0)