2020 年 3月 11 日随笔档案 - 共感的艺术

2020年3月11日

摘要： selenium模拟登录赶集网，未解决验证码问题，但是可以在下方设置time.sleep()，时间长一点手动操作验证码，实现成功登录： import selenium import selenium.webdriver import selenium.webdriver.common.keys im 阅读全文

posted @ 2020-03-11 11:14 共感的艺术阅读(505) 评论(0) 推荐(0)

selenium模拟登录京东，手动解决验证码问题，抓取购物车价格

摘要： selenium模拟登录京东，未解决验证码问题，但是可以在下方设置time.sleep()，时间长一点手动操作验证码，实现成功登录,并抓取了购物车价格： import selenium import selenium.webdriver import selenium.webdriver.commo 阅读全文

posted @ 2020-03-11 11:13 共感的艺术阅读(728) 评论(0) 推荐(0)

selenium模拟登录QQ空间,手动解决验证码问题

摘要： selenium模拟登录QQ空间,未解决验证码问题，但是可以在下方设置time.sleep()，时间长一点手动操作验证码，实现成功登录： #coding:utf-8 import selenium import selenium.webdriver import time #QQ空间现在登录需要验阅读全文

posted @ 2020-03-11 11:12 共感的艺术阅读(527) 评论(0) 推荐(0)

selenium无界面浏览器，访问百度搜索为例

摘要： selenium无界面浏览器，访问百度搜索，输入关键词，打印快照： import selenium import selenium.webdriver import selenium.webdriver.common.keys import time driver = selenium.webdri 阅读全文

posted @ 2020-03-11 11:10 共感的艺术阅读(172) 评论(0) 推荐(0)

selenium无界面浏览器

摘要： selenium无界面浏览器,需要用到PhantomJS： import selenium import selenium.webdriver import time #phantomjs.exe 路径需添加系统环境变量 executable_path为环境变量地址 driver=selenium. 阅读全文

posted @ 2020-03-11 11:09 共感的艺术阅读(613) 评论(0) 推荐(0)

selenium验证码的解决办法

摘要：转载自https://www.cnblogs.com/wuzhiyi/p/6020967.html selenium验证码的解决办法：在做web自动化登录时，每当遇到验证码有几种解决方式： 1，设置万能验证码 2，通过pytesser破解图片 3，通过代码控制等待手动输入验证(附代码) 下面就是第阅读全文

posted @ 2020-03-11 11:07 共感的艺术阅读(552) 评论(0) 推荐(0)

selenium简单识别验证码

摘要： selenium简单识别验证码，识别验证码图片，不太精准，原理上是这样： import subprocess #验证png图片不报错，可以识别图片为文字，但是不精准，jpg也可以识别，但是会报错，也会生出txt文档 #第一个是安装的Tesseract-OCR的路径，第二个是验证码的图片的路径 p=s 阅读全文

posted @ 2020-03-11 10:56 共感的艺术阅读(234) 评论(0) 推荐(0)

selenium暴力破解密码，正确密码终止程序

摘要： selenium暴力破解密码，实现正确密码终止程序，打印显示正确密码： #coding:utf-8 import selenium import selenium.webdriver import time #测试暴力破解登录地址用户名密码需要填写 def loginoa(username,p 阅读全文

posted @ 2020-03-11 10:55 共感的艺术阅读(792) 评论(0) 推荐(0)

selenium暴力破解密码，测试帝国后台

摘要： selenium暴力破解密码，实测帝国后台，在破解过程中，密码库丰富的话，在出现正确密码的时候，会终止程序，并且打印显示正确密码，但是帝国后台有限制用户名密码输入登录次数，可通过定时time.sleep(),找到临界点进行破解： #coding:utf-8 import selenium impor 阅读全文

posted @ 2020-03-11 10:53 共感的艺术阅读(568) 评论(0) 推荐(0)

selenium模拟web登录，测试帝国后台成功登录

摘要： selenium模拟web登录，测试过程中，已实现帝国后台的成功登录： #coding:utf-8 import selenium import selenium.webdriver import time #测试网站，帝国后台已成功登录 def loginoa(username,password 阅读全文

posted @ 2020-03-11 10:49 共感的艺术阅读(456) 评论(0) 推荐(0)

selenium的web浏览器

摘要： selenium的web浏览器：把相应的driver下载好后，需要放到python安装的目录下，不放上的话需要配置环境变量，并在浏览器括号内填写executable_path="driver安装路径"。放在python路径下不用填写 import selenium import selenium.w 阅读全文

posted @ 2020-03-11 10:47 共感的艺术阅读(146) 评论(0) 推荐(0)

BS4提取股票信息

摘要： BS4提取股票信息： # encoding:utf-8 import urllib import urllib.request from bs4 import BeautifulSoup def download(url): headers={"User-Agent":"Mozilla/5.0 (c 阅读全文

posted @ 2020-03-11 10:42 共感的艺术阅读(116) 评论(0) 推荐(0)

BS4CCS选择

摘要： BS4CCS选择： #coding:utf-8 from bs4 import BeautifulSoup import re html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="tit 阅读全文

posted @ 2020-03-11 10:41 共感的艺术阅读(111) 评论(0) 推荐(0)

BS4搜索

摘要： BS4搜索： #coding:utf-8 from bs4 import BeautifulSoup import re html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="title" 阅读全文

posted @ 2020-03-11 10:40 共感的艺术阅读(76) 评论(0) 推荐(0)

BS4遍历文档

摘要： BS4遍历文档： #coding:utf-8 from bs4 import BeautifulSoup html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="title" name="d 阅读全文

posted @ 2020-03-11 10:39 共感的艺术阅读(215) 评论(0) 推荐(0)

BS4初级其它类型

摘要： BS4初级其它类型： #coding:utf-8 from bs4 import BeautifulSoup html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="title" name= 阅读全文

posted @ 2020-03-11 10:38 共感的艺术阅读(72) 评论(0) 推荐(0)

BS4初级

摘要： BS4初级： #coding:utf-8 from bs4 import BeautifulSoup html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="title" name="dro 阅读全文

posted @ 2020-03-11 10:37 共感的艺术阅读(76) 评论(0) 推荐(0)

BS4

摘要： BS4: #coding:utf-8 from bs4 import BeautifulSoup #导入函数 html = """ <html><head><title>The Dormouse's story</title></head> <body> <p class="title" name= 阅读全文

posted @ 2020-03-11 10:35 共感的艺术阅读(116) 评论(0) 推荐(0)

运用python3中的urllib爬取贴吧的图片

摘要：运用python3中的urllib爬取贴吧的图片： import urllib import urllib.request import lxml import lxml.etree import re from urllib import parse #抓取贴吧页面数量信息 def gettieb 阅读全文

posted @ 2020-03-11 10:31 共感的艺术阅读(454) 评论(0) 推荐(0)