09 2019 档案
摘要:import requestsfrom bs4 import BeautifulSoupimport pandas as pdfrom pandas import DataFrame url='https://search.51job.com/list/120300,000000,0000,32,9
阅读全文
摘要:from bs4 import BeautifulSoup text='''<?xml version="1.0" encoding="ISO-8859-1"?><bookstore><book><title lang='eng'>Harry Potter</title><price>29.9</p
阅读全文
摘要:import requestsimport re #获得本要IP url='http://www.baidu.com/s?ie=utf-8&f=8&rsv_bp=1&rsv_idx=1&tn=baidu&wd=ip' res=requests.get(url)res.encoding='utf-8'
阅读全文
摘要:import requestsfrom lxml import etreeimport randomfrom fake_useragent import UserAgent ua=UserAgent()uas=[]for i in range(5): uas.append(ua.random) #生
阅读全文
摘要:import requestsfrom lxml import etree url='https://ie.icoa.cn/'head={'user-agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like
阅读全文
摘要:import requestsimport re url='http://tieba.baidu.com/photo/g/bw/picture/list?kw=%E6%9D%A8%E6%B4%8B&alt=jview&rn=200&tid=4748284434&pn=1&ps=1&pe=40&inf
阅读全文
摘要:\d[{n},{n,},{n,m}] 匹配十进制数字 n次,最少n次,最少n次最多m次 \D 匹配非十进制数字 [...] 表示一组字符,匹配里面任一字符 [^...]不在里面的任一字符 +匹配前面的子表达式; \s 空白字符; \S 除空白字符 (?:pattern)匹配但不取结果; ^ 表示开始
阅读全文
摘要:import requestsimport re url='https://list.jd.com/list.html?cat=9987,653,655'res=requests.get(url)image_pat='<img width="220" height="220" data-img="1
阅读全文
摘要:import requestsfrom lxml import etreefrom pandas import DataFrame url='https://search.51job.com/list/120800,000000,0000,32,9,99,%25E4%25BA%25A7%25E5%2
阅读全文
摘要:import requestsfrom lxml import etreeurl='https://www.baidu.com/'r=requests.get(url)r.encoding='utf-8'r.text root=etree.HTML(r.text)root.xpath('/html/
阅读全文
摘要:#图表 import matplotlib.pyplot as plt fig,ax=plt.subplots(2,2)ax[0,1].plot(x,y,'r--*',label='sin')ax[0,1].legend(loc='upper right')ax[0,1].grid()fig.sav
阅读全文

浙公网安备 33010602011771号