python爬虫 - 随笔分类 - Louiszj

python random模块（随机数）

摘要：random.random random.random()用于生成一个0到1的随机符点数: 0 <= n < 1.0 random.uniform random.uniform的函数原型为：random.uniform(a, b)，用于生成一个指定范围内的随机符点数，两个参数其中一个是上限，一个是下阅读全文

posted @ 2018-04-04 10:59 Louiszj 阅读(479) 评论(0) 推荐(0)

python 爬取5566图库图片

摘要：python 爬取5566图库图片 1 import requests 2 import random 3 import re 4 import time 5 import os 6 from bs4 import BeautifulSoup 7 8 9 class GetGirlsPhoto(object): 10 def __init__(s... 阅读全文

posted @ 2018-04-04 10:54 Louiszj 阅读(909) 评论(1) 推荐(0)

[转][python] 常用正则表达式爬取网页信息及分析HTML标签总结

摘要：转载至:https://blog.csdn.net/Eastmount/article/details/51082253 这篇文章主要是介绍Python爬取网页信息时，经常使用的正则表达式及方法。它是一篇总结性文章，实用性比较大，主要解决自己遇到的爬虫问题，也希望对你有所帮助~当然如果会Seleni 阅读全文

posted @ 2018-04-02 12:48 Louiszj 阅读(1496) 评论(0) 推荐(0)

[转]python爬虫实例项目大全

摘要：WechatSogou [1]- 微信公众号爬虫。基于搜狗微信搜索的微信公众号爬虫接口，可以扩展成基于搜狗搜索的爬虫，返回结果是列表，每一项均是公众号具体信息字典。【2018年3月28日完成】DouBanSpider [2]- 豆瓣读书爬虫。可以爬下豆瓣读书标签下的所有图书，按评分排名依次存储，阅读全文

posted @ 2018-03-28 14:13 Louiszj 阅读(373) 评论(0) 推荐(0)

Python 爬取笔趣看小说

摘要：# -*- coding:utf-8 -*- from bs4 import BeautifulSoup import requests import sys class DownLoader(object): def __init__(self): self.server = 'http://www.biqukan.com/' self.target... 阅读全文

posted @ 2018-03-28 12:23 Louiszj 阅读(1008) 评论(0) 推荐(0)

python 爬取豆瓣图书

摘要：#!-*-coding:utf-8-*- import requests import xlwt from bs4 import BeautifulSoup from collections import OrderedDict class DouBanBookSpider(object): def __init__(self, book_type, quantity): ... 阅读全文

posted @ 2018-03-28 12:20 Louiszj 阅读(643) 评论(0) 推荐(0)

Beautiful Soup 4.4.0 文档

摘要：http://beautifulsoup.readthedocs.io/zh_CN/latest/ 阅读全文

posted @ 2018-03-26 14:20 Louiszj 阅读(209) 评论(0) 推荐(0)

python requests模块

摘要：安装 Requests 在你已经安装好python的前提下：如果你没有安装 pip （啧啧），这个 Python installation guide 可以带你完成这一流程。获得源码 Requests 一直在 Github 上积极地开发，你可以一直从这里获取到代码。你可以克隆公共版本库：发送阅读全文

posted @ 2018-03-23 16:44 Louiszj 阅读(310) 评论(0) 推荐(0)

Louiszj

随笔分类 - python爬虫