Python爬虫基础示例

使用pip安装相关依赖:

pip install requests
pip install bs4

安装成功提示:Successfully installed *...


 

爬取中国天气网数据示例代码:

#-*- coding:utf-8 -*-

import requests
from bs4 import BeautifulSoup


if __name__ == '__main__':
    url = 'http://lishi.tianqi.com/wuhan/201806.html'
    r = requests.get(url)

    soup = BeautifulSoup(r.text, 'html.parser')
    weather_div = soup.select('div[class="tqtongji2"]')[0]
    ul_list = weather_div.select('ul')
    for ul in ul_list:
        li_list = ul.select('li')
        print("日期:%s\n最高气温:%s\n最低气温:%s\n天气:%s\n风向:%s\n风力:%s\n" %
              (li_list[0].string.encode('utf-8'),
               li_list[1].string.encode('utf-8'),
               li_list[2].string.encode('utf-8'),
               li_list[3].string.encode('utf-8'),
               li_list[4].string.encode('utf-8'),
               li_list[5].string.encode('utf-8'))
              )

 

执行代码结果:

posted @ 2018-08-02 15:11  BretGui  阅读(229)  评论(0编辑  收藏  举报