Scraping the Top Ten Weibo Hot Searches

1. This code needs the requests and BeautifulSoup (bs4) libraries, plus lxml, which it uses as the HTML parser.

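2. Open the Weibo hot search page and press F12 to inspect the source (as shown in the figure, the elements we need are the ones with class="td-02").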
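To make the selector concrete, here is a minimal sketch of how BeautifulSoup picks out cells by class. The HTML fragment is a hypothetical imitation of what the inspector shows, not a copy of the real page, and it uses the built-in "html.parser" so it runs without lxml.

```python
from bs4 import BeautifulSoup

# Hypothetical fragment imitating the rows seen in the F12 inspector.
SAMPLE_HTML = """
<table>
  <tr><td class="td-01">1</td>
      <td class="td-02"><a href="#">Example topic A</a><span>12345</span></td></tr>
  <tr><td class="td-01">2</td>
      <td class="td-02"><a href="#">Example topic B</a><span>6789</span></td></tr>
</table>
"""

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# Keyword form: class_ is used because "class" is a reserved word in Python.
by_keyword = soup.find_all(class_="td-02")
# CSS-selector form: the same cells, restricted to <td> elements.
by_selector = soup.select("td.td-02")

# get_text() on the whole cell joins the title and the count into one string.
cell_texts = [cell.get_text(strip=True) for cell in by_keyword]
print(cell_texts)
```

Because the cell text runs the title and the count together, it is usually more convenient to read the link and the count element separately.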

 

3. The code

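import requests
from bs4 import BeautifulSoup

url = 'https://s.weibo.com/top/summary?cate=realtimehot'
# Browser-like User-Agent; the trailing space keeps the two string pieces from running together
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 6.3; Win64; x64) '
                         'AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'}
response = requests.get(url, headers=headers)
response.encoding = response.apparent_encoding
soup = BeautifulSoup(response.text, 'lxml')

# Each hot search entry sits in a cell with class "td-02"
cells = [c for c in soup.find_all(class_="td-02") if c.a]
# Assumes the title is in the cell's <a> and the search count in its <span> (missing on some rows)
titles = [c.a.get_text(strip=True) for c in cells]
counts = [c.span.get_text(strip=True) if c.span else '' for c in cells]
print('Hot search list')
print('{:25}'.format('Title (search count)'))
# Keep only the first ten entries
for title, count in list(zip(titles, counts))[:10]:
    print('{:20}\t{}\n'.format(title, count))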
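The parsing step can also be wrapped in a function so it can be checked without hitting the network. This is only a sketch: parse_hot_search and its limit parameter are hypothetical names, the cell layout (title in an <a>, count in a <span>) is an assumption about the page, and the sample rows are made up.

```python
from bs4 import BeautifulSoup


def parse_hot_search(html, limit=10):
    """Return up to `limit` (title, count) pairs from td-02 cells (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    cells = soup.find_all("td", class_="td-02")
    if not cells:
        # An empty result often means the response was not the expected page.
        raise ValueError("no td-02 cells found; the page may be a login or error page")
    rows = []
    for cell in cells:
        link = cell.find("a")
        if link is None:
            continue  # skip cells without a title link
        count = cell.find("span")
        rows.append((link.get_text(strip=True),
                     count.get_text(strip=True) if count else ""))
    return rows[:limit]


# Made-up sample with 12 rows, to show that only the first ten are kept.
sample = "<table>" + "".join(
    '<tr><td class="td-02"><a href="#">topic {0}</a><span>{1}</span></td></tr>'.format(i, i * 100)
    for i in range(1, 13)
) + "</table>"

top = parse_hot_search(sample)
for rank, (title, count) in enumerate(top, start=1):
    print(rank, title, count)
```

Raising an error on an empty result makes it easier to notice when the site returns something other than the hot search table, instead of silently printing nothing.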

4. Output

Posted 2020-03-20 14:17 by big憨