python获取script里的内容

import requests
from bs4 import BeautifulSoup

url1 = "https://mip.keoaeic.org/journal_skills/6616.html"
html = requests.get(url1).content
html=html.decode('utf-8') # python3
soup = BeautifulSoup(html, "html.parser")
a = soup.select('script[type="application/ld+json"]') 
#查找<script[type="application/ld+json"]> 里面的内容,因为这个地址上面有多个相同的,只需要获取对应的下标内容即可。

t = list(a)[0].text
print(r)

  

posted @ 2020-03-16 11:48  ToDarcy  阅读(5170)  评论(0编辑  收藏  举报