随笔分类 - 爬虫采集

BeautifulSoup的一点使用小结(一)

摘要：最近在学习用Python采集，接触到BeautifulSoup。BeautifulSoup是用 Python 写的一个 HTML/XML 的解析器,它可以很好的处理不规范标记并生成剖析树。from BeautifulSoup import BeautifulSoupfrom BeautifulSoup import BeautifulStoneSoup import urllib2url=“xxxxxxxxxxxxxx”content = urllib2.urlopen(url).read()soup = BeautifulSoup(content,fromEncoding="gb1 阅读全文

posted @ 2013-08-30 17:29 流氓耗子阅读(415) 评论(0) 推荐(0)

国境以南太阳以西

随笔分类 - 爬虫采集

公告

国境以南 太阳以西

随笔分类 - 爬虫采集

公告

国境以南太阳以西