webspider

 

webspider.py

python 抓取每日一文文章

import urllib2

# get webpage
headers = {'User-Agent':'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/30.0.1599.101 Safari/537.36'}
fd   = urllib2.Request('http://meiriyiwen.com/',headers = headers)  
data = urllib2.urlopen(fd).read()

# save as a file
f = open('issue.htm', 'w')
f.write(data)
f.close()
posted @ 2014-11-25 10:21  killent  阅读(180)  评论(0)    收藏  举报