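Summary: When crawling a web page with Python while simulating a browser (by sending a browser User-Agent header), the Chinese characters in the returned HTML come out garbled. How can this be fixed?

    url = "http://newhouse.hfhouse.com/"
    req = urllib2.Request(url, headers={"User-Agent": "Mozilla/5.0 (Windows NT 6.1; rv:24.0) Gecko/20100101 Firefox/24.0"})
    reqHtml = urllib2.urlopen(req).read()
    #print reqHtml
    songtasteHtmlEncoding=' ...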
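The excerpt is cut off before the fix, so what follows is only one common approach, not necessarily the author's. It assumes the garbling comes from decoding the raw bytes with the wrong charset (many Chinese sites of that period served GB2312/GBK rather than UTF-8; the excerpt does not confirm this for the site above). The sketch uses Python 3's `urllib.request` in place of the `urllib2` module shown in the excerpt, and the helper names `detect_charset`, `decode_html` and `fetch` are hypothetical.

```python
import re
import urllib.request

# Hypothetical helpers for illustration; none of these names come from the post.
HEADER_CHARSET_RE = re.compile(r'charset\s*=\s*["\']?([A-Za-z0-9_\-]+)', re.IGNORECASE)
META_CHARSET_RE = re.compile(
    rb'<meta[^>]+charset\s*=\s*["\']?\s*([A-Za-z0-9_\-]+)', re.IGNORECASE
)


def detect_charset(raw, content_type=None, default="utf-8"):
    """Guess the charset from the Content-Type header, then from a <meta> tag."""
    if content_type:
        match = HEADER_CHARSET_RE.search(content_type)
        if match:
            return match.group(1).lower()
    # Only scan the start of the document, where <meta> tags normally live.
    match = META_CHARSET_RE.search(raw[:4096])
    if match:
        return match.group(1).decode("ascii").lower()
    return default


def decode_html(raw, content_type=None):
    """Decode raw response bytes into text using the detected charset."""
    charset = detect_charset(raw, content_type)
    # gb18030 is a superset of gb2312 and gbk, so it decodes pages that
    # declare gb2312 but actually contain gbk-only characters.
    if charset in ("gb2312", "gbk"):
        charset = "gb18030"
    try:
        return raw.decode(charset)
    except (LookupError, UnicodeDecodeError):
        # Unknown or wrong declaration: fall back without raising.
        return raw.decode("utf-8", errors="replace")


def fetch(url):
    """Fetch a page with a browser-like User-Agent and return decoded text."""
    req = urllib.request.Request(
        url,
        headers={
            "User-Agent": "Mozilla/5.0 (Windows NT 6.1; rv:24.0) "
                          "Gecko/20100101 Firefox/24.0"
        },
    )
    with urllib.request.urlopen(req) as resp:
        return decode_html(resp.read(), resp.headers.get("Content-Type"))
```

Calling `fetch(url)` with the URL from the question would return a `str` rather than raw bytes. In Python 2 the equivalent step would be `reqHtml.decode(charset)` on the result of `read()`, re-encoding to the terminal's encoding only when printing.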
posted @ 2013-09-29 18:57 Tinan