随笔分类 - spider
about spider
摘要:Introduction What is a WebSpider A WebSpider or crawler is an automated program that follows links on websites and calls a WebRobot to handle the cont...
阅读全文
摘要:1,得到网页上的链接地址: string matchString = @"]+href=\s*(?:'(?[^']+)'|""(?[^""]+)""|(?[^>\s]+))\s*[^>]*>";2,得到网页的标题: string matchString = @"(?.*)";3,去掉网页中...
阅读全文