2019 年 10月 17 日随笔档案 - w_poison

2019年10月17日

摘要：思路是抽取页面所有链接，根据网站host以及一些逻辑分析，剔除掉不必要的网址。计算每个xpath对应的链接数，取其中最大值。代码依赖于jsoup、httpclient 一、抽取网页所有链接并进行一些过滤 1 public static ArrayList<String> getList(String 阅读全文

posted @ 2019-10-17 18:34 w_poison 阅读(233) 评论(0) 推荐(0)

w_poison

公告