Summary:
1. The common Nutch command is 'bin/nutch crawl <urlDir> [-dir d] [-threads n] [-depth i] [-topN N]'. Nutch generates a segment for each depth, and topN means each layer collects up to topN URLs. Generally each layer has a single segment; this depends on maxNumSegments (default value 1) in Generat...
posted @ 2011-08-11 17:41
剑迅