随笔分类 -  data mining

摘要:the following too tutorial are good !!http://abloz.com/2012/07/03/nutch-and-solr-search-enging.htmlhttp://wiki.apache.org/nutch/NutchTutorial 阅读全文
posted @ 2013-01-12 14:22 Aldrich 阅读(136) 评论(0) 推荐(0)
摘要:1. Hadoop move-code-to-data, which is distributed among clusters; while traditional distributed system like SETI@homemove-data-to-code2. Hadoop is usually used to process unstructured data, which makes SQL database not suitable3. SQL optimized for some legacy applications, and may mismatch contempor 阅读全文
posted @ 2012-11-25 11:25 Aldrich 阅读(164) 评论(0) 推荐(0)