随笔分类 -  数据挖掘-Hive

摘要:统计一天微博内“taobao.com”出现次数,每小时记录一次:for i in {00..23}; do hive -e "select count(*) from mds_anti_sass_log where source=3 and type=1 and args['content'] like '%taobao.com%' and dt=20131028 and hour=$i;" >> log.txt; done统计某一小时微博内“taobao.com”出现次数:hive -e "select args[&# 阅读全文
posted @ 2013-10-30 15:47 _Eric.Liu 阅读(858) 评论(0) 推荐(0)