Post category - Big Data Processing

sortquery (repost)
Summary: The following code is written in Python and imports os, os.path, operator, and heapq. It sorts users' queries by frequency in three steps:
1. Hash each query and distribute the queries across 10 files (hash(query) % 10).
2. Count the occurrences of each query within each file using a hash table, then sort each file by count.
3. Merge the sorted files using the heap queue algorithm.
posted @ 2012-11-15 16:46 一个人的天空@ Views (918) Comments (0) Recommendations (0)
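
The three steps listed in the summary can be sketched roughly as follows. This is not the code from the original post, which is cut off in the excerpt, but a minimal illustration under some assumptions: the function names (partition_queries, count_and_sort, merge_sorted_files, sort_queries), the bucket file names, and the tab-separated "count, query" line format are all hypothetical. zlib.crc32 is used in place of the built-in hash() so that bucket assignment is stable across runs, and heapq.merge is used with its key argument, which requires Python 3.5 or later.

```python
import heapq
import os
import tempfile
import zlib
from collections import Counter


def partition_queries(queries, workdir, num_buckets=10):
    """Step 1: write each query to the bucket file chosen by hash % num_buckets."""
    paths = [os.path.join(workdir, "bucket_%d.txt" % i) for i in range(num_buckets)]
    files = [open(p, "w", encoding="utf-8") for p in paths]
    try:
        for query in queries:
            query = query.strip()
            if not query:
                continue
            # Assumption: crc32 stands in for hash(query); identical queries
            # always land in the same bucket, so per-bucket counts are complete.
            index = zlib.crc32(query.encode("utf-8")) % num_buckets
            files[index].write(query + "\n")
    finally:
        for f in files:
            f.close()
    return paths


def count_and_sort(bucket_path):
    """Step 2: count queries in one bucket with a hash table, sort by count descending."""
    with open(bucket_path, encoding="utf-8") as f:
        counts = Counter(line.rstrip("\n") for line in f)
    sorted_path = bucket_path + ".sorted"
    with open(sorted_path, "w", encoding="utf-8") as out:
        # Ties are broken alphabetically so the order is deterministic.
        for query, count in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
            out.write("%d\t%s\n" % (count, query))
    return sorted_path


def read_counts(path):
    """Yield (count, query) pairs from a file written by count_and_sort."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            count, query = line.rstrip("\n").split("\t", 1)
            yield int(count), query


def merge_sorted_files(sorted_paths):
    """Step 3: k-way merge of the sorted bucket files using a heap (heapq.merge)."""
    streams = [read_counts(p) for p in sorted_paths]
    # The key must match the order in which each file was written.
    return list(heapq.merge(*streams, key=lambda item: (-item[0], item[1])))


def sort_queries(queries, num_buckets=10):
    """Run all three steps in a temporary directory and return (count, query) pairs."""
    with tempfile.TemporaryDirectory() as workdir:
        buckets = partition_queries(queries, workdir, num_buckets)
        sorted_paths = [count_and_sort(p) for p in buckets]
        return merge_sorted_files(sorted_paths)
```

For example, sort_queries(["a", "b", "a", "c", "b", "a"]) returns [(3, "a"), (2, "b"), (1, "c")]. The point of partitioning first is that each bucket can be counted in memory on its own, while the final merge only keeps one pending line per bucket in the heap.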