牵牛花

2017年5月24日

摘要：这里要查询的是去过的国家数（country）的次数ct大于2的人的名字 select name ,count(country) ct from sz03 where ct >2 group by name; ERROR 1054 (42S22): Unknown column 'ct' in 'wh 阅读全文

posted @ 2017-05-24 15:08 牵牛花阅读(2025) 评论(0) 推荐(0)

hive中order by,sort by, distribute by, cluster by作用以及用法

摘要： hive中order by,sort by, distribute by, cluster by作用以及用法阅读全文

posted @ 2017-05-24 14:46 牵牛花阅读(169) 评论(0) 推荐(0)

hive中遇到的问题

摘要：结果图按照理解来说，应该只有一个1啊，难道这个sql有问题，自己没有理解对group by 的用法？以上是hive的下面的是mysql的感觉这条sql写的是有问题啊阅读全文

posted @ 2017-05-24 10:02 牵牛花阅读(247) 评论(0) 推荐(0)

2017年5月23日

解读：计数器Counter

摘要：解读：计数器Counter http://www.cnblogs.com/codeOfLife/p/5521356.html 这个讲的更详细阅读全文

posted @ 2017-05-23 17:29 牵牛花阅读(409) 评论(0) 推荐(0)

hadoop System times on machines may be out of sync. Check system time and time zones.

摘要：之前环境一直好好的，由于玩坏了一个mini3只能复制一个了，但是复制之后就出现这个问题了解决办法是设置xshell向每一个窗口发消息http://mofansheng.blog.51cto.com/8792265/1683336 设置时间 date -s "2012-11-03 10:25:25 阅读全文

posted @ 2017-05-23 16:31 牵牛花阅读(932) 评论(0) 推荐(0)

当客户端提交更新数据请求时，是先写入edits，然后再写入内存的

摘要： http://blog.sina.com.cn/s/blog_6f83c7470101b7d3.html http://blog.csdn.net/slq1023/article/details/49826081 当客户端提交更新数据请求时，是先写入edits，然后再写入内存的阅读全文

posted @ 2017-05-23 10:08 牵牛花阅读(221) 评论(0) 推荐(0)

大量小文件的优化策略

摘要：默认情况下，TextInputFormat对任务的切片机制是按照文件规划切片的，不管文件大小，都会有一个单独的切片，都会交给一个maptask，此时如果有很多小文件就会产生大量的maptask，导致处理效率低下优化1 最好的办法，在数据处理系统的最前端（预处理/采集）就将小文件合并成大大文件再上传阅读全文

posted @ 2017-05-23 08:59 牵牛花阅读(450) 评论(0) 推荐(0)

2017年5月22日

MapReduce Input Split 输入分/切片

摘要： MapReduce Input Split（输入分/切片）详解 public static long getMaxSplitSize(JobContext context) { return context.getConfiguration().getLong(SPLIT_MAXSIZE, Long 阅读全文

posted @ 2017-05-22 17:27 牵牛花阅读(388) 评论(0) 推荐(0)

hadoop partitioner个数与reducer个数的试验

摘要： job.setPartitionerClass(myPartitioner.class);//设置了5个 job.setNumReduceTasks(2); 1.当分区数等于rducer数量时，正常运行， 2.当分区数等于5时，reduce为1时，正常运行,有一个结果文件当reduce数量=2时报阅读全文

posted @ 2017-05-22 14:53 牵牛花阅读(216) 评论(0) 推荐(0)

2017年5月21日

Hadoop之MapReduce的两种任务模式

摘要： http://qianshangding.iteye.com/blog/2259421 Hadoop之MapReduce的两种任务模式阅读全文

posted @ 2017-05-21 10:23 牵牛花阅读(236) 评论(0) 推荐(0)

公告