Avoiding small files in Hive
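-- Input side: combine small files into larger splits (sizes in bytes).
set mapred.max.split.size=256000000;
set mapred.min.split.size.per.node=100000000;
set mapred.min.split.size.per.rack=100000000;
set hive.input.format=org.apache.hadoop.hive.ql.io.CombineHiveInputFormat;
-- Output side: merge small result files at the end of map-only and map-reduce jobs.
set hive.merge.mapfiles=true;
set hive.merge.mapredfiles=true;
-- Target size (bytes) of each merged file.
set hive.merge.size.per.task=256000000;
-- Run the merge job when the average output file size is below this threshold (bytes).
set hive.merge.smallfiles.avgsize=16000000;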
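As a minimal sketch of how the merge settings might be applied to a table that already contains many small files, the statements below rewrite a table onto itself so that the merge step can compact the output. The table name `demo_db.events` is hypothetical and assumed to be unpartitioned. Whether the extra merge job actually runs depends on the Hive version and execution engine: the two flags shown apply to MapReduce jobs, and Tez uses a separate `hive.merge.tezfiles` flag.

```sql
-- Hypothetical, unpartitioned table; replace demo_db.events with a real table.
set hive.merge.mapfiles=true;
set hive.merge.mapredfiles=true;
set hive.merge.size.per.task=256000000;
set hive.merge.smallfiles.avgsize=16000000;

-- Rewrite the table onto itself. If the average output file size is below
-- hive.merge.smallfiles.avgsize, Hive launches an extra job to merge the files.
INSERT OVERWRITE TABLE demo_db.events
SELECT * FROM demo_db.events;
```

Comparing the file count under the table's HDFS directory before and after the rewrite is one way to check that the merge took effect.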
This article comes from cnblogs (博客园). Author: 小白啊小白,Fighting. When reposting, please cite the original link: https://www.cnblogs.com/ywjfx/p/16206529.html
