Compression configuration:
Enable compression of query output:
SET hive.exec.compress.output=true;
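Set the compression codec (see the Hive configuration reference). Both lines below set the same property, so use only one of them: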
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.BZip2Codec;
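As a minimal sketch of how these settings might be used together in one session: the table names `logs_raw` and `logs_compressed` are hypothetical and not part of the original notes. Only one codec is set, because a later SET on the same property would override the earlier one.

```sql
-- Enable compression for the final output of the query.
SET hive.exec.compress.output=true;

-- Pick exactly one codec; Snappy is used here purely as an illustration.
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;

-- Hypothetical tables: rows written by this statement are stored
-- using the codec selected above.
INSERT OVERWRITE TABLE logs_compressed
SELECT * FROM logs_raw;
```

On newer Hadoop versions the property `mapred.output.compression.codec` is a deprecated alias of `mapreduce.output.fileoutputformat.compress.codec`; which name applies depends on the cluster version.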
File format configuration:
Specify the storage format when creating a Hive table, using one of the following clauses:
STORED AS TEXTFILE
STORED AS RCFILE
STORED AS SEQUENCEFILE
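For illustration, the clause goes at the end of the CREATE TABLE statement. The sketch below assumes a hypothetical table `page_views` with made-up columns; swapping SEQUENCEFILE for TEXTFILE or RCFILE selects one of the other formats listed above.

```sql
-- Hypothetical table; the column names are assumptions for the example.
CREATE TABLE page_views (
  user_id   BIGINT,
  url       STRING,
  view_time STRING
)
STORED AS SEQUENCEFILE;  -- or TEXTFILE / RCFILE
```

The storage format is fixed per table at creation time, while the compression settings are session-level, so the two are configured separately.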