but it's map only job

At 2012-09-27 05:39:39,"Chen Song" <chen.song...@gmail.com> wrote:
As far as I know, the number of files emitted would be determined by the number 
of mappers for a map only job and the number of reducers for a map reduce job.


So it totally depends how your query translates into a MR job.


You can enforce it by setting the property


mapred.reduce.tasks=1


Chen


On Wed, Sep 19, 2012 at 11:25 PM, 王锋 <wfeng1...@163.com> wrote:
Hi
   I tried to convert and merge many small text files using RCFiles using 
hivesql,but hive  produced some small rcfiles.
set hive.exec.compress.output=true;
set mapred.output.compress=true;
set mapred.output.compression.codec=com.hadoop.compression.lzo.LzoCodec;
set io.compression.codecs=com.hadoop.compression.lzo.LzoCodec;
hive.merge.mapfiles=true
hive.merge.mapredfiles=true
hive.merge.size.per.task=640000000
hive.merge.size.smallfiles.avgsize=80000000
insert  overwrite table rctable select .....


  the settings:
hive.merge.mapfiles=true
hive.merge.mapredfiles=true
hive.merge.size.per.task=640000000
hive.merge.size.smallfiles.avgsize=80000000
didn't work.


who could tell me how to solve it?





--
Chen Song




Reply via email to