[ https://issues.apache.org/jira/browse/HIVE-396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12741583#action_12741583 ]
Namit Jain commented on HIVE-396: --------------------------------- > I looked in hive-default.xml and didn't see any hive.merge.mapfiles. Should I > add it to hive-defult.xml and set it to false? YES > Out of curiosity, why do you default to merging map files first? There are production map-only jobs which lead to a lot of small files for many partitions, which increases the load on the name node, and too many mappers for processing the data across many partitions. > Hive performance benchmarks > --------------------------- > > Key: HIVE-396 > URL: https://issues.apache.org/jira/browse/HIVE-396 > Project: Hadoop Hive > Issue Type: New Feature > Reporter: Zheng Shao > Assignee: Yuntao Jia > Attachments: AlansMRcode.tgz, hive_benchmark_2009-06-18.pdf, > hive_benchmark_2009-06-18.tar.gz, hive_benchmark_2009-07-12.pdf, > hive_benchmark_2009-07-21.tar.gz > > > We need some performance benchmark to measure and track the performance > improvements of Hive. > Some references: > PIG performance benchmarks PIG-200 > PigMix: http://wiki.apache.org/pig/PigMix -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.