[ https://issues.apache.org/jira/browse/HIVE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13004852#comment-13004852 ]
Ning Zhang commented on HIVE-2037: ---------------------------------- @joy, the unit tests are clean. > Merge result file size should honor hive.merge.size.per.task > ------------------------------------------------------------ > > Key: HIVE-2037 > URL: https://issues.apache.org/jira/browse/HIVE-2037 > Project: Hive > Issue Type: Bug > Reporter: Ning Zhang > Assignee: Ning Zhang > Attachments: HIVE-2037.patch > > > The merge job set mapred.min.split.size to the value of > hive.merge.size.per.task, which roughly equals to the output file size. > However the input split size is also determined by > mapred.min.split.size.per.node, mapred.min.split.size.per.rack, and > mapred.max.split.size. They should be set the same as > hive.merge.size.per.task as well. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira