[ https://issues.apache.org/jira/browse/MAPREDUCE-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13595550#comment-13595550 ]
Mariappan Asokan commented on MAPREDUCE-3685: --------------------------------------------- Hi Ravi, I guess I am too late to comment since your patch has been committed already. In any case, I have the following comments since you asked:) * In {{closeOnDiskFile()}} the following lines of code {code} if (onDiskMapOutputs.size() >= (2 * ioSortFactor - 1)) { onDiskMerger.startMerge(onDiskMapOutputs); } {code} can be changed to {code} if (onDiskMapOutputs.size() >= ioSortFactor) { onDiskMerger.startMerge(onDiskMapOutputs); } {code} Please confirm. * In the class {{CompressAwarePath}} there is a nit in {{compareTo().}} The following lines: {code} } else if (this.getCompressedSize() > compPath.getCompressedSize()) { return 1; {code} can be simplified as: {code} } else { return 1; {code} The set will be partially ordered without an additional compare and without executing the line {code} return super.compareTo(obj); {code} * Since the patch fixes some performance issues, did you have a chance to run some benchmarks that show improvements? I know this will take some time. I will leave it to you. -- Asokan > There are some bugs in implementation of MergeManager > ----------------------------------------------------- > > Key: MAPREDUCE-3685 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-3685 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: mrv2 > Affects Versions: 0.23.1 > Reporter: anty.rao > Assignee: anty > Priority: Critical > Fix For: 0.23.7, 2.0.4-beta > > Attachments: MAPREDUCE-3685-branch-0.23.1.patch, > MAPREDUCE-3685-branch-0.23.1.patch, MAPREDUCE-3685-branch-0.23.1.patch, > MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.branch-0.23.patch, > MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.branch-0.23.patch, > MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, > MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, > MAPREDUCE-3685.patch, MAPREDUCE-3685.patch > > -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira