[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13595550#comment-13595550
 ] 

Mariappan Asokan commented on MAPREDUCE-3685:
---------------------------------------------

Hi Ravi,
  I guess I am too late to comment since your patch has been committed already. 
 In any case, I have the following comments since you asked:)

* In {{closeOnDiskFile()}} the following lines of code
{code}
    if (onDiskMapOutputs.size() >= (2 * ioSortFactor - 1)) {
      onDiskMerger.startMerge(onDiskMapOutputs);
    }
{code}
can be changed to
{code}
    if (onDiskMapOutputs.size() >= ioSortFactor) {
      onDiskMerger.startMerge(onDiskMapOutputs);
    }
{code}
Please confirm.
* In the class {{CompressAwarePath}} there is a nit in {{compareTo().}}  The 
following lines:
{code}
        } else if (this.getCompressedSize() > compPath.getCompressedSize()) {
          return 1;
{code}
can be simplified as:
{code}
        } else {
          return 1;
{code}
The set will be partially ordered without an additional compare and without 
executing the line
{code}
      return super.compareTo(obj);
{code}
* Since the patch fixes some performance issues, did you have a chance to run 
some benchmarks that show improvements?  I know this will take some time.  I 
will leave it to you.

-- Asokan

                
> There are some bugs in implementation of MergeManager
> -----------------------------------------------------
>
>                 Key: MAPREDUCE-3685
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3685
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.1
>            Reporter: anty.rao
>            Assignee: anty
>            Priority: Critical
>             Fix For: 0.23.7, 2.0.4-beta
>
>         Attachments: MAPREDUCE-3685-branch-0.23.1.patch, 
> MAPREDUCE-3685-branch-0.23.1.patch, MAPREDUCE-3685-branch-0.23.1.patch, 
> MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.branch-0.23.patch, 
> MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.branch-0.23.patch, 
> MAPREDUCE-3685.branch-0.23.patch, MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, 
> MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, MAPREDUCE-3685.patch, 
> MAPREDUCE-3685.patch, MAPREDUCE-3685.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to