[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13701043#comment-13701043
 ] 

Karthik Kambatla commented on MAPREDUCE-5221:
---------------------------------------------

The patch is easier to understand now. The patch can be summarized as follows 
(correct me if I am wrong): Use ReduceTask.CombinerRunner to run the combiner. 
That takes care of retrieving the combiner-class for the API being used and run 
the combiner accordingly. So, the current combine code (combineAndSpill) can be 
done away with.

Comment:
# From the looks of it, the {{Reporter}} passed to {{MergeManagerImpl}} 
constructor can only be a TaskReporter. For this patch, it should be okay to 
typecast it unconditionally. With the current if condition, it is easy to 
wonder how the combine works for Reporters which are not TaskReporter.
# Consequently, may be in a different JIRA, we should use Reporter/TaskReporter 
uniform everywhere - ShufflePlugin, MergeManagerImpl, and CombinerRunner.

Other than that, the patch looks good to me. Should we close this other JIRAs 
as duplicates?
                
> Reduce side Combiner is not used when using the new API
> -------------------------------------------------------
>
>                 Key: MAPREDUCE-5221
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5221
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 2.0.4-alpha
>            Reporter: Siddharth Seth
>            Assignee: Tsuyoshi OZAWA
>         Attachments: MAPREDUCE-5221.1.patch, MAPREDUCE-5221.2.patch, 
> MAPREDUCE-5221.3.patch, MAPREDUCE-5221.4.patch, MAPREDUCE-5221.5.patch
>
>
> If a combiner is specified using o.a.h.mapreduce.Job.setCombinerClass - this 
> will silently ignored on the reduce side since the reduce side usage is only 
> aware of the old api combiner.
> This doesn't fail the job - since the new combiner key does not deprecate the 
> old key.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to