[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mariappan Asokan updated MAPREDUCE-4808:
----------------------------------------

    Attachment: mapreduce-4808.patch

I will briefly outline what the patch does and the rationale.
* It makes the {{MergeManager}} pluggable.
Rationale: {{MergeManager}} does the merge sorting on the reduce side and is 
part of the overall sorting that happens in the MR data flow.
* It makes the {{MapOutput}} class overridable.
Rationale: {{MergeManager}} plugin implementations can make efficient use of 
JVM memory for data shuffling and provide their own implementation of 
{{MapOutput}}.
* It makes local job runs use {{MergeManager}} or plugin implementations to do 
the merge sort instead of {{Merger}}.
Rationale: Local job runs should also be able to use the sort plugin on the 
reduce side, just as they can use the sort plugin on the map side.
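
The three points above could be sketched roughly as follows. This is a 
self-contained illustration of the plugin pattern, not code from the patch: 
{{SketchMergeManager}}, {{SketchMapOutput}}, {{DefaultSketchMergeManager}}, 
{{mergeForReduce}} and the configuration key 
{{sketch.reduce.merge.plugin.class}} are all hypothetical names, and a plain 
{{Map}} stands in for the job configuration. The real Hadoop interfaces and 
keys may look quite different.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

// Illustrative only: every type and key below is hypothetical, not the Hadoop API.
class PluggableMergeSketch {

    // Hypothetical configuration key naming the merge plugin class.
    static final String PLUGIN_KEY = "sketch.reduce.merge.plugin.class";

    /** Stand-in for an overridable MapOutput: one sorted segment from one map task. */
    static class SketchMapOutput {
        private final List<String> sortedKeys;

        SketchMapOutput(List<String> sortedKeys) {
            this.sortedKeys = sortedKeys;
        }

        List<String> keys() {
            return sortedKeys;
        }
    }

    /** Stand-in for the pluggable reduce-side merge contract. */
    interface SketchMergeManager {
        // The plugin decides which MapOutput (sub)class holds the shuffled data.
        SketchMapOutput reserve(List<String> sortedKeys);

        // Called once a segment has been fully fetched.
        void commit(SketchMapOutput output);

        // Merges all committed segments into one sorted run.
        List<String> close();
    }

    /** Default plugin: keeps segments on the heap and does a k-way merge. */
    static class DefaultSketchMergeManager implements SketchMergeManager {
        private final List<SketchMapOutput> segments = new ArrayList<>();

        @Override
        public SketchMapOutput reserve(List<String> sortedKeys) {
            return new SketchMapOutput(sortedKeys);
        }

        @Override
        public void commit(SketchMapOutput output) {
            segments.add(output);
        }

        @Override
        public List<String> close() {
            // Heap entries are {segment index, position}, ordered by the key they point at.
            PriorityQueue<int[]> heap =
                    new PriorityQueue<>((a, b) -> keyAt(a).compareTo(keyAt(b)));
            for (int i = 0; i < segments.size(); i++) {
                if (!segments.get(i).keys().isEmpty()) {
                    heap.add(new int[] {i, 0});
                }
            }
            List<String> merged = new ArrayList<>();
            while (!heap.isEmpty()) {
                int[] head = heap.poll();
                merged.add(keyAt(head));
                if (head[1] + 1 < segments.get(head[0]).keys().size()) {
                    heap.add(new int[] {head[0], head[1] + 1});
                }
            }
            return merged;
        }

        private String keyAt(int[] cursor) {
            return segments.get(cursor[0]).keys().get(cursor[1]);
        }
    }

    /** Loads the configured plugin reflectively, falling back to the default. */
    static SketchMergeManager newMergeManager(Map<String, String> conf) {
        String className =
                conf.getOrDefault(PLUGIN_KEY, DefaultSketchMergeManager.class.getName());
        try {
            return Class.forName(className)
                    .asSubclass(SketchMergeManager.class)
                    .getDeclaredConstructor()
                    .newInstance();
        } catch (ReflectiveOperationException e) {
            throw new IllegalArgumentException("Cannot load merge plugin " + className, e);
        }
    }

    /** One entry point for cluster and local runs, so both go through the plugin. */
    static List<String> mergeForReduce(Map<String, String> conf,
                                       List<List<String>> mapOutputs) {
        SketchMergeManager manager = newMergeManager(conf);
        for (List<String> keys : mapOutputs) {
            manager.commit(manager.reserve(keys));
        }
        return manager.close();
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        List<String> merged = mergeForReduce(conf,
                Arrays.asList(Arrays.asList("a", "d"), Arrays.asList("b", "c")));
        System.out.println(merged); // prints [a, b, c, d]
    }
}
```

In this sketch a different merge strategy would be selected by setting the 
hypothetical key to another class that implements {{SketchMergeManager}}. 
That class could return its own {{SketchMapOutput}} subclass from 
{{reserve}}, which mirrors the second point above.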
                
> Allow reduce-side merge to be pluggable
> ---------------------------------------
>
>                 Key: MAPREDUCE-4808
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4808
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>    Affects Versions: 2.0.2-alpha
>            Reporter: Arun C Murthy
>            Assignee: Mariappan Asokan
>             Fix For: 2.0.3-alpha
>
>         Attachments: COMBO-mapreduce-4809-4812-4808.patch, 
> mapreduce-4808.patch, mapreduce-4808.patch
>
>
> Allow reduce-side merge to be pluggable for MAPREDUCE-2454

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
