[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13447292#comment-13447292
 ] 

Tsuyoshi OZAWA commented on MAPREDUCE-3902:
-------------------------------------------

Thanks for your enumerating remaining tasks, Siddharth. I'll support you as far 
as possible. 

And I've not yet explained you the relationship between container-reuse work 
and MAPREDUCE-4502, so it may confuse you. I'm sorry for the short of 
explanation. I'll give it to you briefly. I'm planning to implement 
MAPREDUCE-4502 and MAPREDUCE-4525 with container-reuse implementation, because 
MRAppMaster in container-reuse implementation has the feature to monitor 
whether the running tasks on the containers are "the last task at a machine or 
not", for the purpose of exiting JVMs on containers, as you know. This feature 
is very similar to monitor task progress per containers, for the purpose of 
starting to run combiner for multi-level aggregation (MAPREDUCE-4502 and 
MAPREDUCE-4525).
                                                                                
                                                                                
                    
The description here is not documented, so I'll write down my thought as the 
design note for MAPREDUCE-4502 and MAPREDUCE-4525 within next one week. I'm 
very appreciate if you review it.
                                                                                
                                                                                
                    
                                                                                
                                                                                
                    
Thanks,                                                                         
                                                                                
                    
Tsuyoshi          
                
> MR AM should reuse containers for map tasks, there-by allowing fine-grained 
> control on num-maps for users without need for CombineFileInputFormat etc.
> ------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3902
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3902
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: applicationmaster, mrv2
>            Reporter: Arun C Murthy
>            Assignee: Siddharth Seth
>         Attachments: MAPREDUCE-3902.2.patch, MAPREDUCE-3902.patch
>
>
> The MR AM is now in a great position to reuse containers across (map) tasks. 
> This is something similar to JVM re-use we had in 0.20.x, but in a 
> significantly better manner:
> # Consider data-locality when re-using containers
> # Consider the new shuffle - ensure that reduces fetch output of the whole 
> container at once (i.e. all maps)  : MAPREDUCE-4525 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to