[ 
https://issues.apache.org/jira/browse/YARN-3712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560679#comment-14560679
 ] 

Jun Gong commented on YARN-3712:
--------------------------------

[~vinodkv] 

Our case: NM receives a event SHUTDOWN, and starts to clean up containers. If 
doing it synchronously and cleaning up takes a little long time, some 
containers might not be killed && cleaned up, then corresponding launching 
container process  "ContainersLauncher #.." will not exit until container 
finishes. It will result problem likes YARN-3585, NM hang.

> ContainersLauncher: handle event CLEANUP_CONTAINER asynchronously
> -----------------------------------------------------------------
>
>                 Key: YARN-3712
>                 URL: https://issues.apache.org/jira/browse/YARN-3712
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: nodemanager
>            Reporter: Jun Gong
>            Assignee: Jun Gong
>         Attachments: YARN-3712.01.patch, YARN-3712.02.patch
>
>
> It will save some time by handling event CLEANUP_CONTAINER asynchronously. 
> This improvement will be useful for cases that cleaning up container cost a 
> little long time(e.g. for our case: we are running Docker container on NM, it 
> will take above 1 seconds to clean up one docker container.  ) and many 
> containers to clean up(e.g. NM need clean up all running containers when NM 
> shutdown). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to