[ 
https://issues.apache.org/jira/browse/YARN-1337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Prakash updated YARN-1337:
-------------------------------

    Assignee:     (was: Ravi Prakash)

> Recover active container state upon nodemanager restart
> -------------------------------------------------------
>
>                 Key: YARN-1337
>                 URL: https://issues.apache.org/jira/browse/YARN-1337
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>    Affects Versions: 2.3.0
>            Reporter: Jason Lowe
>
> To support work-preserving NM restart we need to recover the state of the 
> containers that were active when the nodemanager went down.  This includes 
> informing the RM of containers that have exited in the interim and a strategy 
> for dealing with the exit codes from those containers along with how to 
> reacquire the active containers and determine their exit codes when they 
> terminate.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to