Zhijie Shen created YARN-966:
--------------------------------

             Summary: The thread of ContainerLaunch#call will fail without any 
signal if getLocalizedResources() is called when the container is not at 
LOCALIZED
                 Key: YARN-966
                 URL: https://issues.apache.org/jira/browse/YARN-966
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Zhijie Shen
            Assignee: Zhijie Shen


In ContainerImpl.getLocalizedResources(), there's:
{code}
assert ContainerState.LOCALIZED == getContainerState(); // TODO: FIXME!!
{code}

ContainerImpl.getLocalizedResources() is called in ContainerLaunch.call(), 
which is scheduled on a separate thread. If the container is not at LOCALIZED 
(e.g. it is at KILLING, see YARN-906), an AssertError will be thrown and fails 
the thread without notifying NM. Therefore, the container cannot receive more 
events, which are supposed to be sent from ContainerLaunch.call(), and move 
towards completion. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to