Ravi Prakash created YARN-167:
---------------------------------
Summary: AM stuck in KILL_WAIT for days when node is lost in the
middle
Key: YARN-167
URL: https://issues.apache.org/jira/browse/YARN-167
Project: Hadoop YARN
Issue Type: Bug
Affects Versions: 0.23.3
Reporter: Ravi Prakash
We found some jobs were stuck in KILL_WAIT for days on end. The RM shows them
as RUNNING. When you go to the AM, it shows it in the KILL_WAIT state, and a
few maps running. All these maps were scheduled on nodes which are now in the
RM's Lost nodes list.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira