[ 
https://issues.apache.org/jira/browse/YARN-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13709195#comment-13709195
 ] 

Zhijie Shen commented on YARN-906:
----------------------------------

I did some investigation into this test failure. The test itself seems to have 
no problem. It timed out because the container state remained RUNNING after the 
container was stopped, which was not expected.

Looking into the log of the failed test: after stopContainer was called, the 
Container moved from LOCALIZED to KILLING, but did not move any further. In the 
log of a successful local run, by contrast, the Container moved from LOCALIZED 
to KILLING, and then from KILLING to CONTAINER_CLEANEDUP_AFTER_KILL. The major 
work during that second transition is cleaning up the localized container 
resources (I observed the file deletions being executed). The failed test log, 
however, didn't show any file deletion. Therefore, I suspect that something is 
blocking during container resource cleanup. Thoughts?
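
The comparison between the failed and the successful log could be scripted 
roughly as follows. This is only a sketch: it assumes the log reports 
transitions in lines of the form "Container <id> transitioned from <OLD> to 
<NEW>", and the helper names state_history and stuck_in are made up for 
illustration.

```python
import re

# Assumed line shape: "... Container <id> transitioned from <OLD> to <NEW>"
TRANSITION = re.compile(r"Container (\S+) transitioned from (\w+) to (\w+)")


def state_history(log_lines):
    """Map each container id to the ordered list of states seen in the log."""
    history = {}
    for line in log_lines:
        match = TRANSITION.search(line)
        if match is None:
            continue  # not a state-transition line
        container_id, old_state, new_state = match.groups()
        # Seed the history with the first known old state, then append.
        states = history.setdefault(container_id, [old_state])
        states.append(new_state)
    return history


def stuck_in(history, state="KILLING"):
    """Return the ids of containers whose last recorded state is `state`."""
    return sorted(cid for cid, states in history.items() if states[-1] == state)
```

Running state_history over both logs and calling stuck_in on each result 
would list the containers that entered KILLING but never reached 
CONTAINER_CLEANEDUP_AFTER_KILL.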

More investigation is needed to further locate the problem.
                
> TestNMClient.testNMClientNoCleanupOnStop fails occasionally
> -----------------------------------------------------------
>
>                 Key: YARN-906
>                 URL: https://issues.apache.org/jira/browse/YARN-906
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Zhijie Shen
>            Assignee: Zhijie Shen
>
> See 
> https://builds.apache.org/job/PreCommit-YARN-Build/1435//testReport/org.apache.hadoop.yarn.client.api.impl/TestNMClient/testNMClientNoCleanupOnStop/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
