[ https://issues.apache.org/jira/browse/YARN-3678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14559649#comment-14559649 ]
Vinod Kumar Vavilapalli commented on YARN-3678: ----------------------------------------------- Tx for the update [~varun_saxena]. You mentioned LCE. But like I said before, LCE kills containers as the app-submitter. So, in your case, what is the user running the containers? > DelayedProcessKiller may kill other process other than container > ---------------------------------------------------------------- > > Key: YARN-3678 > URL: https://issues.apache.org/jira/browse/YARN-3678 > Project: Hadoop YARN > Issue Type: Bug > Components: nodemanager > Affects Versions: 2.6.0 > Reporter: gu-chi > Priority: Critical > > Suppose one container finished, then it will do clean up, the PID file still > exist and will trigger once singalContainer, this will kill the process with > the pid in PID file, but as container already finished, so this PID may be > occupied by other process, this may cause serious issue. > As I know, my NM was killed unexpectedly, what I described can be the cause. > Even rarely occur. -- This message was sent by Atlassian JIRA (v6.3.4#6332)