[
https://issues.apache.org/jira/browse/UIMA-3737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jerry Cwiklik closed UIMA-3737.
-------------------------------
Resolution: Fixed
Dont remove cgroup if it has zombie processes as it may lead to a thread hang.
> DUCC Agent should not try to remove cgroup when kill -9 fails to stop a
> process
> -------------------------------------------------------------------------------
>
> Key: UIMA-3737
> URL: https://issues.apache.org/jira/browse/UIMA-3737
> Project: UIMA
> Issue Type: Bug
> Components: DUCC
> Affects Versions: 1.0-Ducc
> Reporter: Jerry Cwiklik
> Assignee: Jerry Cwiklik
>
> When an agent starts up it tries to clean up the node. It first waits for the
> OR publications and then checks the node for running processes. It then
> reconciles what is running against what should be running (using OR state).
> Any processes not in the OR state should be evicted with kill -9 and its
> cgroup removed.
> If there is a defunct process on a node, it cannot be killed. Such process
> still has associated cgroup which cannot be removed. Its been observed that
> the agent's attempt to remove the cgroup of a defunct process causes the CPU
> utilization to spike for a couple of minutes.
> Agent should detect if a process is defunct and not try to remove its cgroup
--
This message was sent by Atlassian JIRA
(v6.2#6252)