[ https://issues.apache.org/jira/browse/UIMA-5528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Richard Eckart de Castilho resolved UIMA-5528.
----------------------------------------------
Resolution: Abandoned
DUCC has been retired.
> UIMA-DUCC: improve agent monitoring of cgroups
> -----------------------------------------------
>
> Key: UIMA-5528
> URL: https://issues.apache.org/jira/browse/UIMA-5528
> Project: UIMA
> Issue Type: Improvement
> Components: DUCC
> Reporter: Jaroslaw Cwiklik
> Assignee: Jaroslaw Cwiklik
> Priority: Major
>
> Currently the agent validates a node's cgroups at startup only. On older
> versions of RedHat, the cgroup memory subsystem has been observed to
> disappear because of an OS bug. All subsequent jobs then fail because
> cgroup creation fails.
> Modify the agent's node monitoring to test cgroup creation at regular
> intervals. This check should be part of the node metrics collection.
> If cgroup creation fails, the agent should mark the state of cgroups as
> 'Broken'. This new state will be displayed by duccmon.
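
For illustration only, the periodic check proposed above could be sketched roughly as follows. This is not DUCC code: the class name CgroupProbe, the CgroupState enum, the probe directory name, and the scheduling helper are all hypothetical, and the sketch assumes that creating and then removing an empty directory under the cgroup base path is an adequate test of cgroup creation.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch of a periodic cgroup creation check (not part of DUCC). */
public class CgroupProbe {
    /** Only 'Broken' is named in the issue; OK is an assumed counterpart. */
    public enum CgroupState { OK, BROKEN }

    private final Path cgroupBase;
    private volatile CgroupState state = CgroupState.OK;

    /** cgroupBase is assumed to be the directory under which job cgroups are created. */
    public CgroupProbe(Path cgroupBase) {
        this.cgroupBase = cgroupBase;
    }

    /** Try to create and remove a throwaway directory under the cgroup base. */
    public CgroupState check() {
        Path probe = cgroupBase.resolve("ducc-probe-" + System.nanoTime());
        try {
            Files.createDirectory(probe); // fails if the base path has disappeared
            Files.delete(probe);          // remove the empty probe directory again
            state = CgroupState.OK;
        } catch (IOException | SecurityException e) {
            state = CgroupState.BROKEN;   // the state a monitor would display
        }
        return state;
    }

    /** Result of the most recent check. */
    public CgroupState getState() {
        return state;
    }

    /** Run check() at a fixed interval, standing in for the node metrics cycle. */
    public ScheduledExecutorService start(long periodSeconds) {
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleAtFixedRate(this::check, 0, periodSeconds, TimeUnit.SECONDS);
        return scheduler;
    }
}
```

In a real agent the result of check() would presumably be attached to the node metrics that are already published, rather than run on a separate scheduler as in this sketch.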
--
This message was sent by Atlassian Jira
(v8.20.10#820010)