[ https://issues.apache.org/jira/browse/IGNITE-6587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrey Kuznetsov updated IGNITE-6587: ------------------------------------- Description: As described in [1], each Ignite node has a number of system-critical threads. We should implement a periodic check that calls failure handler when one of the following conditions has been detected: # Critical thread is not alive anymore. # Critical thread 'hangs' for a long time, e.g. while executing a task extracted from task queue. Actual list of system-critical threads can be found at [1]. [1] https://cwiki.apache.org/confluence/display/IGNITE/IEP-14+Ignite+failures+handling was: As described in [1], each Ignite node has a number of system-critical threads. We should implement a periodic check that calls failure handler when one of the following conditions has been detected: # Critical thread is not alive anymore. # Critical thread remains in BLOCKED state for a long time. Actual list of system-critical threads can be found at [1]. [1] https://cwiki.apache.org/confluence/display/IGNITE/IEP-14+Ignite+failures+handling > Ignite watchdog service > ----------------------- > > Key: IGNITE-6587 > URL: https://issues.apache.org/jira/browse/IGNITE-6587 > Project: Ignite > Issue Type: Improvement > Components: general > Affects Versions: 2.2 > Reporter: Alexey Goncharuk > Assignee: Andrey Gura > Priority: Major > Labels: IEP-5 > Fix For: 2.6 > > Attachments: watchdog.sh > > > As described in [1], each Ignite node has a number of system-critical > threads. We should implement a periodic check that calls failure handler when > one of the following conditions has been detected: > # Critical thread is not alive anymore. > # Critical thread 'hangs' for a long time, e.g. while executing a task > extracted from task queue. > Actual list of system-critical threads can be found at [1]. > [1] > https://cwiki.apache.org/confluence/display/IGNITE/IEP-14+Ignite+failures+handling -- This message was sent by Atlassian JIRA (v7.6.3#76005)