Hi, Your log doesn't have the full thread dumps and I can't find some information (e.g Topology Snapshots). However, I see that checkpoint thread was blocked for a long time:
[02:45:50,849][SEVERE][tcp-disco-msg-worker-[3dac150e 10.20.4.18:47500]-#2][G] Blocked system-critical thread has been detected. This can lead to cluster-wide undefined behaviour [workerName=db-checkpoint-thread, threadName=db-checkpoint-thread-#54, blockedFor=172s] But I see that it blocked not longer then 3 minutes. I guess that checkpoint lock can't be taken until some other operation will not be timeout. It can be some network related timeout or some operation timeout. So please check your configuration and find where you have 3 min timeout and check what is related to this timeout. BR, Andrei -- Sent from: http://apache-ignite-users.70518.x6.nabble.com/