I have a cluster(2.7.0) with persistence enabled, the cluster has three
server nodes(8c16g) and every node has 8 Gb data(1 primary and 1 backup).

When I restart one of them, the whole cluster will stop about 60 seconds and
do not process any request from the client.

>From the logs, I saw that the node is processing partition map exchange,
almost 60 seconds was spent on it.

My disk is SSD which has a high read and write speed.
The network is well too, the nodes can connect with each other, and the
latency is very low(less than 0.2 ms) when restarting.
I also have investigated the GC logs of three nodes, most of GC is finished
in 50ms.
My cache AtomicityMode is CacheAtomicityMode.ATOMIC, so there are no long
transactions.

So I want to ask is it normal that the cluster is stuck for 60 seconds?

If it is not normal, then where should the problem be most, how to optimize?

Looking forward to your replies.






--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/

Reply via email to