[ClusterLabs] Speed up the resource moves in the case of a node hard shutdown

Maxim Mon, 12 Feb 2018 04:02:47 -0800

Hello,

[Sorry for a message duplication. Web mail client ruined the formattingof the previous e-mail =( ]

There is a simple configuration of two cluster nodes (built via RHEL 6pcs interface) with multiple master/slave resources, disabled fencingand the single sync interface.

All is ok mainly. But there is some problem of the cluster activityperformance when the master node is powered off (hard): the slave nodedetects that the master one is down after about 100-3500 ms. And themain question is how to avoid this 3 sec delay that occurred sometimes.

On the slave node i have a little script that checks the connection tothe master node. It detects a problem of a sync breakage within about100 ms. But corosync requires a much more time sometimes to figure outthe situation and mark the master node as offline one. It shows 'ok'ring status.


If i understand correctly then

1 the pacemaker actions (crm_resource --move) will not perform untilcorosync is not refreshed its ring state2 the detection of a problem (from a corosync side) can be speeded upvia timeout tuning in the corosync.conf3 there is no way to ask corosync to recheck its ring status or mark aring as failed manually


But maybe i'm missing something.

All i want is to move resources faster.

In my little script i tried to force the cluster software to moveresources to the slave node. But i've no success so far.


Could you please share your thoughts about the situation.
Thank you in advance.


Cluster software:
corosync - 2.4.3
pacemaker - 1.1.18
libqb - 1.0.2


corosync.conf:
totem {
      version: 2
      secauth: off
      cluster_name: cluster
      transport: udpu
      token: 2000
}

nodelist {
     node {
         ring0_addr: main-node
         nodeid: 1
    }

     node {
         ring0_addr: reserve-node
         nodeid: 2
     }
}

quorum {
     provider: corosync_votequorum
     two_node: 1
}


Regards,
Maxim.

_______________________________________________
Users mailing list: Users@clusterlabs.org
http://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

[ClusterLabs] Speed up the resource moves in the case of a node hard shutdown

Reply via email to