background:
we are upgrading a (very) old HA cluster running heartbeat, DRBD, and NFS, with no stonith, to a much more modern implementation. for both the existing cluster and the new one, disk space requirements make running a full three-node cluster infeasible, so i am trying to configure a quorum-only node using corosync-qnetd.

the installation went fine, the nodes can communicate, etc., and the cluster seems to perform as desired when gracefully shutting down or restarting a node. but during my torture testing, simulating a node crash by stopping the network on one node leaves the remaining node in limbo for approximately 20 seconds before it and the quorum-only node decide that they are indeed quorate.
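
(for reference, i have been watching the quorum state during these tests with the commands below. this is just a sketch of how i check it, on the assumption that these are the right tools; output trimmed.)

root@xen-nfs01:~# corosync-quorumtool -s     # overall quorum status, including the Qdevice vote
root@xen-nfs01:~# corosync-qdevice-tool -s   # qdevice daemon state on a cluster node
root@xen-quorum:~# corosync-qnetd-tool -l    # connected clients as seen by qnetd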

the problem:
the intended implementation involves DRBD, whose resource-level fencing freezes IO for as long as the remaining node is inquorate, in order to avoid any possible data divergence/split-brain. this precaution is obviously desirable, and it is the reason i am trying to configure this cluster "properly".
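
(to be concrete about what i mean: a minimal sketch of the DRBD side, assuming the in-kernel DRBD 8.4 on debian 10 and the stock crm-fence-peer handlers from drbd-utils; the resource name "r0" is a placeholder, not my actual config.)

resource r0 {
  disk {
    fencing resource-only;   # resource-level fencing, no stonith device involved
  }
  handlers {
    # stock drbd-utils scripts: IO stays frozen while fence-peer runs, which
    # places a pacemaker constraint on the peer; it is removed after resync
    fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
    after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
  }
}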

my (admittedly naive) expectation was that the remaining node and the quorum-only node would continue ticking along as if nothing had happened, and i am hoping that this delay is due to some misconfiguration/oversight/bone-headedness on my part.

so i am seeking to understand the reason for this delay, and whether there is any (prudent) way to reduce it. of course, any other advice on the intended setup is welcome as well.
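
(the only tunables i have turned up so far are the qdevice timeouts in corosync.conf, per corosync-qdevice(8); below is a sketch with example values that i have not actually tried. the 20000 ms window in the qnetd log looks like it could be 2x the default timeout of 10000 ms, and the "maximum for 30000 ms" in the corosync log matches the default sync_timeout, but those are guesses on my part.)

quorum {
  provider: corosync_votequorum
  device {
    model: net
    votes: 1
    timeout: 5000         # example value only; default 10000 ms
    sync_timeout: 15000   # example value only; default 30000 ms
    net {
      tls: on
      host: xen-quorum
      algorithm: ffsplit
    }
  }
}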

please let me know if you require any additional details.

TIA
Sherrard

details:
for testing purposes, i have stripped the cluster down to a single dummy resource:

root@xen-nfs01:~# crm configure show
node 1: xen-nfs01
node 2: xen-nfs02
primitive dummy Dummy
property cib-bootstrap-options: \
        have-watchdog=false \
        stonith-enabled=false \
        dc-version=2.0.1-9e909a5bdd \
        cluster-infrastructure=corosync \
        cluster-name=xen-nfs01_xen-nfs02 \
        last-lrm-refresh=1571422320 \
        maintenance-mode=false


all nodes are running debian 10, with the following package versions:

root@xen-nfs01:~# dpkg -l | grep -F -e corosync -e pacemaker
ii  corosync                   3.0.1-2                      amd64  cluster engine daemon and utilities
ii  corosync-qdevice            3.0.0-4                      amd64  cluster engine quorum device daemon
ii  crmsh                       4.0.0~git20190108.3d56538-3  all    CRM shell for the pacemaker cluster manager
ii  libcorosync-common4:amd64   3.0.1-2                      amd64  cluster engine common library
ii  pacemaker                   2.0.1-5                      amd64  cluster resource manager
ii  pacemaker-cli-utils         2.0.1-5                      amd64  cluster resource manager command line utilities
ii  pacemaker-common            2.0.1-5                      all    cluster resource manager common files
ii  pacemaker-resource-agents   2.0.1-5                      all    cluster resource manager general resource agents

root@xen-quorum:~# dpkg -l | grep -F -e corosync -e pacemaker
ii corosync-qnetd 3.0.0-4 amd64 cluster engine quorum device network daemon


/etc/corosync/corosync.conf:
totem {
  version: 2
  cluster_name: xen-nfs01_xen-nfs02
  crypto_cipher: aes256
  crypto_hash: sha512
}
logging {
  fileline: off
  to_stderr: yes
  to_logfile: yes
  logfile: /var/log/corosync/corosync.log
  to_syslog: yes
  debug: off
  logger_subsys {
    subsys: QUORUM
    debug: off
  }
}
nodelist {
  node {
    name: xen-nfs01
    nodeid: 1
    ring0_addr: 192.168.250.50
  }
  node {
    name: xen-nfs02
    nodeid: 2
    ring0_addr: 192.168.250.51
  }
}
quorum {
  provider: corosync_votequorum
  device {
    model: net
    votes: 1
    net {
      tls: on
      host: xen-quorum
      algorithm: ffsplit
    }
  }
}
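
(one thing that stands out in the qnetd log below is "heuristics = Undefined" — i have not configured any. in case it matters, my reading of corosync-qdevice(8) is that enabling them would look roughly like the sketch below, where the ping target is a placeholder for a gateway reachable only by the healthy partition.)

  device {
    ...
    heuristics {
      mode: on
      exec_ping: /usr/bin/ping -q -c 1 "192.168.250.1"
    }
  }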


logs:
Oct 24 02:33:48 xen-nfs01 corosync[9946]: [TOTEM ] Token has not been received in 750 ms
Oct 24 02:33:48 xen-nfs01 corosync[9946]: [TOTEM ] A processor failed, forming new configuration.
Oct 24 02:33:49 xen-nfs01 corosync[9946]: [VOTEQ ] waiting for quorum device Qdevice poll (but maximum for 30000 ms)
Oct 24 02:33:49 xen-nfs01 corosync[9946]: [TOTEM ] A new membership (1:1388) was formed. Members left: 2
Oct 24 02:33:49 xen-nfs01 corosync[9946]: [TOTEM ] Failed to receive the leave message. failed: 2
Oct 24 02:33:49 xen-nfs01 corosync[9946]: [CPG ] downlist left_list: 1 received
Oct 24 02:33:49 xen-nfs01 pacemaker-attrd[10527]: notice: Node xen-nfs02 state is now lost
Oct 24 02:33:49 xen-nfs01 pacemaker-attrd[10527]: notice: Removing all xen-nfs02 attributes for peer loss
Oct 24 02:33:49 xen-nfs01 pacemaker-attrd[10527]: notice: Purged 1 peer with id=2 and/or uname=xen-nfs02 from the membership cache
Oct 24 02:33:49 xen-nfs01 pacemaker-fenced[10525]: notice: Node xen-nfs02 state is now lost
Oct 24 02:33:49 xen-nfs01 pacemaker-fenced[10525]: notice: Purged 1 peer with id=2 and/or uname=xen-nfs02 from the membership cache
Oct 24 02:33:49 xen-nfs01 pacemaker-based[10524]: notice: Node xen-nfs02 state is now lost
Oct 24 02:33:49 xen-nfs01 pacemaker-based[10524]: notice: Purged 1 peer with id=2 and/or uname=xen-nfs02 from the membership cache
Oct 24 02:33:49 xen-nfs01 pacemaker-controld[10529]: warning: Stonith/shutdown of node xen-nfs02 was not expected
Oct 24 02:33:49 xen-nfs01 pacemaker-controld[10529]: notice: State transition S_IDLE -> S_POLICY_ENGINE

Oct 24 02:34:12 xen-nfs01 corosync[9946]:   [QUORUM] Members[1]: 1
Oct 24 02:34:12 xen-nfs01 corosync[9946]: [MAIN ] Completed service synchronization, ready to provide service.
Oct 24 02:34:12 xen-nfs01 pacemakerd[10522]: notice: Node xen-nfs02 state is now lost
Oct 24 02:34:12 xen-nfs01 pacemaker-controld[10529]: notice: Node xen-nfs02 state is now lost
Oct 24 02:34:12 xen-nfs01 pacemaker-controld[10529]: warning: Stonith/shutdown of node xen-nfs02 was not expected
Oct 24 02:34:13 xen-nfs01 pacemaker-schedulerd[10528]: notice: Calculated transition 411, saving inputs in /var/lib/pacemaker/pengine/pe-input-1278.bz2
Oct 24 02:34:13 xen-nfs01 pacemaker-controld[10529]: notice: Transition 411 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-1278.bz2): Complete
Oct 24 02:34:13 xen-nfs01 pacemaker-controld[10529]: notice: State transition S_TRANSITION_ENGINE -> S_IDLE



Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug Client ::ffff:192.168.250.50:38362 (cluster xen-nfs01_xen-nfs02, node_id 1) sent membership node list.
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug msg seq num = 61
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug ring id = (1.56c)
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug heuristics = Undefined
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug node list:
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug node_id = 1, data_center_id = 0, node_state = not set
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug ffsplit: Membership for cluster xen-nfs01_xen-nfs02 is not yet stable
Oct 24 02:33:49 xen-quorum corosync-qnetd[3353]: Oct 24 02:33:49 debug Algorithm result vote is Wait for reply

Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 warning Client ::ffff:192.168.250.51:36628 doesn't sent any message during 20000ms. Disconnecting
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug Client ::ffff:192.168.250.51:36628 (init_received 1, cluster xen-nfs01_xen-nfs02, node_id 2) disconnect
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug ffsplit: Membership for cluster xen-nfs01_xen-nfs02 is now stable
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug ffsplit: Quorate partition selected
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug node list:
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug node_id = 1, data_center_id = 0, node_state = not set
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug ffsplit: No client gets NACK
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug Sending vote info to client ::ffff:192.168.250.50:38362 (cluster xen-nfs01_xen-nfs02, node_id 1)
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug msg seq num = 30
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug vote = ACK
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug Client ::ffff:192.168.250.50:38362 (cluster xen-nfs01_xen-nfs02, node_id 1) replied back to vote info message
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug msg seq num = 30
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug ffsplit: All ACK votes sent for cluster xen-nfs01_xen-nfs02
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug Client ::ffff:192.168.250.50:38362 (cluster xen-nfs01_xen-nfs02, node_id 1) sent quorum node list.
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug msg seq num = 62
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug quorate = 1
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug node list:
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug node_id = 2, data_center_id = 0, node_state = dead
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug node_id = 1, data_center_id = 0, node_state = member
Oct 24 02:34:12 xen-quorum corosync-qnetd[3353]: Oct 24 02:34:12 debug Algorithm result vote is No change


root@xen-nfs02:~# date; service networking stop; journalctl -f
Thu Oct 24 02:33:47 UTC 2019
-- Logs begin at Mon 2019-10-14 21:10:14 UTC. --
Oct 24 02:33:21 xen-nfs02 pacemakerd[32123]:  notice: Quorum acquired
Oct 24 02:33:21 xen-nfs02 pacemakerd[32123]: notice: Node xen-nfs01 state is now member
Oct 24 02:33:21 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_IDLE -> S_INTEGRATION
Oct 24 02:33:21 xen-nfs02 pacemaker-controld[32130]: warning: Another DC detected: xen-nfs01 (op=noop)
Oct 24 02:33:21 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_ELECTION -> S_RELEASE_DC
Oct 24 02:33:21 xen-nfs02 pacemaker-attrd[32128]: notice: Detected another attribute writer (xen-nfs01), starting new election
Oct 24 02:33:21 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_PENDING -> S_NOT_DC
Oct 24 02:33:47 xen-nfs02 systemd[1]: Stopping Raise network interfaces...
Oct 24 02:33:47 xen-nfs02 systemd[1]: networking.service: Succeeded.
Oct 24 02:33:47 xen-nfs02 systemd[1]: Stopped Raise network interfaces.
Oct 24 02:33:48 xen-nfs02 corosync[30190]: [TOTEM ] Token has not been received in 750 ms
Oct 24 02:33:48 xen-nfs02 corosync[30190]: [TOTEM ] A processor failed, forming new configuration.
Oct 24 02:33:48 xen-nfs02 corosync[30190]: [KNET ] link: host: 1 link: 0 is down
Oct 24 02:33:48 xen-nfs02 corosync[30190]: [KNET ] host: host: 1 (passive) best link: 0 (pri: 1)
Oct 24 02:33:48 xen-nfs02 corosync[30190]: [KNET ] host: host: 1 has no active links
Oct 24 02:33:49 xen-nfs02 corosync[30190]: [VOTEQ ] waiting for quorum device Qdevice poll (but maximum for 30000 ms)
Oct 24 02:33:49 xen-nfs02 corosync[30190]: [TOTEM ] A new membership (2:1388) was formed. Members left: 1
Oct 24 02:33:49 xen-nfs02 corosync[30190]: [TOTEM ] Failed to receive the leave message. failed: 1
Oct 24 02:33:49 xen-nfs02 corosync[30190]: [CPG ] downlist left_list: 1 received
Oct 24 02:33:49 xen-nfs02 pacemaker-attrd[32128]: notice: Lost attribute writer xen-nfs01
Oct 24 02:33:49 xen-nfs02 pacemaker-attrd[32128]: notice: Node xen-nfs01 state is now lost
Oct 24 02:33:49 xen-nfs02 pacemaker-attrd[32128]: notice: Removing all xen-nfs01 attributes for peer loss
Oct 24 02:33:49 xen-nfs02 pacemaker-attrd[32128]: notice: Purged 1 peer with id=1 and/or uname=xen-nfs01 from the membership cache
Oct 24 02:33:49 xen-nfs02 pacemaker-based[32125]: notice: Node xen-nfs01 state is now lost
Oct 24 02:33:49 xen-nfs02 pacemaker-based[32125]: notice: Purged 1 peer with id=1 and/or uname=xen-nfs01 from the membership cache
Oct 24 02:33:49 xen-nfs02 pacemaker-fenced[32126]: notice: Node xen-nfs01 state is now lost
Oct 24 02:33:49 xen-nfs02 pacemaker-fenced[32126]: notice: Purged 1 peer with id=1 and/or uname=xen-nfs01 from the membership cache
Oct 24 02:33:49 xen-nfs02 pacemaker-controld[32130]: notice: Our peer on the DC (xen-nfs01) is dead
Oct 24 02:33:49 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_NOT_DC -> S_ELECTION
Oct 24 02:34:00 xen-nfs02 corosync-qdevice[31543]: Server didn't send echo reply message on time
Oct 24 02:34:00 xen-nfs02 corosync[30190]: [QUORUM] This node is within the non-primary component and will NOT provide any services.
Oct 24 02:34:00 xen-nfs02 corosync[30190]:   [QUORUM] Members[1]: 2
Oct 24 02:34:00 xen-nfs02 corosync[30190]: [MAIN ] Completed service synchronization, ready to provide service.
Oct 24 02:34:00 xen-nfs02 pacemakerd[32123]:  warning: Quorum lost
Oct 24 02:34:00 xen-nfs02 pacemaker-controld[32130]:  warning: Quorum lost
Oct 24 02:34:00 xen-nfs02 pacemaker-controld[32130]: notice: Node xen-nfs01 state is now lost
Oct 24 02:34:00 xen-nfs02 pacemakerd[32123]: notice: Node xen-nfs01 state is now lost
Oct 24 02:34:01 xen-nfs02 pacemaker-attrd[32128]: notice: Recorded local node as attribute writer (was unset)
Oct 24 02:34:01 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_ELECTION -> S_INTEGRATION
Oct 24 02:34:01 xen-nfs02 pacemaker-schedulerd[32129]: warning: Fencing and resource management disabled due to lack of quorum
Oct 24 02:34:01 xen-nfs02 pacemaker-schedulerd[32129]: notice: * Start dummy ( xen-nfs02 ) due to no quorum (blocked)
Oct 24 02:34:01 xen-nfs02 pacemaker-schedulerd[32129]: notice: Calculated transition 4, saving inputs in /var/lib/pacemaker/pengine/pe-input-471.bz2
Oct 24 02:34:01 xen-nfs02 pacemaker-controld[32130]: notice: Transition 4 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-471.bz2): Complete
Oct 24 02:34:01 xen-nfs02 pacemaker-controld[32130]: notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Oct 24 02:34:04 xen-nfs02 corosync-qdevice[31543]: Can't connect to qnetd host (-5980): Network address is presently unreachable
Oct 24 02:34:12 xen-nfs02 corosync-qdevice[31543]: Connect timeout