[Pacemaker] IPaddr2 cloned address doesn't survive node standby

Andreas Ntaflos Fri, 17 May 2013 12:31:26 -0700

In a two-node cluster I am trying to use a cloned IP address with a cloned Bind 9 instance, in an active-active way. Why? Because simple IP failover does not work well with Bind, as it only answers queries on the addresses that are bound to the NIC when starting up (I know about Bind's "interface-interval" setting, but the minimum of one minute is far too long). I am using Ubuntu 12.04.2, Corosync 1.4.2 and Pacemaker 1.1.6.

So my configuration sees to it that the cloned address is set on both nodes and Bind is started afterwards (op params omitted for readability):


node dns01
node dns02
primitive p_bind9 lsb:bind9
primitive p_ip_service_ns ocf:heartbeat:IPaddr2 \
  params ip="192.168.114.17" cidr_netmask="24" nic="eth0" \
    clusterip_hash="sourceip-sourceport"
clone cl_bind9 p_bind9 \
  meta interleave="false"
clone cl_ip_service_ns p_ip_service_ns \
  meta globally-unique="true" clone-max="2" \
    clone-node-max="2" interleave="true"
order o_ip_before_bind9 inf: cl_ip_service_ns cl_bind9

(suggestions to improve or correct this configuration gladly accepted)

After Corosync starts up the first time everything seems correct: I can see the cluster/cloned/service IP address and the CLUSTERIP iptables rules on both nodes.

But after putting dns01 in standby and then bringing it online again the cloned address is no longer present on dns01, only on dns02. The iptables rules are also gone from dns01.

Then, when dns02 is put into standby, the IP address is moved to dns01, and after dns02 goes online again the address is no longer present on dns01 (neither are the iptables rules).

So the IP address is moved between the nodes, each move accompanied by a restart of the Bind service (cl_bind9/p_bind9).
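
For anyone who wants to reproduce this, the sequence described above can be checked roughly as follows. This is only a sketch: it assumes the crm shell is used for the standby/online steps, that the CLUSTERIP rule sits in the INPUT chain, and that the checks are run as root on each node. The helper names has_cluster_ip and check_node are made up for illustration; the address and NIC come from the configuration above.

```shell
#!/bin/sh
# Sketch only: values taken from the configuration above.
CLUSTER_IP="192.168.114.17"
NIC="eth0"

# Hypothetical helper: succeeds if the `ip -4 -o addr show` output on stdin
# contains the cluster address.
has_cluster_ip() {
    grep -qF "inet ${CLUSTER_IP}/"
}

# Hypothetical helper: run as root on a node to report whether the cloned
# address and a CLUSTERIP rule are currently present there.
check_node() {
    if ip -4 -o addr show dev "$NIC" | has_cluster_ip; then
        echo "address: present"
    else
        echo "address: missing"
    fi
    # Assumption: the CLUSTERIP rule is installed in the INPUT chain.
    if iptables -S INPUT | grep -q CLUSTERIP; then
        echo "CLUSTERIP rule: present"
    else
        echo "CLUSTERIP rule: missing"
    fi
}

# Sequence described above (run the crm commands on either node, then
# call check_node on dns01 and on dns02 after each step):
#   crm node standby dns01
#   crm node online dns01
#   crm node standby dns02
#   crm node online dns02
```

Sourcing the file and calling check_node on each node after every step should show the address and rule appearing on only one node at a time, as described.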

All of this doesn't seem right to me. Shouldn't the cloned IP address always be present on *both* nodes when they are online?


Andreas

PS: In the end this configuration works since the Bind 9 service is always available to answer queries on the cluster address (as long as there is one node online) but it seems that the Bind 9 clones are restarted too often and too liberally when things change. This, however, may be a separate issue, possibly related to the order directive and the interleave meta params.


