In a two-node cluster I am trying to use a cloned IP address with a cloned Bind 9 instance, in an active-active way. Why? Because simple IP failover does not work well with Bind, as it only answers queries on the addresses that are bound to the NIC when starting up (I know about Bind's "interface-interval" setting, but the minimum of one minute is far too long). Using Ubuntu 12.04.2, Corosync 1.4.2 and Pacemaker 1.1.6.

So my configuration sees to it that the cloned address is set on both nodes and Bind is started afterwards (op params omitted for readability):

node dns01
node dns02
primitive p_bind9 lsb:bind9
primitive p_ip_service_ns ocf:heartbeat:IPaddr2 \
  params ip="192.168.114.17" cidr_netmask="24" nic="eth0" \
    clusterip_hash="sourceip-sourceport"
clone cl_bind9 p_bind9 \
  meta interleave="false"
clone cl_ip_service_ns p_ip_service_ns \
  meta globally-unique="true" clone-max="2" \
    clone-node-max="2" interleave="true"
order o_ip_before_bind9 inf: cl_ip_service_ns cl_bind9

(suggestions to improve or correct this configuration gladly accepted)

After Corosync starts up the first time everything seems correct, I can see the cluster/cloned/service IP address and the CLUSTERIP iptables rules on both nodes.

But after putting dns01 in standby and then bringing it online again the cloned address is no longer present on dns01, only on dns02. iptables rules are also gone from dns01.

Then, putting dns02 into standby the IP address is moved to dns01, and after going online again no longer present on dns01 (neither are iptables rules).

So the IP address is moved between the nodes, each move accompanied by a restart of the Bind service (cl_bind9/p_bind9).

All of this doesn't seem right to me. Shouldn't the cloned IP address always be present on *both* nodes when they are online?

Andreas

PS: In the end this configuration works since the Bind 9 service is always available to answer queries on the cluster address (as long as there is one node online) but it seems that the Bind 9 clones are restarted too often and too liberally when things change. This, however, may be a separate issue, possibly related to the order directive and the interleave meta params.

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to