Hello,
I am running a 3-node Pacemaker cluster (2 "real" nodes and 1 quorum node in standby) on Ubuntu 12.04 server (amd64) with Pacemaker 1.1.8 and Corosync 2.1.0. My cluster configuration is: http://pastebin.com/6TPkWtbt Recently, pengine died on storage0 (where the resources were running) which also happened to be the DC at the time. Consequently, Pacemaker went into recovery mode and released its role as DC, at which point storage1 took over the DC role and migrated the resources away from storage0 and onto storage1. Looking through the logs, it seems like storage0 came back into the cluster before the migration of the resources began: Dec 03 08:31:20 [3165] storage1 crmd: info: peer_update_callback: Client storage0/peer now has status [online] (DC=true) ... Dec 03 08:31:20 [3164] storage1 pengine: notice: LogActions: Start rscXXX (storage1) Thus, why did the migration occur, rather than aborting and having the resources simply remain running on storage0? Here are the logs from each of the nodes: storage0: http://pastebin.com/ZqqnH9uf storage1: http://pastebin.com/rvSLVcZs Thanks, Andrew
_______________________________________________ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org