Hello.
corosync-1.4.1
pacemaker-1.1.5
pacemaker runs with "ver: 1"
I run on strange problem. Hope someone can help me.
I have 9 nodes cluster. All was fine till I need to reboot a node.
After reboot it don`t want to come back to cluster with "not in our
membership" error.
I happens with other 2 nodes on this cluster.
Network is fine.
rm -rf /var/lib/heartbeat/crm/* not helps.
I ask for help at IRC and we do this:
I run one node with debug for few sec and I strace cib process. Both in
links below.
In debug logs we found "cib not connected" error but can`t understand
reason of this.
Debug logs: http://dl.dropbox.com/u/1932700/corosync.log.debug.gz
cib strace: http://dl.dropbox.com/u/1932700/cib-starce.log.gz
P.S. I have equal problem on other cluster and "fix" it with shutdown
all nodes(corosync + pacemaker), rm -rf /var/lib/heartbeat/crm/* ,
startup all nodes. But it`s not really an option. :-)
--
Best regards,
Proskurin Kirill
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker