On 29 Feb 2012, at 14:28, Marcus Bointon wrote:

> Well since none of it's working, I have no problem throwing it all away and
> starting again!

My crashes have gone away, but I have other issues with the same server.
The corosync service starts, and is found by the other node:

============
Last updated: Wed Feb 29 15:07:55 2012
Last change: Wed Feb 29 15:00:10 2012 via crmd on www5
Stack: openais
Current DC: www5 - partition with quorum
Version: 1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c
2 Nodes configured, 2 expected votes
0 Resources configured.
============

Node www4: pending
Online: [ www5 ]

Running 'crm status' on www4 just gives "Connection to cluster failed:
connection failed".

In the log I have these lines from cib:

Feb 29 15:00:18 www4 cib: [24712]: info: crm_log_init_worker: Changed active directory to /var/lib/heartbeat/cores/hacluster
Feb 29 15:00:18 www4 cib: [24712]: info: retrieveCib: Reading cluster configuration from: /var/lib/heartbeat/crm/cib.xml (digest: /var/lib/heartbeat/crm/cib.xml.sig)
Feb 29 15:00:18 www4 cib: [24712]: WARN: retrieveCib: Cluster configuration not found: /var/lib/heartbeat/crm/cib.xml
Feb 29 15:00:18 www4 cib: [24712]: WARN: readCibXmlFile: Primary configuration corrupt or unusable, trying backup...
Feb 29 15:00:18 www4 cib: [24712]: WARN: readCibXmlFile: Continuing with an empty configuration.
Feb 29 15:00:18 www4 cib: [24712]: info: validate_with_relaxng: Creating RNG parser context
Feb 29 15:00:18 www4 corosync[24705]: [pcmk ] info: spawn_child: Forked child 24712 for process cib
Feb 29 15:00:18 www4 cib: [24712]: info: startCib: CIB Initialization completed successfully
Feb 29 15:00:18 www4 cib: [24712]: info: get_cluster_type: Cluster type is: 'openais'
Feb 29 15:00:18 www4 cib: [24712]: notice: crm_cluster_connect: Connecting to cluster infrastructure: classic openais (with plugin)
Feb 29 15:00:18 www4 cib: [24712]: info: init_ais_connection_classic: Creating connection to our Corosync plugin
Feb 29 15:00:18 www4 cib: [24712]: info: init_ais_connection_classic: Connection to our AIS plugin (9) failed: Library error (2)
Feb 29 15:00:18 www4 cib: [24712]: CRIT: cib_init: Cannot sign in to the cluster... terminating

cib appears to be fine on www5.

I've never touched anything in /var/lib/heartbeat/crm - this is a
completely vanilla config, though it may be that there are remnants of
the old heartbeat config (which was only on www4) causing this. Can I
just copy the contents of that folder from the other server?

Marcus
--
Marcus Bointon
Synchromedia Limited: Creators of http://www.smartmessages.net/
UK info@hand CRM solutions
mar...@synchromedia.co.uk | http://www.synchromedia.co.uk/