On 29 Feb 2012, at 14:28, Marcus Bointon wrote:

> Well since none of it's working, I have no problem throwing it all away and 
> starting again!

My crashes have gone away, but I have other issues with the same server. The 
corosync service starts, and is found by the other node:

============
Last updated: Wed Feb 29 15:07:55 2012
Last change: Wed Feb 29 15:00:10 2012 via crmd on www5
Stack: openais
Current DC: www5 - partition with quorum
Version: 1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c
2 Nodes configured, 2 expected votes
0 Resources configured.
============

Node www4: pending
Online: [ www5 ]

Running 'crm status' on www4 just gives "Connection to cluster failed: 
connection failed". In the log I have these lines from cib:

Feb 29 15:00:18 www4 cib: [24712]: info: crm_log_init_worker: Changed active 
directory to /var/lib/heartbeat/cores/hacluster
Feb 29 15:00:18 www4 cib: [24712]: info: retrieveCib: Reading cluster 
configuration from: /var/lib/heartbeat/crm/cib.xml (diges
t: /var/lib/heartbeat/crm/cib.xml.sig)
Feb 29 15:00:18 www4 cib: [24712]: WARN: retrieveCib: Cluster configuration not 
found: /var/lib/heartbeat/crm/cib.xml
Feb 29 15:00:18 www4 cib: [24712]: WARN: readCibXmlFile: Primary configuration 
corrupt or unusable, trying backup...
Feb 29 15:00:18 www4 cib: [24712]: WARN: readCibXmlFile: Continuing with an 
empty configuration.
Feb 29 15:00:18 www4 cib: [24712]: info: validate_with_relaxng: Creating RNG 
parser context
Feb 29 15:00:18 www4 corosync[24705]:   [pcmk  ] info: spawn_child: Forked 
child 24712 for process cib
Feb 29 15:00:18 www4 cib: [24712]: info: startCib: CIB Initialization completed 
successfully
Feb 29 15:00:18 www4 cib: [24712]: info: get_cluster_type: Cluster type is: 
'openais'
Feb 29 15:00:18 www4 cib: [24712]: notice: crm_cluster_connect: Connecting to 
cluster infrastructure: classic openais (with plu
gin)
Feb 29 15:00:18 www4 cib: [24712]: info: init_ais_connection_classic: Creating 
connection to our Corosync plugin
Feb 29 15:00:18 www4 cib: [24712]: info: init_ais_connection_classic: 
Connection to our AIS plugin (9) failed: Library error (2
)
Feb 29 15:00:18 www4 cib: [24712]: CRIT: cib_init: Cannot sign in to the 
cluster... terminating

cib appears to be fine on www5. I've never touched anything in 
/var/lib/heartbeat/crm - this is a completely vanilla config, though it may be 
that there are remnants of the old heartbeat config (which was only on www4) 
causing this. Can I just copy the contents of that folder from the other server?

Marcus
-- 
Marcus Bointon
Synchromedia Limited: Creators of http://www.smartmessages.net/
UK info@hand CRM solutions
mar...@synchromedia.co.uk | http://www.synchromedia.co.uk/



_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to