Lars Marowsky-Bree escribió:
On 2008-08-29T17:23:27, Adrian Chapela <[EMAIL PROTECTED]> wrote:
Hello,
I am starting again to config Heartbeat, because all of my old
configurations don't satisfied me. Now I compiled the last stable version
(2.1.4) and then I started a node. This node restart every time I started:
Message from [EMAIL PROTECTED] at Fri Aug 29 17:20:18 2008 ...
debian2 heartbeat: [3707]: EMERG: Rebooting system. Reason:
/usr/lib/heartbeat/cib
What could be the reason ?
There will be logs before that - something prevents the CIB from
starting up cleanly, and heartbeat's failfast logic causes it to abandon
the node.
This error is a syslog error, is a screen output.
In ha-log there is the next:
cib[3717]: 2008/08/29_17:51:26 info: retrieveCib: Reading cluster
configuration from: /var/lib/heartbeat/crm/cib.xml (digest:
/var/lib/heartbeat/crm/cib.xml.sig)
cib[3717]: 2008/08/29_17:51:26 WARN: retrieveCib: Cluster configuration
not found: /var/lib/heartbeat/crm/cib.xml
cib[3717]: 2008/08/29_17:51:26 WARN: readCibXmlFile: Primary
configuration corrupt or unusable, trying backup...
cib[3717]: 2008/08/29_17:51:26 info: retrieveCib: Reading cluster
configuration from: /var/lib/heartbeat/crm/cib.xml.last (digest:
/var/lib/heartbeat/crm/cib.xml.sig.last)
cib[3717]: 2008/08/29_17:51:26 WARN: retrieveCib: Cluster configuration
not found: /var/lib/heartbeat/crm/cib.xml.last
cib[3717]: 2008/08/29_17:51:26 WARN: readCibXmlFile: Continuing with an
empty configuration.
cib[3717]: 2008/08/29_17:51:26 WARN: readCibXmlFile: No value for
admin_epoch was specified in the configuration.
cib[3717]: 2008/08/29_17:51:26 WARN: readCibXmlFile: The reccomended
course of action is to shutdown, run crm_verify and fix any errors it
reports.
cib[3717]: 2008/08/29_17:51:26 WARN: readCibXmlFile: We will default to
zero and continue but may get confused about which configuration to use
if multiple nodes are powered up at the same time.
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <cib generated="true" admin_epoch="0" epoch="0" num_updates="0">
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <configuration>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <crm_config/>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <nodes/>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <resources/>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <constraints/>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] </configuration>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] <status/>
cib[3717]: 2008/08/29_17:51:26 info: log_data_element: readCibXmlFile:
[on-disk] </cib>
ccm[3716]: 2008/08/29_17:51:26 info: Hostname: debianquagga2
cib[3717]: 2008/08/29_17:51:26 notice: readCibXmlFile: Enabling DTD
validation on the existing (sane) configuration
cib[3717]: 2008/08/29_17:51:26 info: startCib: CIB Initialization
completed successfully
attrd[3720]: 2008/08/29_17:51:26 info: register_heartbeat_conn:
Hostname: debianquagga2
attrd[3720]: 2008/08/29_17:51:26 info: register_heartbeat_conn: UUID:
9dc2e49f-8fee-494f-9744-b8e511e2f525
cib[3717]: 2008/08/29_17:51:26 info: register_heartbeat_conn: Hostname:
debianquagga2
cib[3717]: 2008/08/29_17:51:26 info: register_heartbeat_conn: UUID:
9dc2e49f-8fee-494f-9744-b8e511e2f525
cib[3717]: 2008/08/29_17:51:26 info: ccm_connect: Registering with CCM...
cib[3717]: 2008/08/29_17:51:26 WARN: ccm_connect: CCM Activation failed
cib[3717]: 2008/08/29_17:51:26 WARN: ccm_connect: CCM Connection failed
1 times (30 max)
crmd[3721]: 2008/08/29_17:51:27 WARN: do_cib_control: Couldn't complete
CIB registration 1 times... pause and retry
crmd[3721]: 2008/08/29_17:51:27 info: crmd_init: Starting crmd's mainloop
mgmtd[3722]: 2008/08/29_17:51:27 info: login to cib: 1, ret:-10
mgmtd[3722]: 2008/08/29_17:51:28 info: login to cib: 2, ret:-10
crmd[3721]: 2008/08/29_17:51:29 info: crm_timer_popped: Wait Timer
(I_NULL) just popped!
mgmtd[3722]: 2008/08/29_17:51:29 info: login to cib: 3, ret:-10
cib[3717]: 2008/08/29_17:51:29 info: ccm_connect: Registering with CCM...
cib[3717]: 2008/08/29_17:51:29 WARN: ccm_connect: CCM Activation failed
cib[3717]: 2008/08/29_17:51:29 WARN: ccm_connect: CCM Connection failed
2 times (30 max)
crmd[3721]: 2008/08/29_17:51:30 WARN: do_cib_control: Couldn't complete
CIB registration 2 times... pause and retry
mgmtd[3722]: 2008/08/29_17:51:30 info: login to cib: 4, ret:-10
ccm[3716]: 2008/08/29_17:51:30 info: Break tie for 2 nodes cluster
ccm[3716]: 2008/08/29_17:51:30 info: G_main_add_SignalHandler: Added
signal handler for signal 15
mgmtd[3722]: 2008/08/29_17:51:31 info: login to cib failed
mgmtd[3722]: 2008/08/29_17:51:31 ERROR: Can't initialize management
library.Shutting down.(-1)
heartbeat[3706]: 2008/08/29_17:51:31 WARN: Managed
/usr/lib/heartbeat/mgmtd -v process 3722 exited with return code 1.
heartbeat[3706]: 2008/08/29_17:51:31 ERROR: Respawning client
"/usr/lib/heartbeat/mgmtd -v":
heartbeat[3706]: 2008/08/29_17:51:31 info: Starting child client
"/usr/lib/heartbeat/mgmtd -v" (0,0)
crmd[3721]: 2008/08/29_17:51:32 info: crm_timer_popped: Wait Timer
(I_NULL) just popped!
heartbeat[3724]: 2008/08/29_17:51:32 info: Starting
"/usr/lib/heartbeat/mgmtd -v" as uid 0 gid 0 (pid 3724)
mgmtd[3724]: 2008/08/29_17:51:32 info: G_main_add_SignalHandler: Added
signal handler for signal 15
mgmtd[3724]: 2008/08/29_17:51:32 WARN: Core dumps could be lost if
multiple dumps occur.
mgmtd[3724]: 2008/08/29_17:51:32 WARN: Consider setting non-default
value in /proc/sys/kernel/core_pattern (or equivalent) for maximum
supportability
mgmtd[3724]: 2008/08/29_17:51:32 WARN: Consider setting
/proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum
supportability
mgmtd[3724]: 2008/08/29_17:51:32 info: G_main_add_SignalHandler: Added
signal handler for signal 10
mgmtd[3724]: 2008/08/29_17:51:32 info: G_main_add_SignalHandler: Added
signal handler for signal 12
mgmtd[3724]: 2008/08/29_17:51:32 info: init_crm
mgmtd[3724]: 2008/08/29_17:51:32 info: login to cib: 0, ret:-10
Something incompatible in compilation ??
I have no cib.xml configuration now, I am only testing if it starts OK.
Strange; if there's no CIB installed, this shouldn't happen.
Let us know what's in the logs before this.
Regards,
Lars
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems