In a nut shell whenever my secondary goes down the primary also goes
down. When i mean go down , i mean when i stop heartbeat by doing
/etc/init.d/heartbeat stop or by simply rebooting the machine for a
specific reason....
Its hard to get log entries as the primary is a production machine and i
cannot work on it during normal work hours as everything will go
down.... But here is info i can give you.... Once i have more info from
your help to look for i will do some checks after hours....
I am using Debian Etch
I am using heartbeat version 2.0.7-2 ( as per Debian apt-get )
I am also using DRBD version 8.0.7-1~bpo40+1 ( as per Debian apt-get)
I am using Heartbeat in NON xml
the ha.conf and haresources files are the same on both machines. I know
i am a bit behind in the version of heartbeat.... but not sure why it
would do this... it was working fine at one point and then this issue
started happening....
Any ideas, what should i look for... as i mentioned i will try and see
if i can get more loggin...
Below please find my conf files...
ha.conf
----------
logfacility daemon # This is deprecated
keepalive 2 # Interval between heartbeat (HB) packets.
deadtime 60 # How quickly HB determines a dead node.
warntime 5 # Time HB will issue a late HB.
initdead 120 # Time delay needed by HB to report a dead
node.
udpport 694 # UDP port HB uses to communicate between
nodes.
bcast eth0 eth1
coredumps true
auto_failback on # Auto promotion of primary node upon
return to cluster.
node joe # Node name must be same as uname -n.
node stewie # Node name must be same as uname -n.
respawn hacluster /usr/lib/heartbeat/ipfail
use_logd yes # Use system logging.
logfile /var/log/hb.log # Heartbeat logfile.
debugfile /var/log/heartbeat-debug.log # Debugging logfile.
haresources file
----------------
joe IPaddr::xx.xx.xx.150 drbddisk::mail drbddisk::web \
Filesystem::/dev/drbd0::/var/mail/virtual::ext3::defaults \
Filesystem::/dev/drbd1::/var/www::ext3::defaults \
postfix dovecot mysql apache2 proftpd \
MailTo::alerts2::BigImportantWebServer
--
Rob Morin
Dido Internet Inc.
Montreal,Canada
http://www.dido.ca
514-990-4444
--
Rob Morin
Dido Internet Inc.
Montreal,Canada
http://www.dido.ca
514-990-4444
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems