In a nut shell whenever my secondary goes down the primary also goes down. When i mean go down , i mean when i stop heartbeat by doing /etc/init.d/heartbeat stop or by simply rebooting the machine for a specific reason....

Its hard to get log entries as the primary is a production machine and i cannot work on it during normal work hours as everything will go down.... But here is info i can give you.... Once i have more info from your help to look for i will do some checks after hours....

I am using Debian Etch
I am using heartbeat version 2.0.7-2 ( as per Debian apt-get )
I am also using DRBD version 8.0.7-1~bpo40+1 ( as per Debian apt-get)

I am using Heartbeat in NON xml

the ha.conf and haresources files are the same on both machines. I know i am a bit behind in the version of heartbeat.... but not sure why it would do this... it was working fine at one point and then this issue started happening....

Any ideas, what should i look for... as i mentioned i will try and see if i can get more loggin...

Below please find my conf files...

ha.conf
----------

logfacility     daemon        # This is deprecated
keepalive 2                   # Interval between heartbeat (HB) packets.
deadtime 60                   # How quickly HB determines a dead node.
warntime 5                    # Time HB will issue a late HB.
initdead 120 # Time delay needed by HB to report a dead node. udpport 694 # UDP port HB uses to communicate between nodes.
bcast eth0 eth1
coredumps true
auto_failback on # Auto promotion of primary node upon return to cluster.
node    joe      # Node name must be same as uname -n.
node    stewie      # Node name must be same as uname -n.
respawn hacluster /usr/lib/heartbeat/ipfail
use_logd yes                  # Use system logging.
logfile /var/log/hb.log       # Heartbeat logfile.
debugfile /var/log/heartbeat-debug.log # Debugging logfile.

haresources file
----------------

joe IPaddr::xx.xx.xx.150 drbddisk::mail drbddisk::web \
Filesystem::/dev/drbd0::/var/mail/virtual::ext3::defaults \
Filesystem::/dev/drbd1::/var/www::ext3::defaults \
postfix dovecot mysql apache2 proftpd \
MailTo::alerts2::BigImportantWebServer

--

Rob Morin
Dido Internet Inc.
Montreal,Canada
http://www.dido.ca
514-990-4444


--

Rob Morin
Dido Internet Inc.
Montreal,Canada
http://www.dido.ca
514-990-4444

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to