[Linux-HA] Stopping heartbeat on secondary node causes primary to fail

Chris Huber-Lantz Mon, 08 Aug 2011 00:01:19 -0700

Hello All,

I am having an issue in a 2-node heartbeat cluster where the primary 
node's resources are relinquished if the secondary node's heartbeat 
service is stopped. Below is the ha.cf file:


logfacility local0
logfile /var/log/ha-log
debugfile /var/log/ha-debug
udpport 694
keepalive 2 # 2 second
deadtime 20
warntime 10
initdead 40
ucast eth5 192.168.0.2
node node1
node node2
auto_failback on
watchdog /dev/watchdog

As you can see, we are using "auto_failback on", which *should* only 
matter when the primary server goes down and is later brought back up, 
so that it re-assumes control of its resources. However, I have 
noticed several forum posts reporting unexplained behavior with this 
setting, although not specifically the behavior we are seeing.

Is it possible this setting is the culprit or would there be any other 
reason that stopping heartbeat on the secondary node would cause the 
primary to drop its resources?
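
To help rule out the timing directives as a factor, the values in the 
ha.cf above can be sanity-checked with a small script. This is only a 
minimal sketch and not part of heartbeat: the helper names parse_ha_cf 
and check_timings are made up for illustration, it assumes the timing 
values are plain integer seconds, and the "initdead at least twice 
deadtime" rule is the commonly cited guideline rather than a hard 
requirement.

```python
# Sketch only: parse ha.cf-style directives and sanity-check the timings.
# The directives below are copied from the ha.cf in this post.
HA_CF = """\
keepalive 2 # 2 second
deadtime 20
warntime 10
initdead 40
ucast eth5 192.168.0.2
node node1
node node2
auto_failback on
"""


def parse_ha_cf(text):
    """Return {directive: [list of argument lists]}; '#' starts a comment."""
    directives = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        name, *args = line.split()
        directives.setdefault(name, []).append(args)
    return directives


def check_timings(directives):
    """Return warnings about keepalive/warntime/deadtime/initdead ordering."""

    def seconds(name):
        # Assumption: value is a plain integer number of seconds.
        return int(directives[name][0][0])

    warnings = []
    if not seconds("keepalive") < seconds("warntime") < seconds("deadtime"):
        warnings.append("expected keepalive < warntime < deadtime")
    if seconds("initdead") < 2 * seconds("deadtime"):
        warnings.append("initdead is usually at least twice deadtime")
    return warnings


cfg = parse_ha_cf(HA_CF)
print("timing warnings:", check_timings(cfg))
print("nodes:", [args[0] for args in cfg["node"]])
print("auto_failback:", cfg["auto_failback"][0][0])
```

With the values above this reports no timing warnings, so the timing 
directives at least look consistent with each other.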

Any help is greatly appreciated!

- Chris

-- 
Regards,
     Chris

Chris Huber-Lantz
ScratchSpace Inc.
(831) 621-7928
http://www.scratchspace.com

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
