Hello All,
I am having an issue in a 2-node heartbeat cluster where the primary
node's resources are relinquished if the secondary node's heartbeat
service is stopped. Below is the ha.cf file:
logfacility local0
logfile /var/log/ha-log
debugfile /var/log/ha-debug
udpport 694
keepalive 2 # 2 second
deadtime 20
warntime 10
initdead 40
ucast eth5 192.168.0.2
node node1
node node2
auto_failback on
watchdog /dev/watchdog
As you can see we are using "auto_failback on" which *should* only
pertain to when the main server is taken down and subsequently brought
up as to re-assume control of the primary resources. However I have
noticed several forum posts regarding this setting causing unexplained
behavior, although not specifically the behavior we are seeing.
Is it possible this setting is the culprit or would there be any other
reason that stopping heartbeat on the secondary node would cause the
primary to drop its resources?
Any help is greatly appreciated!
- Chris
--
Regards,
Chris
Chris Huber-Lantz
ScratchSpace Inc.
(831) 621-7928
http://www.scratchspace.com
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems