Hi Lars
service heartbeat status
I checked the logs of both heartbeats and the last thing in those logs
(between the time heartbeat was stopped), is the daily memory stats of
heartbeat but nothing else until I restarted heartbeat on the node it
was stopped on.
I did not actually check for heartbeat processes, could it be somehow
that the "service heartbeat status" reported incorrectly somehow?
Lars Marowsky-Bree wrote:
On 2008-09-11T09:41:24, Jeffery Soo <[EMAIL PROTECTED]> wrote:
Hi Lars
I wish I could not believe it either :)
The services were running fine but the heartbeat service itself (I mean
literally the heartbeat service, but the services heartbeat is supposed to
control were still running fine and active on the same node) was stopped at
some point and I don't understand why.
I am using 2.1.3-3.el5.centos
Thanks for anything you guys can think of.
How did you arrive at the conclusion that heartbeat itself was stopped?
The processes can't be gone completely without there being something at
least in the logs of the other node. That is ... "impossible" is always
difficult to claim, but it would involve never-before-seen byzantine
failures of not one, but two nodes and several processes.
Regards,
Lars
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems