On 4/9/07, Kevin Jamieson <[EMAIL PROTECTED]> wrote:
kisalay wrote:

> I have a 2 node 2.0.8 Linux HA setup.
> I have observed that when stop is issued on my setup as soon as the start
> returns, the stop hangs indefinitely, and the only way to stop heartbeat is
> to do killall.

Or wait for the really long timeout.

This was fixed last week, IIRC.


I've noticed similar behaviour on heartbeat 2.0.7, though I haven't gotten
around to filing a Bugzilla report on it.

In the situation I've observed, it looks like a race between the shutdown
of the heartbeat parent process and either the setpgid() call or the
installation of the SIGTERM signal handler in a newly created child
process. The log indicates that heartbeat is killing the crmd, but the
crmd appears never to receive a SIGTERM:

Mar 15 21:52:22 main heartbeat: [4499]: info: Starting child client "/usr/lib/heartbeat/crmd" (90,90)
Mar 15 21:52:22 main heartbeat: [4499]: info: killing /usr/lib/heartbeat/crmd process group 4586 with signal 15
Mar 15 21:52:22 main heartbeat: [4586]: info: Starting "/usr/lib/heartbeat/crmd" as uid 90 gid 90 (pid 4586)
Mar 15 21:52:22 main crmd: [4586]: info: init_start:main.c Starting crmd
Mar 15 21:52:22 main crmd: [4586]: info: G_main_add_SignalHandler: Added signal handler for signal 15
Mar 15 21:52:22 main crmd: [4586]: info: G_main_add_TriggerHandler: Added signal manual handler
Mar 15 21:52:22 main crmd: [4586]: info: G_main_add_SignalHandler: Added signal handler for signal 17

Kevin
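
For readers following the thread: the kind of race described above (a
parent signalling a child's process group before the child has called
setpgid() or installed its SIGTERM handler) is commonly closed by blocking
the signal before fork() and calling setpgid() in both parent and child.
The sketch below is only an illustration of that general technique, written
in Python for brevity. It is not heartbeat's code and not necessarily how
the fix mentioned above was made; the function name spawn_then_kill is
hypothetical.

```python
import os
import signal
import time


def spawn_then_kill():
    """Fork a child and immediately SIGTERM its process group.

    Hypothetical helper for illustration only. Returns True if the child
    saw the SIGTERM in its own handler and exited cleanly.
    """
    # Block SIGTERM before forking. The child inherits the mask, so a
    # signal that arrives before its handler is installed stays pending
    # instead of being handled by the default disposition.
    old_mask = signal.pthread_sigmask(signal.SIG_BLOCK, {signal.SIGTERM})
    pid = os.fork()

    if pid == 0:
        # Child: join its own process group, install the handler, and
        # only then unblock SIGTERM.
        try:
            os.setpgid(0, 0)
            received = []
            signal.signal(signal.SIGTERM, lambda signum, frame: received.append(signum))
            signal.pthread_sigmask(signal.SIG_UNBLOCK, {signal.SIGTERM})
            deadline = time.monotonic() + 5
            while not received and time.monotonic() < deadline:
                time.sleep(0.01)
            os._exit(0 if received else 1)
        except BaseException:
            os._exit(2)

    try:
        # Parent: also call setpgid() for the child, so the process group
        # exists before killpg() no matter which side runs first.
        os.setpgid(pid, pid)
        os.killpg(pid, signal.SIGTERM)
    finally:
        # Restore the parent's original signal mask.
        signal.pthread_sigmask(signal.SIG_SETMASK, old_mask)

    _, status = os.waitpid(pid, 0)
    return os.WIFEXITED(status) and os.WEXITSTATUS(status) == 0
```

Calling setpgid() on both sides means neither ordering of parent and child
leaves a window in which the group does not yet exist, and the inherited
signal mask means an early SIGTERM is held until the handler is ready.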

_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
