On Fri, Apr 30, 2010 at 08:32:22AM -0500, Brodie, Kent wrote:
> >  What does "ps axf" say?
> 
> Andrew-- I wish I knew.   As I said, the system "locks up" when this
> happens (shutting down corosync).   By locks up, I mean, locks up.   I
> can't log in, I can't control-c the session attempting the shutdown, I
> can't do ANYTHING even if I had another idle terminal window right there
> ready to check.     
> 
> Is there a debug-log setting I can set, so that the next few time(s) I
> shutdown, I can prepare the system for capturing something useful?

sounds like a POSIX realtime process running into an endless loop
on all available cores.  or a kernel panic.

enable serial console, or netconsole or both.

put a (ulimited) mlock'ed realtime busybox getty on the serial port ;-)

or realtime your other session (and all its parents!) in time.
or un-realtime corosync.

for example heartbeat setrlimit's itself for cpu every minute,
so it will be killed in case it runs into an endless loop...
does corosync behave similarly?

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to