On Fri, Apr 30, 2010 at 08:32:22AM -0500, Brodie, Kent wrote:
> > What does "ps axf" say?
>
> Andrew-- I wish I knew. As I said, the system "locks up" when this
> happens (shutting down corosync). By locks up, I mean, locks up. I
> can't log in, I can't control-c the session attempting the shutdown, I
> can't do ANYTHING even if I had another idle terminal window right there
> ready to check.
>
> Is there a debug-log setting I can set, so that the next few time(s) I
> shutdown, I can prepare the system for capturing something useful?

Sounds like a POSIX realtime process running into an endless loop
on all available cores. Or a kernel panic.

Enable serial console, or netconsole, or both.

Put a (ulimited) mlock'ed realtime busybox getty on the serial port ;-)
Or make your other session (and all its parents!) realtime in time.
Or un-realtime corosync.

For example, heartbeat setrlimit's itself for CPU every minute,
so it will be killed in case it runs into an endless loop...
Does corosync behave similarly?

-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
_______________________________________________
Linux-HA mailing list
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
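
To illustrate the "setrlimit's itself for CPU every minute" idea mentioned above, here is a minimal sketch in Python. It is not heartbeat's actual code; the function names and the 60-second budget are hypothetical, chosen only for illustration. The process keeps pushing its soft RLIMIT_CPU a little ahead of the CPU time it has already used. If it stops renewing because it is stuck in a loop, it crosses the limit and the kernel sends it SIGXCPU, which terminates it by default.

```python
import resource


def next_cpu_limit(used_seconds, budget_seconds, hard_limit):
    """Return the soft RLIMIT_CPU value allowing budget_seconds more CPU time."""
    soft = int(used_seconds) + budget_seconds
    # An unprivileged process may not raise the soft limit above the hard limit.
    if hard_limit != resource.RLIM_INFINITY and soft > hard_limit:
        soft = hard_limit
    return soft


def renew_cpu_budget(budget_seconds=60):
    """Grant this process budget_seconds more CPU time, counted from now."""
    usage = resource.getrusage(resource.RUSAGE_SELF)
    used = usage.ru_utime + usage.ru_stime  # user + system CPU seconds so far
    _, hard = resource.getrlimit(resource.RLIMIT_CPU)
    soft = next_cpu_limit(used, budget_seconds, hard)
    # Only the soft limit moves; the hard limit is left untouched.
    resource.setrlimit(resource.RLIMIT_CPU, (soft, hard))
    return soft
```

A daemon would call renew_cpu_budget() from its main loop about once a minute. A healthy loop keeps renewing and never hits the limit; a spinning one burns through the remaining budget and is killed instead of hanging the machine.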
