On Fri, 09 Apr 2004, martin f krafft wrote:
> also sprach martin f krafft <[EMAIL PROTECTED]> [2004.04.09.0034 +0200]:
> > > No. nmi_watchdog=1 means IOAPIC, nmi_watchdog=2 means LAPIC. You
> > > need to have the NMI Watchdog compiled into the kernel.
> > 
> > This is my problem. I have looked around the LAPIC stuff and in the
> > Watchdog section, but there is nothing about an NMI watchdog. I have
> > enabled the Hangcheck timer and the Software watchdog.
> 
> Okay, I did find it -- kernel/Documentation/nmi_watchdog.txt told me
> all that I needed.
> 
> Last question here: is it adviseable to run the NMI watchdog on
> a productive system? After all, uptime is important, and as far as

It won't hurt.  But nmi_watchdog is only for usermode and kernemode hangs.
The NMI watchdog is useless against nasty bugs (hw or sw) that make the
hardware unstable.

I run my machine with it enabled, since if it ever decides to hang, I want
an OOPs to have a slim chance of being produced before the TCO watchdog
kicks in.  

I get about 101 NMIs per second on each CPU using nmi_watchdog=1. HZ=100 in
this machine, I suppose...  This is a not-really-SMP machine, with a single
P4 HT processor.

The software watchdog will reboot your machine (and AFAIK it might very well
be using NMIs to do it, too).  If you can use a chipset watchdog, however,
that's much better (e.g. the TCO timers in most Intel systems, and
_especially_ the IPMI watchdog in servers with a baseboard controller worth
something).

You can only have one watchdog listening on /dev/watchdog, btw.

> I understand, the NMI watchdog should be able to reboot the machine
> more often than it happens without the NMI watchdog. Of course,
> I won't stop until this machine is rock stable. But in the mean
> time... ?

It might help, yes.  But only if the hang is not killing the CPU or the
northbridge.  Otherwise, the CPU cannot execute the NMI code :)  and
any software watchdog will be useless.

-- 
  "One disk to rule them all, One disk to find them. One disk to bring
  them all and in the darkness grind them. In the Land of Redmond
  where the shadows lie." -- The Silicon Valley Tarot
  Henrique Holschuh


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to