Ahmon Dancy wrote:
> 
> On the topic of various ways to try to fix the lockup problem..
> perhaps we should try a different approach.
> Does anyone know the exact nature of the lockup?  i.e.:
> Is the kernel getting stuck in a loop somewhere?
> Is the kernel waiting for something to happen that will never happen?
> Is it an actual hardware lockup? i.e., Is the CPU trying to talk to hardware
> that is no longer responding?

I haven't had a lockup in a long while, but IIRC there were two types:
one in which you got a mesg "TLB stuck on IPI wait" when one CPU hung
but the other was still working, and a total non-responsive lockup.

This second wouldn't respond to any interrupts, so I believe was 
a failure of the [IO] APIC or it's bus.  This thing is an accident
waiting to happen [open drain wired-or at BCLK/4].  Abit didn't 
help by placing the Southbridge PIIX4 where the IO-APIC lives so 
far away from the CPUs.

> Unfortunately, the best person to answer this is probably 
> Andre.. but he doesn't experience the problem.

He's also smart enough not to put a lockup generator^H^H^H^H^H^H
sound card in his machines.  I firmly believe the hardware side
of these things were never designed for reliability, still less
for real operating systems, and the drivers still less for SMP.

I have put my BP6es under brutal interrupt load:  full-dupex,
bi-directional runt-packet flood pings on two 100bTX ethercards
through crossover cables to other machines, plus md5sum /dev/hda1
and HZ=1000.  They stay stable.  I've got the /proc/interrupts
around somewhere.

I'd like to learn how to program the parport, especially register
an ISR, so I could try to reach even higher int loads.  I can get
~300kints/sec (near theoretical) with the APIC ICR, but that's
cheating because it doesn't use the weak IO-APIC.

-- Robert  author `cpuburn`  http://users.ev1.net/~redelm
--
=-          To unsubscribe, email [EMAIL PROTECTED] with the       -=
=-                body of "unsubscribe linux-abit".                 -=

Reply via email to