Hi,

I think I may have found the cause of the pst timeout panics.  I'm using
the Promise SX6000 RAID on -CURRENT, using the pst driver.  Unfortunately,
under sufficiently high I/O load, the box starts printing:

  "pst: timeout mfa=0x00327b90 cmd=0x01"

The 'mfa' address varies. It starts printing more and more rapidly, and
then eventually the machine wedges solid. Sometimes it makes it to:

  "panic: timeout table full"

Here's what I think is happening. Two timeouts are being scheduled every
time a timeout triggers, because pst_timeout schedules a timeout before
calling pst_rw to retry the operation. Then pst_rw schedules ANOTHER
timeout.  Both of these timeouts call pst_timeout, so they double every 10
seconds until there are a large enough number of timeouts firing, retrying
the same I/O operation, that the table fills and the machine panics.

Check out the following diff

  
http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/dev/pst/pst-raid.c.diff?r1=1.8&r2=1.9&f=h

This is where pst_rw was changed to schedule its own timeouts, but the
timeout function didn't have its removed.

Do you think this could be the correct explanation? It seems like once
pst_timeout is called, the machine is doomed... I'm recompiling my kernel
now to test the fix under load.

--Aaron
Index: /sys/dev/pst/pst-raid.c
===================================================================
RCS file: /usr/cvs/src/sys/dev/pst/pst-raid.c,v
retrieving revision 1.11
diff -u -r1.11 pst-raid.c
--- /sys/dev/pst/pst-raid.c     24 Aug 2003 17:54:17 -0000      1.11
+++ /sys/dev/pst/pst-raid.c     8 Sep 2003 02:32:58 -0000
@@ -316,11 +316,6 @@
        mtx_unlock(&request->psc->iop->mtx);
        return;
     }
-    if (dumping)
-       request->timeout_handle.callout = NULL;
-    else
-       request->timeout_handle =
-           timeout((timeout_t*)pst_timeout, request, 10 * hz);
     if (pst_rw(request)) {
        iop_free_mfa(request->psc->iop, request->mfa);
        biofinish(request->bp, NULL, EIO);
_______________________________________________
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Reply via email to