On 12/6/2011 10:41 AM, Julien Cigar wrote:
Hello,

I'm running 9.0-RC3 on a HP Proliant Microserver (N40L). A disk died in my graid3 array and I replaced it with a new one, and now have tons of:

ahcich3: Timeout on slot 5 port 0
ahcich3: is 00000000 cs 00000000 ss 00003f60 rs 00003f60 tfd 40 serr 00000000 cmd 0000ed17
ahcich3: Timeout on slot 0 port 0
ahcich3: is 00000000 cs 00000000 ss 00000003 rs 00000003 tfd 40 serr 00000000 cmd 0000e117
ahcich3: Timeout on slot 1 port 0
ahcich3: is 00000000 cs 00000000 ss 000003fe rs 000003fe tfd 40 serr 00000000 cmd 0000e917
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00030000 rs 00030000 tfd 40 serr 00000000 cmd 0000f217
ahcich3: Timeout on slot 15 port 0
ahcich3: is 00000000 cs 00000000 ss 00018000 rs 00018000 tfd 40 serr 00000000 cmd 0000f017
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00780000 rs 00780000 tfd 40 serr 00000000 cmd 0000f617
ahcich3: Timeout on slot 11 port 0
ahcich3: is 00000000 cs 00000000 ss 000ff800 rs 000ff800 tfd 40 serr 00000000 cmd 0000f317
ahcich3: Timeout on slot 13 port 0
ahcich3: is 00000000 cs 00000000 ss 00006000 rs 00006000 tfd 40 serr 00000000 cmd 0000ef17
ahcich3: Timeout on slot 11 port 0
ahcich3: is 00000000 cs 00000000 ss 001ff800 rs 001ff800 tfd 40 serr 00000000 cmd 0000f417
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00380000 rs 00380000 tfd 40 serr 00000000 cmd 0000f517
ahcich3: Timeout on slot 29 port 0
ahcich3: is 00000000 cs 00000000 ss e000001f rs e000001f tfd 40 serr 00000000 cmd 0000e417
ahcich3: Timeout on slot 27 port 0
ahcich3: is 00000000 cs 00000000 ss 18000000 rs 18000000 tfd 40 serr 00000000 cmd 0000fc17
ahcich3: Timeout on slot 4 port 0
ahcich3: is 00000000 cs 00000000 ss 00001ff0 rs 00001ff0 tfd 40 serr 00000000 cmd 0000ec17
ahcich3: Timeout on slot 28 port 0
ahcich3: is 00000000 cs 00000000 ss 70000000 rs 70000000 tfd 40 serr 00000000 cmd 0000fe17
ahcich3: Timeout on slot 8 port 0
ahcich3: is 00000000 cs 00000000 ss 0000ff00 rs 0000ff00 tfd 40 serr 00000000 cmd 0000ef17
ahcich3: Timeout on slot 29 port 0
ahcich3: is 00000000 cs 00000000 ss 60000000 rs 60000000 tfd 40 serr 00000000 cmd 0000ff17
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00070000 rs 00070000 tfd 40 serr 00000000 cmd 0000f217
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00780000 rs 00780000 tfd 40 serr 00000000 cmd 0000f617
ahcich3: Timeout on slot 7 port 0
ahcich3: is 00000000 cs 00000000 ss 00007f80 rs 00007f80 tfd 40 serr 00000000 cmd 0000ee17
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00070000 rs 00070000 tfd 40 serr 00000000 cmd 0000f217
ahcich3: Timeout on slot 0 port 0
ahcich3: is 00000000 cs 00000000 ss 00000007 rs 00000007 tfd 40 serr 00000000 cmd 0000e217
ahcich3: Timeout on slot 20 port 0
ahcich3: is 00000000 cs 00000000 ss 01b00000 rs 01b00000 tfd 40 serr 00000000 cmd 0000f817
ahcich3: Timeout on slot 20 port 0
ahcich3: is 00000000 cs 00000000 ss 00b00000 rs 00b00000 tfd 40 serr 00000000 cmd 0000f717
ahcich3: Timeout on slot 15 port 0

(...)

Those are Seagate disks:

jcigar@backup conf % sudo camcontrol devlist
<VB0250EAVER HPG0>                 at scbus0 target 0 lun 0 (pass0,ada0)
<ST31000528AS CC38>                at scbus1 target 0 lun 0 (pass1,ada1)
<ST31000528AS CC38>                at scbus2 target 0 lun 0 (pass2,ada2)
<ST31000333AS CC1H>                at scbus3 target 0 lun 0 (pass3,ada3)

The controller is:

ahci0@pci0:0:17:0: class=0x010601 card=0x1609103c chip=0x43911002 rev=0x40 hdr=0x00
    vendor     = 'ATI Technologies Inc'
    device     = 'SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]'
    class      = mass storage
    subclass   = SATA

jcigar@backup conf % vmstat -i
interrupt                          total       rate
irq17: ehci0 ehci1+                    2          0
irq18: ohci0 ohci1+                   30          0
irq256: bge0                       31354          4
irq257: ahci0                   19012658       2477
irq258: hpet0:t0                 4926229        641
irq259: hpet0:t1                 4635261        603
Total                           28605534       3727


Any idea what could be the cause of this ... ?


Thanks,
Julien




_______________________________________________
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscr...@freebsd.org"
I had very similar situation with AHCI timeouts. SMART was not showing any problem, but finally I decided to remove drive and perform low level tests. I have found very long access time to some sectors (I use HDDScan for windows). I have replaced drive with working fine and my problem are gone (so far).

Thanks,
_______________________________________________
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscr...@freebsd.org"

Reply via email to