On 12/6/2011 10:41 AM, Julien Cigar wrote:
Hello,
I'm running 9.0-RC3 on a HP Proliant Microserver (N40L). A disk died
in my graid3 array and I replaced it with a new one, and now have tons
of:
ahcich3: Timeout on slot 5 port 0
ahcich3: is 00000000 cs 00000000 ss 00003f60 rs 00003f60 tfd 40 serr
00000000 cmd 0000ed17
ahcich3: Timeout on slot 0 port 0
ahcich3: is 00000000 cs 00000000 ss 00000003 rs 00000003 tfd 40 serr
00000000 cmd 0000e117
ahcich3: Timeout on slot 1 port 0
ahcich3: is 00000000 cs 00000000 ss 000003fe rs 000003fe tfd 40 serr
00000000 cmd 0000e917
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00030000 rs 00030000 tfd 40 serr
00000000 cmd 0000f217
ahcich3: Timeout on slot 15 port 0
ahcich3: is 00000000 cs 00000000 ss 00018000 rs 00018000 tfd 40 serr
00000000 cmd 0000f017
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00780000 rs 00780000 tfd 40 serr
00000000 cmd 0000f617
ahcich3: Timeout on slot 11 port 0
ahcich3: is 00000000 cs 00000000 ss 000ff800 rs 000ff800 tfd 40 serr
00000000 cmd 0000f317
ahcich3: Timeout on slot 13 port 0
ahcich3: is 00000000 cs 00000000 ss 00006000 rs 00006000 tfd 40 serr
00000000 cmd 0000ef17
ahcich3: Timeout on slot 11 port 0
ahcich3: is 00000000 cs 00000000 ss 001ff800 rs 001ff800 tfd 40 serr
00000000 cmd 0000f417
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00380000 rs 00380000 tfd 40 serr
00000000 cmd 0000f517
ahcich3: Timeout on slot 29 port 0
ahcich3: is 00000000 cs 00000000 ss e000001f rs e000001f tfd 40 serr
00000000 cmd 0000e417
ahcich3: Timeout on slot 27 port 0
ahcich3: is 00000000 cs 00000000 ss 18000000 rs 18000000 tfd 40 serr
00000000 cmd 0000fc17
ahcich3: Timeout on slot 4 port 0
ahcich3: is 00000000 cs 00000000 ss 00001ff0 rs 00001ff0 tfd 40 serr
00000000 cmd 0000ec17
ahcich3: Timeout on slot 28 port 0
ahcich3: is 00000000 cs 00000000 ss 70000000 rs 70000000 tfd 40 serr
00000000 cmd 0000fe17
ahcich3: Timeout on slot 8 port 0
ahcich3: is 00000000 cs 00000000 ss 0000ff00 rs 0000ff00 tfd 40 serr
00000000 cmd 0000ef17
ahcich3: Timeout on slot 29 port 0
ahcich3: is 00000000 cs 00000000 ss 60000000 rs 60000000 tfd 40 serr
00000000 cmd 0000ff17
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00070000 rs 00070000 tfd 40 serr
00000000 cmd 0000f217
ahcich3: Timeout on slot 19 port 0
ahcich3: is 00000000 cs 00000000 ss 00780000 rs 00780000 tfd 40 serr
00000000 cmd 0000f617
ahcich3: Timeout on slot 7 port 0
ahcich3: is 00000000 cs 00000000 ss 00007f80 rs 00007f80 tfd 40 serr
00000000 cmd 0000ee17
ahcich3: Timeout on slot 16 port 0
ahcich3: is 00000000 cs 00000000 ss 00070000 rs 00070000 tfd 40 serr
00000000 cmd 0000f217
ahcich3: Timeout on slot 0 port 0
ahcich3: is 00000000 cs 00000000 ss 00000007 rs 00000007 tfd 40 serr
00000000 cmd 0000e217
ahcich3: Timeout on slot 20 port 0
ahcich3: is 00000000 cs 00000000 ss 01b00000 rs 01b00000 tfd 40 serr
00000000 cmd 0000f817
ahcich3: Timeout on slot 20 port 0
ahcich3: is 00000000 cs 00000000 ss 00b00000 rs 00b00000 tfd 40 serr
00000000 cmd 0000f717
ahcich3: Timeout on slot 15 port 0
(...)
Those are Seagate disks:
jcigar@backup conf % sudo camcontrol devlist
<VB0250EAVER HPG0> at scbus0 target 0 lun 0 (pass0,ada0)
<ST31000528AS CC38> at scbus1 target 0 lun 0 (pass1,ada1)
<ST31000528AS CC38> at scbus2 target 0 lun 0 (pass2,ada2)
<ST31000333AS CC1H> at scbus3 target 0 lun 0 (pass3,ada3)
The controller is:
ahci0@pci0:0:17:0: class=0x010601 card=0x1609103c chip=0x43911002
rev=0x40 hdr=0x00
vendor = 'ATI Technologies Inc'
device = 'SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]'
class = mass storage
subclass = SATA
jcigar@backup conf % vmstat -i
interrupt total rate
irq17: ehci0 ehci1+ 2 0
irq18: ohci0 ohci1+ 30 0
irq256: bge0 31354 4
irq257: ahci0 19012658 2477
irq258: hpet0:t0 4926229 641
irq259: hpet0:t1 4635261 603
Total 28605534 3727
Any idea what could be the cause of this ... ?
Thanks,
Julien
_______________________________________________
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscr...@freebsd.org"
I had very similar situation with AHCI timeouts. SMART was not showing
any problem, but finally I decided to remove drive and perform low level
tests. I have found very long access time to some sectors (I use
HDDScan for windows). I have replaced drive with working fine and my
problem are gone (so far).
Thanks,
_______________________________________________
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscr...@freebsd.org"