Re: Dell Power Edge 1950 SAS Raid1 'sd0: not queued: error 5'

2008-05-14 Thread Claer
On Wed, May 14 2008 at 24:09, David Gwynne wrote:
 i believe this has been fixed with revision 1.80 of src/sys/dev/ic/mfi.c. 
 could you please try -current (or at least 4.3) and see if the problem 
 persists?
OK. I'll try to upgrade these servers asap. (It's have to be done anyway =))


Claer

 On 14/05/2008, at 1:10 AM, Claer wrote:

 Hi list,

 Today one of our first Dell 1950 crashed in a strange way. I asked non
 IT people to restart it that's why I dont have console traces of the
 problem.

 Before the server became unresponsitive, I could see this in
 /var/log/messages :

 May 11 04:50:55 fw1 /bsd: sd0: not queued, error 5
 May 11 04:51:26 fw1 last message repeated 89 times
 May 11 04:51:26 fw1 last message repeated 34 times

 Googling for sd0: not queued, error 5 I found a thread with a similar
 log. http://readlist.com/lists/openbsd.org/misc/11/56564.html

 It seems the problem is not fixed for the release installed on this
 firewall (4.1). It's the first time in around 1 year that I got this
 problem.
 During the problem, telnet server 22 opened and closed the connection
 without displaying ssh banner. The network stack was still running
 and the carp interfaces did not change to BACKUP mode.

 As this firewall is used for tests it did not impact any users
 (exept myself ;)) but permits to run debug commands if suggested.
 I'll update the perc firmware as mentionned on the thread posted above.
 The server will be upgraded soon to 4.3 too.

 Any  help on how to avoid this problem is welcome.


 Claer

 dmeg :

 OpenBSD 4.1-stable (GENERIC) #1: Fri Aug 17 23:55:00 CEST 2007
[EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
 cpu0: Intel(R) Xeon(R) CPU 5110 @ 1.60GHz (GenuineIntel 686-class)
 1.60 GHz
 cpu0:
 FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,VMX,TM2,CX16,xTPR
 real mem  = 1072955392 (1047808K)
 avail mem = 971632640 (948860K)
 using 4278 buffers containing 53772288 bytes (52512K) of memory
 mainbus0 (root)
 bios0 at mainbus0: AT/286+ BIOS, date 03/26/07, BIOS32 rev. 0 @ 0xffe90,
 SMBIOS rev. 2.4 @ 0x3ffbc000 (62 entries)
 bios0: Dell Inc. PowerEdge 1950
 pcibios0 at bios0: rev 2.1 @ 0xf/0x1
 pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfaa60/368 (21 entries)
 pcibios0: PCI Interrupt Router at 000:31:0 (Intel 6321ESB LPC rev
 0x00)
 pcibios0: PCI bus #22 is the last bus
 bios0: ROM list: 0xc/0x9000! 0xc9000/0x1000 0xca000/0x1800
 0xcb800/0x5200 0xec000/0x4000!
 acpi at mainbus0 not configured
 ipmi0 at mainbus0: version 2.0 interface KCS iobase 0xca8/8 spacing 4
 cpu0 at mainbus0
 pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
 pchb0 at pci0 dev 0 function 0 Intel 5000X Host rev 0x12
 ppb0 at pci0 dev 2 function 0 Intel 5000 PCIE rev 0x12
 pci1 at ppb0 bus 6
 ppb1 at pci1 dev 0 function 0 Intel 6321ESB PCIE rev 0x01
 pci2 at ppb1 bus 7
 ppb2 at pci2 dev 0 function 0 Intel 6321ESB PCIE rev 0x01
 pci3 at ppb2 bus 8
 ppb3 at pci3 dev 0 function 0 ServerWorks PCIE-PCIX rev 0xc3
 pci4 at ppb3 bus 9
 bnx0 at pci4 dev 0 function 0 Broadcom BCM5708 rev 0x12: irq 5
 ppb4 at pci2 dev 1 function 0 Intel 6321ESB PCIE rev 0x01
 pci5 at ppb4 bus 10
 ppb5 at pci1 dev 0 function 3 Intel 6321ESB PCIE-PCIX rev 0x01
 pci6 at ppb5 bus 11
 ppb6 at pci0 dev 3 function 0 Intel 5000 PCIE rev 0x12
 pci7 at ppb6 bus 1
 ppb7 at pci7 dev 0 function 0 Intel IOP333 PCIE-PCIX rev 0x00
 pci8 at ppb7 bus 2
 mfi0 at pci8 dev 14 function 0 Dell PERC 5 rev 0x00: irq 6
 mfi0: logical drives 1, version 5.1.1-0040, 256MB RAM
 scsibus0 at mfi0: 1 targets
 sd0 at scsibus0 targ 0 lun 0: DELL, PERC 5/i, 1.03 SCSI3 0/direct
 fixed
 sd0: 69376MB, 69376 cyl, 64 head, 32 sec, 512 bytes/sec, 142082048 sec
 total
 ppb8 at pci7 dev 0 function 2 Intel IOP333 PCIE-PCIX rev 0x00
 pci9 at ppb8 bus 3
 ppb9 at pci0 dev 4 function 0 Intel 5000 PCIE rev 0x12
 pci10 at ppb9 bus 12
 ppb10 at pci10 dev 0 function 0 vendor IDT, unknown product 0x8018 rev
 0x04
 pci11 at ppb10 bus 13
 ppb11 at pci11 dev 0 function 0 vendor IDT, unknown product 0x8018 rev
 0x04
 pci12 at ppb11 bus 14
 em0 at pci12 dev 0 function 0 Intel PRO/1000 QP (82571EB) rev 0x06:
 irq 5, address 00:15:17:3e:c8:dc
 em1 at pci12 dev 0 function 1 Intel PRO/1000 QP (82571EB) rev 0x06:
 irq 11, address 00:15:17:3e:c8:dd
 ppb12 at pci11 dev 1 function 0 vendor IDT, unknown product 0x8018 rev
 0x04
 pci13 at ppb12 bus 15
 em2 at pci13 dev 0 function 0 Intel PRO/1000 QP (82571EB) rev 0x06:
 irq 11, address 00:15:17:3e:c8:de
 em3 at pci13 dev 0 function 1 Intel PRO/1000 QP (82571EB) rev 0x06:
 irq 6, address 00:15:17:3e:c8:df
 ppb13 at pci0 dev 5 function 0 Intel 5000 PCIE rev 0x12
 pci14 at ppb13 bus 16
 ppb14 at pci0 dev 6 function 0 Intel 5000 PCIE rev 0x12
 pci15 at ppb14 bus 17
 ppb15 at pci15 dev 0 function 0 vendor IDT, unknown product 0x8018 rev
 0x04
 pci16 at ppb15 bus 18
 ppb16 at pci16 dev 0 function 0 vendor IDT, unknown product 0x8018 rev
 0x04
 pci17 at 

Re: Dell Power Edge 1950 SAS Raid1 'sd0: not queued: error 5'

2008-05-13 Thread David Gwynne
i believe this has been fixed with revision 1.80 of src/sys/dev/ic/ 
mfi.c. could you please try -current (or at least 4.3) and see if the  
problem persists?


dlg

On 14/05/2008, at 1:10 AM, Claer wrote:


Hi list,

Today one of our first Dell 1950 crashed in a strange way. I asked non
IT people to restart it that's why I dont have console traces of the
problem.

Before the server became unresponsitive, I could see this in
/var/log/messages :

May 11 04:50:55 fw1 /bsd: sd0: not queued, error 5
May 11 04:51:26 fw1 last message repeated 89 times
May 11 04:51:26 fw1 last message repeated 34 times

Googling for sd0: not queued, error 5 I found a thread with a  
similar

log. http://readlist.com/lists/openbsd.org/misc/11/56564.html

It seems the problem is not fixed for the release installed on this
firewall (4.1). It's the first time in around 1 year that I got this
problem.
During the problem, telnet server 22 opened and closed the  
connection

without displaying ssh banner. The network stack was still running
and the carp interfaces did not change to BACKUP mode.

As this firewall is used for tests it did not impact any users
(exept myself ;)) but permits to run debug commands if suggested.
I'll update the perc firmware as mentionned on the thread posted  
above.

The server will be upgraded soon to 4.3 too.

Any  help on how to avoid this problem is welcome.


Claer

dmeg :

OpenBSD 4.1-stable (GENERIC) #1: Fri Aug 17 23:55:00 CEST 2007
   [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Xeon(R) CPU 5110 @ 1.60GHz (GenuineIntel 686-class)
1.60 GHz
cpu0:
FPU 
,V86 
,DE 
,PSE 
,TSC 
,MSR 
,PAE 
,MCE 
,CX8 
,APIC 
,SEP 
,MTRR 
,PGE 
,MCA 
,CMOV 
,PAT 
,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS- 
CPL,VMX,TM2,CX16,xTPR

real mem  = 1072955392 (1047808K)
avail mem = 971632640 (948860K)
using 4278 buffers containing 53772288 bytes (52512K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+ BIOS, date 03/26/07, BIOS32 rev. 0 @  
0xffe90,

SMBIOS rev. 2.4 @ 0x3ffbc000 (62 entries)
bios0: Dell Inc. PowerEdge 1950
pcibios0 at bios0: rev 2.1 @ 0xf/0x1
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfaa60/368 (21 entries)
pcibios0: PCI Interrupt Router at 000:31:0 (Intel 6321ESB LPC rev
0x00)
pcibios0: PCI bus #22 is the last bus
bios0: ROM list: 0xc/0x9000! 0xc9000/0x1000 0xca000/0x1800
0xcb800/0x5200 0xec000/0x4000!
acpi at mainbus0 not configured
ipmi0 at mainbus0: version 2.0 interface KCS iobase 0xca8/8 spacing 4
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 Intel 5000X Host rev 0x12
ppb0 at pci0 dev 2 function 0 Intel 5000 PCIE rev 0x12
pci1 at ppb0 bus 6
ppb1 at pci1 dev 0 function 0 Intel 6321ESB PCIE rev 0x01
pci2 at ppb1 bus 7
ppb2 at pci2 dev 0 function 0 Intel 6321ESB PCIE rev 0x01
pci3 at ppb2 bus 8
ppb3 at pci3 dev 0 function 0 ServerWorks PCIE-PCIX rev 0xc3
pci4 at ppb3 bus 9
bnx0 at pci4 dev 0 function 0 Broadcom BCM5708 rev 0x12: irq 5
ppb4 at pci2 dev 1 function 0 Intel 6321ESB PCIE rev 0x01
pci5 at ppb4 bus 10
ppb5 at pci1 dev 0 function 3 Intel 6321ESB PCIE-PCIX rev 0x01
pci6 at ppb5 bus 11
ppb6 at pci0 dev 3 function 0 Intel 5000 PCIE rev 0x12
pci7 at ppb6 bus 1
ppb7 at pci7 dev 0 function 0 Intel IOP333 PCIE-PCIX rev 0x00
pci8 at ppb7 bus 2
mfi0 at pci8 dev 14 function 0 Dell PERC 5 rev 0x00: irq 6
mfi0: logical drives 1, version 5.1.1-0040, 256MB RAM
scsibus0 at mfi0: 1 targets
sd0 at scsibus0 targ 0 lun 0: DELL, PERC 5/i, 1.03 SCSI3 0/direct
fixed
sd0: 69376MB, 69376 cyl, 64 head, 32 sec, 512 bytes/sec, 142082048 sec
total
ppb8 at pci7 dev 0 function 2 Intel IOP333 PCIE-PCIX rev 0x00
pci9 at ppb8 bus 3
ppb9 at pci0 dev 4 function 0 Intel 5000 PCIE rev 0x12
pci10 at ppb9 bus 12
ppb10 at pci10 dev 0 function 0 vendor IDT, unknown product 0x8018  
rev

0x04
pci11 at ppb10 bus 13
ppb11 at pci11 dev 0 function 0 vendor IDT, unknown product 0x8018  
rev

0x04
pci12 at ppb11 bus 14
em0 at pci12 dev 0 function 0 Intel PRO/1000 QP (82571EB) rev 0x06:
irq 5, address 00:15:17:3e:c8:dc
em1 at pci12 dev 0 function 1 Intel PRO/1000 QP (82571EB) rev 0x06:
irq 11, address 00:15:17:3e:c8:dd
ppb12 at pci11 dev 1 function 0 vendor IDT, unknown product 0x8018  
rev

0x04
pci13 at ppb12 bus 15
em2 at pci13 dev 0 function 0 Intel PRO/1000 QP (82571EB) rev 0x06:
irq 11, address 00:15:17:3e:c8:de
em3 at pci13 dev 0 function 1 Intel PRO/1000 QP (82571EB) rev 0x06:
irq 6, address 00:15:17:3e:c8:df
ppb13 at pci0 dev 5 function 0 Intel 5000 PCIE rev 0x12
pci14 at ppb13 bus 16
ppb14 at pci0 dev 6 function 0 Intel 5000 PCIE rev 0x12
pci15 at ppb14 bus 17
ppb15 at pci15 dev 0 function 0 vendor IDT, unknown product 0x8018  
rev

0x04
pci16 at ppb15 bus 18
ppb16 at pci16 dev 0 function 0 vendor IDT, unknown product 0x8018  
rev

0x04
pci17 at ppb16 bus 19
em4 at pci17 dev 0 function 0 Intel PRO/1000 QP (82571EB) rev 0x06:
irq 5, address 00:15:17:3e:c6:0c
em5 at pci17 dev 0 function 1 Intel PRO/1000 QP (82571EB)