Hello misc,

I run a server with two harddiscs running as a software RAID1 using ccd.
Yesterday I started to import a large database in PostgreSQL, and found allot 
of these errors in my logs:

error reading: Processor VRM
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
kcs_sendmsg: 10 27 b8
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data


I'm guessing that one of the disks is broken, but how can I found out which 
one? And is the data still stored correctly, or does this mean the database 
will be corrupt?

Below you will (hopefully) find all relevant information.


Thanks,


Hans



[EMAIL PROTECTED]:~] cat /etc/fstab
/dev/wd0a / ffs rw 1 1
/dev/wd1a /altroot ffs xx 0 0
/dev/ccd0a /home ffs rw,nodev,nosuid 1 2
/dev/ccd0b /usr ffs rw,nodev 1 2
/dev/ccd0d /var ffs rw,nosuid 1 2


[EMAIL PROTECTED]:~] cat /etc/ccd.conf
#       $OpenBSD: ccd.conf,v 1.1 1996/08/24 20:52:22 deraadt Exp $
# Configuration file for concatenated disk devices
#
# ccd   ileave  flags   component devices
#ccd0   16      none    /dev/sd2e /dev/sd3e
ccd0    16      CCDF_MIRROR     /dev/wd0d       /dev/wd1d


[EMAIL PROTECTED]:~] cat /var/run/dmesg.boot
OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 
1.27 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE
real mem  = 536166400 (523600K)
avail mem = 482222080 (470920K)
using 4278 buffers containing 26910720 bytes (26280K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(00) BIOS, date 10/18/01, BIOS32 rev. 0 @ 0xfda54
pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xf3bb0/240 (13 entries)
pcibios0: PCI Interrupt Router at 000:15:0 ("ServerWorks CSB5" rev 0x00)
pcibios0: PCI bus #0 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xc8000/0x8800 0xd0800/0x1800 0xd2000/0x1800
ipmi0 at mainbus0: version 1.5 interface KCS iobase 0xca2/2 spacing 1 irq 0
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20HE Host" rev 0x23
pci1 at pchb0 bus 1
pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20HE Host" rev 0x01
pchb2 at pci0 dev 0 function 2 "ServerWorks CNB20HE Host" rev 0x01
pchb3 at pci0 dev 0 function 3 "ServerWorks CNB20HE Host" rev 0x01
pci2 at pchb3 bus 2
pciide0 at pci0 dev 2 function 0 "Promise PDC20267" rev 0x02: DMA, channel 0 
configured to native-PCI, channel 1 configured to native-PCI
pciide0: using irq 11 for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <ST340016A>
wd0: 16-sector PIO, LBA, 38166MB, 78165360 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
wd1 at pciide0 channel 1 drive 1: <ST340016A>
wd1: 16-sector PIO, LBA, 38166MB, 78165360 sectors
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 5
fxp0 at pci0 dev 3 function 0 "Intel 8255x" rev 0x0d, i82550: irq 9, address 
00:03:47:bd:45:47
inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4
fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x0d, i82550: irq 5, address 
00:03:47:bd:45:48
inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4
vga1 at pci0 dev 12 function 0 "ATI Rage XL" rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92
iic0 at piixpm0: disabled to avoid ipmi0 interactions
pciide1 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x92: DMA
atapiscsi0 at pciide1 channel 0 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <SAMSUNG, CD-ROM SN-124, QM15> SCSI0 5/cdrom 
removable
cd0(pciide1:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2
ohci0 at pci0 dev 15 function 2 "ServerWorks OSB4/CSB5 USB" rev 0x05: irq 10, 
version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: ServerWorks OHCI root hub, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pchb4 at pci0 dev 15 function 3 "ServerWorks CSB5 LPC" rev 0x00
isa0 at mainbus0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask fdc5 netmask ffe5 ttymask ffe7
pctr: 686-class user-level performance counters enabled
mtrr: Pentium Pro MTRR support
dkcsum: wd0 matches BIOS drive 0x80
dkcsum: wd1 matches BIOS drive 0x81
root on wd0a
rootdev=0x0 rrootdev=0x300 rawdev=0x302
WARNING: / was not properly unmounted
wd1d: DMA error reading fsbn 503872 of 503872-503887 (wd1 bn 2592322; cn 2571 
tn 11 sn 61), retrying
wd0d: DMA error reading fsbn 503952 of 503952-503967 (wd0 bn 2592402; cn 2571 
tn 13 sn 15), retrying
wd1: transfer error, downgrading to Ultra-DMA mode 4
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 4
wd1d: DMA error reading fsbn 503872 of 503872-503887 (wd1 bn 2592322; cn 2571 
tn 11 sn 61), retrying
wd0: transfer error, downgrading to Ultra-DMA mode 4
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 4
wd0d: DMA error reading fsbn 503952 of 503952-503967 (wd0 bn 2592402; cn 2571 
tn 13 sn 15), retrying
wd1: transfer error, downgrading to Ultra-DMA mode 3
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 3
wd1d: DMA error reading fsbn 503872 of 503872-503887 (wd1 bn 2592322; cn 2571 
tn 11 sn 61), retrying
wd0: transfer error, downgrading to Ultra-DMA mode 3
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 3
wd0d: DMA error reading fsbn 503952 of 503952-503967 (wd0 bn 2592402; cn 2571 
tn 13 sn 15), retrying
wd1: transfer error, downgrading to Ultra-DMA mode 2
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 2
wd1d: DMA error reading fsbn 503872 of 503872-503887 (wd1 bn 2592322; cn 2571 
tn 11 sn 61), retrying
wd0: transfer error, downgrading to Ultra-DMA mode 2
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
wd0d: DMA error reading fsbn 503952 of 503952-503967 (wd0 bn 2592402; cn 2571 
tn 13 sn 15), retrying
wd1: soft error (corrected)
wd1: transfer error, downgrading to Ultra-DMA mode 1
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 1
wd1d: DMA error reading fsbn 503888 of 503888-503903 (wd1 bn 2592338; cn 2571 
tn 12 sn 14), retrying
wd0: soft error (corrected)
wd1: transfer error, downgrading to Ultra-DMA mode 0
wd1(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 0
wd1d: DMA error reading fsbn 503888 of 503888-503903 (wd1 bn 2592338; cn 2571 
tn 12 sn 14), retrying
wd1: transfer error, downgrading to DMA mode 2
wd1(pciide0:1:1): using PIO mode 4, DMA mode 2
wd1d: DMA error reading fsbn 503888 of 503888-503903 (wd1 bn 2592338; cn 2571 
tn 12 sn 14), retrying
wd1: transfer error, downgrading to PIO mode 4
wd1(pciide0:1:1): using PIO mode 4
wd1d: DMA error reading fsbn 503888 of 503888-503903 (wd1 bn 2592338; cn 2571 
tn 12 sn 14), retrying
wd1: soft error (corrected)
wd0: transfer error, downgrading to Ultra-DMA mode 1
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 1
wd0d: DMA error reading fsbn 504016 of 504016-504031 (wd0 bn 2592466; cn 2571 
tn 14 sn 16), retrying
wd0: transfer error, downgrading to Ultra-DMA mode 0
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 0
wd0d: DMA error reading fsbn 504016 of 504016-504031 (wd0 bn 2592466; cn 2571 
tn 14 sn 16), retrying
wd0: transfer error, downgrading to DMA mode 2
wd0(pciide0:0:0): using PIO mode 4, DMA mode 2
wd0d: DMA error reading fsbn 504016 of 504016-504031 (wd0 bn 2592466; cn 2571 
tn 14 sn 16), retrying
wd0: transfer error, downgrading to PIO mode 4
wd0(pciide0:0:0): using PIO mode 4
wd0d: DMA error reading fsbn 504016 of 504016-504031 (wd0 bn 2592466; cn 2571 
tn 14 sn 16), retrying
wd0: soft error (corrected)


[EMAIL PROTECTED]:~] dmesg |tail -n 50
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
kcs_sendmsg: 10 27 b8
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
kcs_sendmsg: 10 2d b8
error reading: Processor VRM
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
kcs_sendmsg: 10 27 b8
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data
 error code: ae
 error code: ae
kcs_sendmsg: 18 22
bmc_io_wait fails : v=88 m=03 b=01 read_data

Reply via email to