Re: uvm_fault after fsck on OpenBSD 3.9

2008-05-08 Thread Kirk Ismay

You can probably test if I'm barking up the right tree or barking
mad by booting a 4.3 bsd.rd and see if you can fsck your root
partition.  Since you appear to have a serial console, I'd try to
do this by booting single user, mount -f / (to skip the fsck), start
the rest of the system, and copy over a 4.3 bsd.rd, then reboot off
it.  If the fsck works, reboot, and upgrade the machine, please.

Nick.
  
Turned out to be bad RAM. Fortunately the system had 2 512MB sticks, so 
we just pulled one and its running fine.


I'll be upgrading soon.

--

Sincerely, 
Kirk Ismay

System Administrator

--
Net Idea
201-625 Front Street Nelson, BC V1L 4B6
P:250-352-3512 | F:250-352-9780 | TF:1-888-352-3512

Check out our brand new website! www.netidea.com



Re: uvm_fault after fsck on OpenBSD 3.9

2008-05-06 Thread Nick Holland
Kirk Ismay wrote:
> I get the following on boot:
> 
> Automatic boot in progress: starting file system checks.
> /dev/rwd0a: 1707 files, 16951 used, 283059 free (35 frags, 35378 blocks, 
> 0.0% fragmentation)
> /dev/rwd0a: MARKING FILE SYSTEM CLEAN
> uvm_fault(0xd05e1aa0, 0x800, 0, 1) -> e
> kernel: page fault trap, code=0
> Stopped at  vget+0x1d:  movl0x3c(%esi),%edx
> 
> I suspect a hardware issue, but as it's on a rack on the other side of 
> the country, I can't easily tell.  What else might be causing this ?
> 
> Thanks.
> 
> dmesg, trace & ps below:
[snipped for size, but thanks!]

First reaction is "upgrade, that's too old".
Second reaction is "upgrade, as that may fix your problem".

FROM MEMORY (mine, not your computer's, and the only error checking
my memory has is this list!), I think there were some file system
bugs found and fixed that might possibly cause crashes like this
since the 3.9 days.  (I know there were things found and fixed, just
not sure if it was before or after 3.9, or if they look like the
problem you are seeing).

You can probably test if I'm barking up the right tree or barking
mad by booting a 4.3 bsd.rd and see if you can fsck your root
partition.  Since you appear to have a serial console, I'd try to
do this by booting single user, mount -f / (to skip the fsck), start
the rest of the system, and copy over a 4.3 bsd.rd, then reboot off
it.  If the fsck works, reboot, and upgrade the machine, please.

Nick.



uvm_fault after fsck on OpenBSD 3.9

2008-05-06 Thread Kirk Ismay

I get the following on boot:

Automatic boot in progress: starting file system checks.
/dev/rwd0a: 1707 files, 16951 used, 283059 free (35 frags, 35378 blocks, 
0.0% fragmentation)

/dev/rwd0a: MARKING FILE SYSTEM CLEAN
uvm_fault(0xd05e1aa0, 0x800, 0, 1) -> e
kernel: page fault trap, code=0
Stopped at  vget+0x1d:  movl0x3c(%esi),%edx

I suspect a hardware issue, but as it's on a rack on the other side of 
the country, I can't easily tell.  What else might be causing this ?


Thanks.

dmesg, trace & ps below:

[ using 493460 bytes of bsd ELF symbol table ]
Copyright (c) 1982, 1986, 1989, 1991, 1993
   The Regents of the University of California.  All rights reserved.
Copyright (c) 1995-2006 OpenBSD. All rights reserved.  
http://www.OpenBSD.org


OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
   [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Celeron(R) CPU 2.80GHz ("GenuineIntel" 686-class) 2.80 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,TM2,CNXT-ID

real mem  = 1063755776 (1038824K)
avail mem = 963919872 (941328K)
using 4278 buffers containing 53288960 bytes (52040K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(12) BIOS, date 02/27/06, BIOS32 rev. 0 @ 0xfa000
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
apm0: flags 70102 dobusy 1 doidle 1
pcibios0 at bios0: rev 3.0 @ 0xf/0xcb24
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfc9f0/288 (16 entries)
pcibios0: PCI Exclusive IRQs: 5 9 10 12
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801FB LPC" rev 0x00)
pcibios0: PCI bus #6 is the last bus
bios0: ROM list: 0xc/0x9400! 0xcc000/0x4000! 0xd/0x1800 
0xd2000/0x1800

cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel E7221 MCH Host" rev 0x05
ppb0 at pci0 dev 1 function 0 "Intel E7221 PCIE" rev 0x05
pci1 at ppb0 bus 1
ppb1 at pci1 dev 0 function 0 "Intel PCIE-PCIE" rev 0x09
pci2 at ppb1 bus 2
em0 at pci2 dev 1 function 0 "Intel PRO/1000MT (82546GB)" rev 0x03: irq 
9, address 00:04:23:cd:4f:12
em1 at pci2 dev 1 function 1 "Intel PRO/1000MT (82546GB)" rev 0x03: irq 
12, address 00:04:23:cd:4f:13

"Intel IOxAPIC" rev 0x09 at pci1 dev 0 function 1 not configured
vga1 at pci0 dev 2 function 0 "Intel E7221 Video" rev 0x05: aperture at 
0xd040, size 0x800

wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
ppb2 at pci0 dev 28 function 0 "Intel 82801FB PCIE" rev 0x03
pci3 at ppb2 bus 3
ppb3 at pci0 dev 28 function 2 "Intel 82801FB PCIE" rev 0x03
pci4 at ppb3 bus 4
bge0 at pci4 dev 0 function 0 "Broadcom BCM5721" rev 0x11, BCM5750 B1 
(0x4101): irq 5, address 00:30:48:86:dc:60

brgphy0 at bge0 phy 1: BCM5750 10/100/1000baseT PHY, rev. 0
ppb4 at pci0 dev 28 function 3 "Intel 82801FB PCIE" rev 0x03
pci5 at ppb4 bus 5
bge1 at pci5 dev 0 function 0 "Broadcom BCM5721" rev 0x11, BCM5750 B1 
(0x4101): irq 10, address 00:30:48:86:dc:61

brgphy1 at bge1 phy 1: BCM5750 10/100/1000baseT PHY, rev. 0
uhci0 at pci0 dev 29 function 0 "Intel 82801FB USB" rev 0x03: irq 10
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1 at pci0 dev 29 function 1 "Intel 82801FB USB" rev 0x03: irq 10
usb1 at uhci1: USB revision 1.0
uhub1 at usb1
uhub1: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2 at pci0 dev 29 function 2 "Intel 82801FB USB" rev 0x03: irq 5
usb2 at uhci2: USB revision 1.0
uhub2 at usb2
uhub2: Intel UHCI root hub, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3 at pci0 dev 29 function 3 "Intel 82801FB USB" rev 0x[ using 493460 
bytes of bsd ELF symbol table ]

Copyright (c) 1982, 1986, 1989, 1991, 1993
   The Regents of the University of California.  All rights reserved.
Copyright (c) 1995-2006 OpenBSD. All rights reserved.  
http://www.OpenBSD.org


OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
   [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Celeron(R) CPU 2.80GHz ("GenuineIntel" 686-class) 2.80 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,TM2,CNXT-ID

real mem  = 1063755776 (1038824K)
avail mem = 963919872 (941328K)
using 4278 buffers containing 53288960 bytes (52040K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(12) BIOS, date 02/27/06, BIOS32 rev. 0 @ 0xfa000
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
apm0: flags 70102 dobusy 1 doidle 1
pcibios0 at bios0: rev 3.0 @ 0xf/0xcb24
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfc9f0/288 (16 entries)
pcibios0: PCI Exclusive IRQs: 5 9 10 12
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801FB LPC" rev 0x00)
pcibi