Re: random crashes on a firewall with OpenBSD 4.5-stable
ropers wrote: > 2009/6/26 Jussi Peltola : >> memtest86+: it can prove it's broken, but if it doesn't >> find problems it doesn't guarantee there are none. This is correct. > I've said this before, but I've yet to see faulty RAM whose problems > memtest86+ will not detect during a 24hr burn-in test. Sure, I've seen > faulty RAM that memtest86+ said was ok during a single-pass test, but > if you let memtest86+ run for 24 hours it'll probably find just about > any error. > > Of course, depending on your circumstances it may be more economical > to just chuck the suspect RAM instead of wasting 24 hours. And > granted, YMMV. But if anyone has ever seen any faulty RAM whose > problems a 24hr burn-in test with memtest86+ could not detect, I'd be > very interested in hearing that. How about this... Some years ago, Walmart had some Athlon-based "$200 PCs", they used SDRAM, 100MHz, IIRC. For giggles one day, I stuck some oddball SDRAM modules in one of them, and not too surprisingly, the thing failed to boot. In addition to being blatantly the wrong speed (66MHz), it was some really odd junk that didn't work in much of anything that used normal SDRAM of any speed. I didn't expect it to work, it confirmed my expectations; any OS that was loaded on the disk refused to boot very far before barfing all over itself with this RAM installed. Having enjoyed that part (yeah, I'm oddly amused), I figured running memtest86 would be an interesting test. Well, I'm somewhat disturbed to say that memtest86 had no problem "testing" all that junk^Woddball RAM, and told me it was all perfect. It may have been..but it certainly didn't work in that machine, and yet, it passed every diagnostic memtest86 threw at it for hours. I don't recall how long I left it cooking, but I know it was more than 24 hours, and I think it was for a few days before I needed that bit of shelf space and shut down the test. You can argue that memtest86 was correct that the memory itself was good, but since no other OS seemed to be able to make that RAM work in that machine, I don't think it is a very convincing argument -- that machine's memory subsystem was clearly broke, the diagnosis easy to confirm (swap RAM, system works, swap back, system won't boot). Yes, I think this qualifies as a memtest86 bug, as it missed something very basic, and maybe it's been fixed by now, but it still proves the point: passing diagnostics only means the diagnostics didn't find anything, it doesn't mean things are good. (years before that, I saw a great demonstration warning of this: one of the machines I sold and support had a very good internal diagnostic that included a looping RAM test, BUT if you installed the DIP (the old, traditional IC style RAM) with pin 1 not properly plugged into the socket, the system would pass the internal diagnostics very well as long as you wished to run them. You see, this machine's diagnostics tested RAM in 64k pages. Pin #1 on a 256kbit RAM chip happened to be used to pick which of the four(1) 64k pages were selected on the RAM chip, so the diagnostics happened to just test the same 64k bits of that chip four times, and said, "no problems found", even though the OS or apps would crash rather soon after booting. Fortunately, my then young eyes were good enough to spot the pin bent under the socket.) memtest86 is a very impressive memory diagnostic program, it does good things and does them well, but passing memtest86, as with any diagnostic, just means "no problem FOUND". Nick. (1) those thinking, "hey, one pin can't select more than TWO pages of RAM" need not try to correct me, I'm right, you don't understand how this stuff works. :) Hint: the chips had only 16 pins, including power, data, address, ground...and yet had an org of 256k X 1bit)
Re: random crashes on a firewall with OpenBSD 4.5-stable
On Sun, Aug 2, 2009 at 10:45 AM, ropers wrote: > Of course, depending on your circumstances it may be more economical > to just chuck the suspect RAM instead of wasting 24 hours. And > granted, YMMV. But if anyone has ever seen any faulty RAM whose > problems a 24hr burn-in test with memtest86+ could not detect, I'd be > very interested in hearing that. > > regards, > --ropers > > I have seen errors appear after more than 24 hours: 36 or 48 hours. Yes, it can happen. Cheers, -- Mattieu Baptiste "/earth is 102% full ... please delete anyone you can."
Re: random crashes on a firewall with OpenBSD 4.5-stable
2009/6/26 Jussi Peltola : > memtest86+: it can prove it's broken, but if it doesn't > find problems it doesn't guarantee there are none. I've said this before, but I've yet to see faulty RAM whose problems memtest86+ will not detect during a 24hr burn-in test. Sure, I've seen faulty RAM that memtest86+ said was ok during a single-pass test, but if you let memtest86+ run for 24 hours it'll probably find just about any error. Of course, depending on your circumstances it may be more economical to just chuck the suspect RAM instead of wasting 24 hours. And granted, YMMV. But if anyone has ever seen any faulty RAM whose problems a 24hr burn-in test with memtest86+ could not detect, I'd be very interested in hearing that. regards, --ropers
Re: random crashes on a firewall with OpenBSD 4.5-stable
On 2009-08-01, Comete wrote: > as suggested, i finally changed the RAID controler (Compaq Smart Array > 431) with the same device and the firewall still crashes with the same > error. I think the idea was probably to change it for a completely different type of device (or for just a plain SCSI controller). > So if it is not the hardware could it be the software ? yes, quite possibly the cac(4) driver. > How can i submit a bug report and where ? What informations do you need ? > what can i do or type when i get the error with the debug prompt ? as a starter: trace, ps.
Re: random crashes on a firewall with OpenBSD 4.5-stable
Hello, as suggested, i finally changed the RAID controler (Compaq Smart Array 431) with the same device and the firewall still crashes with the same error. So if it is not the hardware could it be the software ? How can i submit a bug report and where ? What informations do you need ? what can i do or type when i get the error with the debug prompt ? It seems that the firewall now crashes 3 times a week and it's very annoying. Thanks for the help :) Michal a icrit : Other servers?? I don't mean PDU, I mean PSU...the power supply in the server. If your shearing a power supply across 2 servers I would be shocked :) -Original Message- From: Comete [mailto:com...@daknet.org] Sent: 26 June 2009 13:48 To: Michal Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable No problem with the PSU and voltage limits. The PSU isn't used at its full capacity and the other servers plugged on it work well. Could it be a bad network interface ? Michal a icrit : Just stabbing the dark here, test your Voltage Rails on your PSU. Check they are within limits. I find unexplained crash's can be traced back to PSU's quite often -Original Message- From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of Comhte Sent: 26 June 2009 12:22 To: Misc OpenBSD Cc: Daniel Gracia Garallar Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable Well i have tested the RAM with memtest, no error. maybe another idea ? Thanks Daniel Gracia Garallar a C)crit : Oh and maybe bad RAM; I've hit some nasty errors with these faulty DIMMs... :/ ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX, FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev
Re: random crashes on a firewall with OpenBSD 4.5-stable
Can't read that? Custom compiled kernel and cac error speaks by themselves; dirty solution, try other disk controller. Best solution, discard you don't have bad hardware and, if everything is ok, make contact with developers and help searching for a code patch to improve the RAID adapter driver. Regards! Dani ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus2 at ahc1: 16 targets, initiator 7 st0 at scsibus2 targ 6 lun 0: SCSI2 1/sequential removable fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64 inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling iic0 at piixpm0 iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 02=0303 03=0101 04=7f7f 05=0404 06=0303 07= spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2 pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x92: DMA atapiscsi0 at pciide0 channel 0 drive 0 scsibus3 at atapiscsi0: 2 targets cd0 a
Re: random crashes on a firewall with OpenBSD 4.5-stable
On 2009-06-26, Michal wrote: > Well, you can check the Volt readings in the bios, most will give you a > reading, but I am sure there is some BSD software out there "sysctl hw.sensors" works for some systems. also see sensorsd(8).
Re: random crashes on a firewall with OpenBSD 4.5-stable
Overheating? On 26 jun 2009, at 17.50, Michal wrote: Well, you can check the Volt readings in the bios, most will give you a reading, but I am sure there is some BSD software out there, maybe someone in the list will know. On windows you can use Speedfan. Even if it's not this, it's worth knowing how to check this as a simple check on servers -Original Message- From: Comhte [mailto:com...@daknet.org] Sent: 26 June 2009 16:42 To: Michal Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable Oh sorry :p How could i test the power supply unit ? Michal a icrit : Other servers?? I don't mean PDU, I mean PSU...the power supply in the server. If your shearing a power supply across 2 servers I would be shocked :) -Original Message- From: Comete [mailto:com...@daknet.org] Sent: 26 June 2009 13:48 To: Michal Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable No problem with the PSU and voltage limits. The PSU isn't used at its full capacity and the other servers plugged on it work well. Could it be a bad network interface ? Michal a icrit : Just stabbing the dark here, test your Voltage Rails on your PSU. Check they are within limits. I find unexplained crash's can be traced back to PSU's quite often -Original Message- From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of Comhte Sent: 26 June 2009 12:22 To: Misc OpenBSD Cc: Daniel Gracia Garallar Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable Well i have tested the RAM with memtest, no error. maybe another idea ? Thanks Daniel Gracia Garallar a C)crit : Oh and maybe bad RAM; I've hit some nasty errors with these faulty DIMMs... :/ ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU ,V86 ,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX, FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/ direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "A
Re: random crashes on a firewall with OpenBSD 4.5-stable
But even measuring the ripple with a scope won't guarantee it's OK. Swapping out all of the hardware is sometimes the only way to find out. Same goes for memtest86+: it can prove it's broken, but if it doesn't find problems it doesn't guarantee there are none. -- Jussi Peltola
Re: random crashes on a firewall with OpenBSD 4.5-stable
Well, you can check the Volt readings in the bios, most will give you a reading, but I am sure there is some BSD software out there, maybe someone in the list will know. On windows you can use Speedfan. Even if it's not this, it's worth knowing how to check this as a simple check on servers -Original Message- From: Comhte [mailto:com...@daknet.org] Sent: 26 June 2009 16:42 To: Michal Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable Oh sorry :p How could i test the power supply unit ? Michal a icrit : > Other servers?? I don't mean PDU, I mean PSU...the power supply in the > server. If your shearing a power supply across 2 servers I would be shocked > :) > > -Original Message- > From: Comete [mailto:com...@daknet.org] > Sent: 26 June 2009 13:48 > To: Michal > Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable > > No problem with the PSU and voltage limits. The PSU isn't used at its > full capacity and the other servers plugged on it work well. > > Could it be a bad network interface ? > > Michal a icrit : >> Just stabbing the dark here, test your Voltage Rails on your PSU. Check > they >> are within limits. I find unexplained crash's can be traced back to PSU's >> quite often >> >> -Original Message- >> From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of >> Comhte >> Sent: 26 June 2009 12:22 >> To: Misc OpenBSD >> Cc: Daniel Gracia Garallar >> Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable >> >> Well i have tested the RAM with memtest, no error. >> >> maybe another idea ? >> >> Thanks >> >> Daniel Gracia Garallar a C)crit : >>> Oh and maybe bad RAM; I've hit some nasty errors with these faulty >>> DIMMs... :/ >>> >>> ComC(te escribiC3: >>>> Hi, >>>> >>>> we are using the last OpenBSD 4.5-stable release on an old Compaq >>>> Proliant ML350 as a firewall with spamd. But we encounter randomly >>>> some system crashes (once a week or two weeks). The system always >>>> displays the same message: >>>> >>>> uvm_fault (0xd080d9e00x0,0,1) -> e >>>> >>>> kernel: page fault trap, code=0 >>>> >>>> Stopped at cac_pci_l0_intr_pending+0xb >>>> push 0x34 (%eax) >>>> >>>> What do you think it could be ? I thought about maybe a hardware >>>> problem but where exactly... >>>> >>>> I join my dmesg below >>>> >>>> Thanks for your advice ! >>>> >>>> OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 >>>> r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC >>>> cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" >>>> 686-class) 1.27 GHz >>>> cpu0: >>>> > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX, >> FXSR,SSE >>>> real mem = 267988992 (255MB) >>>> avail mem = 250839040 (239MB) >>>> mainbus0 at root >>>> bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ >>>> 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) >>>> bios0: vendor Compaq version "D11" date 01/29/2002 >>>> bios0: Compaq ProLiant ML350 G2 >>>> acpi0 at bios0: rev 0 >>>> acpi0: tables DSDT FACP APIC SPCR >>>> acpi0: wakeup devices PBTN(S5) >>>> acpitimer0 at acpi0: 3579545 Hz, 32 bits >>>> acpimadt0 at acpi0 addr 0xfee0: PC-AT compat >>>> cpu0 at mainbus0: apid 3 (boot processor) >>>> cpu0: apic clock running at 132MHz >>>> ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins >>>> ioapic0: misconfigured as apic 0, remapped to apid 8 >>>> ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins >>>> ioapic1: misconfigured as apic 0, remapped to apid 2 >>>> acpiprt0 at acpi0: bus 0 (PCI0) >>>> acpiprt1 at acpi0: bus 2 (PCI1) >>>> acpicpu0 at acpi0 >>>> acpitz0 at acpi0: critical temperature 31 degC >>>> acpibtn0 at acpi0: PBTN >>>> bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 >>>> 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! >>>> pci0 at mainbus0 bus 0: configuration mode 1 (bios) >>>> pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 >>>> pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host&
Re: random crashes on a firewall with OpenBSD 4.5-stable
On Thu, Jun 25, 2009 at 05:23:40PM +0200, Com??te wrote: > Hi, > > we are using the last OpenBSD 4.5-stable release on an old Compaq > Proliant ML350 as a firewall with spamd. But we encounter randomly some > system crashes (once a week or two weeks). The system always displays > the same message: > > uvm_fault (0xd080d9e00x0,0,1) -> e > > kernel: page fault trap, code=0 > > Stopped at cac_pci_l0_intr_pending+0xb > push 0x34 (%eax) The function name gives it away. 99% cac(4) is causing your problem. Connect the drive to another scsi controller and if possible disable the compaq thing in the bios. > > What do you think it could be ? I thought about maybe a hardware problem > but where exactly... > > I join my dmesg below > > Thanks for your advice ! > > OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 > r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC > cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" > 686-class) 1.27 GHz > cpu0: > FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE > real mem = 267988992 (255MB) > avail mem = 250839040 (239MB) > mainbus0 at root > bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, > SMBIOS rev. 2.3 @ 0xec000 (31 entries) > bios0: vendor Compaq version "D11" date 01/29/2002 > bios0: Compaq ProLiant ML350 G2 > acpi0 at bios0: rev 0 > acpi0: tables DSDT FACP APIC SPCR > acpi0: wakeup devices PBTN(S5) > acpitimer0 at acpi0: 3579545 Hz, 32 bits > acpimadt0 at acpi0 addr 0xfee0: PC-AT compat > cpu0 at mainbus0: apid 3 (boot processor) > cpu0: apic clock running at 132MHz > ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins > ioapic0: misconfigured as apic 0, remapped to apid 8 > ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins > ioapic1: misconfigured as apic 0, remapped to apid 2 > acpiprt0 at acpi0: bus 0 (PCI0) > acpiprt1 at acpi0: bus 2 (PCI1) > acpicpu0 at acpi0 > acpitz0 at acpi0: critical temperature 31 degC > acpibtn0 at acpi0: PBTN > bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 > 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! > pci0 at mainbus0 bus 0: configuration mode 1 (bios) > pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 > pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 > pci1 at pchb1 bus 2 > em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic > 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 > em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic > 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d > re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: > RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e > rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 > cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: > apic 2 int 6 (irq 11), Smart Array 431 > scsibus0 at cac0: 1 targets > sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed > sd0: 34727MB, 512 bytes/sec, 71122560 sec total > re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: > RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 > rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 > fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int > 10 (irq 5), address 00:02:a5:44:33:f7 > inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 > ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 > int 11 (irq 11) > scsibus1 at ahc0: 16 targets, initiator 7 > ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 > int 11 (irq 11) > scsibus2 at ahc1: 16 targets, initiator 7 > st0 at scsibus2 targ 6 lun 0: SCSI2 > 1/sequential removable > fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int > 13 (irq 10), address 00:08:02:45:29:64 > inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 > vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 > wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) > wsdisplay0: screen 1-5 added (80x25, vt100 emulation) > "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured > piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling > iic0 at piixpm0 > iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 > 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff > 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 > 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 > 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b > 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 > a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 > 02=0303 03=0101 04=7f7f 05=0404 06=0303 07= > spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2 > pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev
Re: random crashes on a firewall with OpenBSD 4.5-stable
No problem with the PSU and voltage limits. The PSU isn't used at its full capacity and the other servers plugged on it work well. Could it be a bad network interface ? Michal a icrit : Just stabbing the dark here, test your Voltage Rails on your PSU. Check they are within limits. I find unexplained crash's can be traced back to PSU's quite often -Original Message- From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of Comhte Sent: 26 June 2009 12:22 To: Misc OpenBSD Cc: Daniel Gracia Garallar Subject: Re: random crashes on a firewall with OpenBSD 4.5-stable Well i have tested the RAM with memtest, no error. maybe another idea ? Thanks Daniel Gracia Garallar a C)crit : Oh and maybe bad RAM; I've hit some nasty errors with these faulty DIMMs... :/ ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX, FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus2 at ahc1: 16 targets, initiator 7 st0 at scsibus2 targ 6 lun 0: SCSI2 1/sequential removable fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64 inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling iic0 at piixpm0 iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f
Re: random crashes on a firewall with OpenBSD 4.5-stable
Well i have tested the RAM with memtest, no error. maybe another idea ? Thanks Daniel Gracia Garallar a C)crit : Oh and maybe bad RAM; I've hit some nasty errors with these faulty DIMMs... :/ ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus2 at ahc1: 16 targets, initiator 7 st0 at scsibus2 targ 6 lun 0: SCSI2 1/sequential removable fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64 inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling iic0 at piixpm0 iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 02=0303 03=0101 04=7f7f 05=0404 06=0303 07= spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2 pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x92: DMA atapiscsi0 at pciide0 channel 0 drive 0 scsibus3 at atapiscsi0: 2 targets cd0 at scsibus3 targ 0 lun 0: ATAPI 5/cdrom removable cd0(pciide0:0:0): using PIO mode 4, DMA mode 2 ohci0 at pci0 dev
Re: random crashes on a firewall with OpenBSD 4.5-stable
Oh and maybe bad RAM; I've hit some nasty errors with these faulty DIMMs... :/ ComC(te escribiC3: Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus2 at ahc1: 16 targets, initiator 7 st0 at scsibus2 targ 6 lun 0: SCSI2 1/sequential removable fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64 inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling iic0 at piixpm0 iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 02=0303 03=0101 04=7f7f 05=0404 06=0303 07= spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2 pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x92: DMA atapiscsi0 at pciide0 channel 0 drive 0 scsibus3 at atapiscsi0: 2 targets cd0 at scsibus3 targ 0 lun 0: ATAPI 5/cdrom removable cd0(pciide0:0:0): using PIO mode 4, DMA mode 2 ohci0 at pci0 dev 15 function 2 "ServerWorks OSB4/CSB5 USB" rev 0x05: apic 8 int 10 (irq 10), version 1.0, legacy support pchb2 at pci0
random crashes on a firewall with OpenBSD 4.5-stable
Hi, we are using the last OpenBSD 4.5-stable release on an old Compaq Proliant ML350 as a firewall with spamd. But we encounter randomly some system crashes (once a week or two weeks). The system always displays the same message: uvm_fault (0xd080d9e00x0,0,1) -> e kernel: page fault trap, code=0 Stopped at cac_pci_l0_intr_pending+0xb push 0x34 (%eax) What do you think it could be ? I thought about maybe a hardware problem but where exactly... I join my dmesg below Thanks for your advice ! OpenBSD 4.5-stable (GENERIC) #9: Sun May 17 22:59:17 CEST 2009 r...@arwen.saintlo.fr:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Pentium(R) III CPU family 1266MHz ("GenuineIntel" 686-class) 1.27 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE real mem = 267988992 (255MB) avail mem = 250839040 (239MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 12/31/99, BIOS32 rev. 0 @ 0xf, SMBIOS rev. 2.3 @ 0xec000 (31 entries) bios0: vendor Compaq version "D11" date 01/29/2002 bios0: Compaq ProLiant ML350 G2 acpi0 at bios0: rev 0 acpi0: tables DSDT FACP APIC SPCR acpi0: wakeup devices PBTN(S5) acpitimer0 at acpi0: 3579545 Hz, 32 bits acpimadt0 at acpi0 addr 0xfee0: PC-AT compat cpu0 at mainbus0: apid 3 (boot processor) cpu0: apic clock running at 132MHz ioapic0 at mainbus0: apid 8 pa 0xfec0, version 11, 16 pins ioapic0: misconfigured as apic 0, remapped to apid 8 ioapic1 at mainbus0: apid 2 pa 0xfec01000, version 11, 16 pins ioapic1: misconfigured as apic 0, remapped to apid 2 acpiprt0 at acpi0: bus 0 (PCI0) acpiprt1 at acpi0: bus 2 (PCI1) acpicpu0 at acpi0 acpitz0 at acpi0: critical temperature 31 degC acpibtn0 at acpi0: PBTN bios0: ROM list: 0xc/0x8000 0xc8000/0x1800 0xc9800/0x1800 0xcb000/0x1800 0xcc800/0x4000! 0xd0800/0x1800 0xee000/0x2000! pci0 at mainbus0 bus 0: configuration mode 1 (bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20LE Host" rev 0x06 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20LE Host" rev 0x06 pci1 at pchb1 bus 2 em0 at pci1 dev 1 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 0 (irq 5), address 00:02:b3:b9:0d:a4 em1 at pci1 dev 2 function 0 "Intel PRO/1000T (82544GC)" rev 0x02: apic 2 int 2 (irq 15), address 00:02:b3:b9:0d:7d re0 at pci1 dev 3 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 4 (irq 15), address 00:1c:f0:6f:38:7e rgephy0 at re0 phy 7: RTL8169S/8110S PHY, rev. 3 cac0 at pci1 dev 4 function 0 "DEC Compaq SMART RAID 42xx" rev 0x01: apic 2 int 6 (irq 11), Smart Array 431 scsibus0 at cac0: 1 targets sd0 at scsibus0 targ 0 lun 0: SCSI2 0/direct fixed sd0: 34727MB, 512 bytes/sec, 71122560 sec total re1 at pci1 dev 5 function 0 "D-Link Systems DGE-528T" rev 0x10: RTL8169/8110SB (0x1000), apic 2 int 8 (irq 15), address 00:1c:f0:62:eb:12 rgephy1 at re1 phy 7: RTL8169S/8110S PHY, rev. 3 fxp0 at pci0 dev 1 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 10 (irq 5), address 00:02:a5:44:33:f7 inphy0 at fxp0 phy 1: i82555 10/100 PHY, rev. 4 ahc0 at pci0 dev 2 function 0 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus1 at ahc0: 16 targets, initiator 7 ahc1 at pci0 dev 2 function 1 "Adaptec AHA-3960D U160" rev 0x01: apic 2 int 11 (irq 11) scsibus2 at ahc1: 16 targets, initiator 7 st0 at scsibus2 targ 6 lun 0: SCSI2 1/sequential removable fxp1 at pci0 dev 4 function 0 "Intel 8255x" rev 0x08, i82559: apic 2 int 13 (irq 10), address 00:08:02:45:29:64 inphy1 at fxp1 phy 1: i82555 10/100 PHY, rev. 4 vga1 at pci0 dev 5 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) "Compaq Netelligent ASMC" rev 0x00 at pci0 dev 6 function 0 not configured piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x92: polling iic0 at piixpm0 iic0: addr 0x28 00=a0 01=10 02=03 03=01 04=7f 05=04 06=03 07=00 08=00 09=00 0b=00 0c=03 0d=41 0e=02 0f=00 10=00 11=05 18=3a 19=10 20=ff 21=ff 28=00 29=00 2a=04 2b=00 2c=00 2d=00 2e=00 30=00 31=00 32=00 38=00 39=00 3a=00 3b=00 3c=00 3d=00 3e=00 40=08 41=08 42=80 48=03 49=03 4a=03 50=00 51=80 58=00 59=00 60=f0 61=f0 68=af 69=af 70=ff 71=00 78=ff 79=ff 80=2b 81=37 82=ff 88=f0 89=f0 8a=f0 90=3c 91=46 92=ff 98=37 99=41 9a=ff a0=22 a1=2d a2=80 a8=ff a9=ff b0=00 b1=00 b8=06 b9=00 words 00=a0a0 01=1010 02=0303 03=0101 04=7f7f 05=0404 06=0303 07= spdmem0 at iic0 addr 0x50: 256MB SDRAM registered ECC PC133CL2 pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x92: DMA atapiscsi0 at pciide0 channel 0 drive 0 scsibus3 at atapiscsi0: 2 targets cd0 at scsibus3 targ 0 lun 0: ATAPI 5/cdrom removable cd0(pciide0:0:0): using PIO mode 4, DMA mode 2 ohci0 at pci0 dev 15 function 2 "ServerWorks OSB4/CSB5 USB" rev 0x05: apic 8 int 10 (irq 10), version 1.0, legacy support pchb2 at pci0 dev 15 function 3 "ServerWorks CSB5 LPC" rev 0x00 usb0 at ohci0: USB revision 1.0 uhub0 at usb0 "ServerWo