Hi, I upgraded yesterday a SunFire V100 which serves as an ntpd server. It was running 5.5 before and had 330 days of uptime (which seems to eliminaite hw failure).
Running 5.9 the box crashed twice already each time after ca 4 hours of uptime with the same panic. Detais below. Any idea/patch/things to look for under ddb next time it happens ? panic: psycho0: uncorrectable DMA error AFAR 6e866250 (pa=0 tte=0/69270012) AFSR 410000ff40800000 Stopped at Debugger+0x8: nop TID PID UID PRFLAGS PFLAGS CPU COMMAND *16805 16805 83 0x100010 0 0 ntpd psycho_ue(400008a3200, 5a, 2, 40000a1e050, 4000fab5700, 127) at psycho_ue+0x7c intr_handler(e0017ec8, 400008a3300, 5facef, 800, e364, ffff) at intr_handler+0x c sparc_interrupt(18960f8, 400090cf900, 1695200, 0, 0, 400091005a0) at sparc_inte rrupt+0x298 pool_put(18960f8, 400090cf900, 400090cf920, 0, 70197ff, 0) at pool_put+0x1dc m_free(400090cf900, 9, 400090cf900, 0, 0, 1) at m_free+0x9c m_freem(400090cf900, 4000fab5b78, 4000fab5b90, 0, 0, 0) at m_freem+0xc sendit(0, 9, 0, 0, 4000fab5df0, 14b) at sendit+0x2bc sys_sendto(40009177690, 4000fab5db0, 4000fab5df0, cb70bfd1cc, 0, 14b) at sys_se ndto+0x68 syscall(4000fab5ed0, 485, cb70c04918, cb70c0491c, 0, 0) at syscall+0x34c softtrap(9, fffffffffffdfdbc, 30, 0, fffffffffffdfe20, 10) at softtrap+0x19c http://www.openbsd.org/ddb.html describes the minimum info required in bug reports. Insufficient info makes it difficult to find and fix bugs. ddb> LOM event: +330d+14h1m38s host FAULT: watchdog triggered trace psycho_ue(400008a3200, 5a, 2, 40000a1e050, 4000fab5700, 127) at psycho_ue+0x7c intr_handler(e0017ec8, 400008a3300, 5facef, 800, e364, ffff) at intr_handler+0x c sparc_interrupt(18960f8, 400090cf900, 1695200, 0, 0, 400091005a0) at sparc_inte rrupt+0x298 pool_put(18960f8, 400090cf900, 400090cf920, 0, 70197ff, 0) at pool_put+0x1dc m_free(400090cf900, 9, 400090cf900, 0, 0, 1) at m_free+0x9c m_freem(400090cf900, 4000fab5b78, 4000fab5b90, 0, 0, 0) at m_freem+0xc sendit(0, 9, 0, 0, 4000fab5df0, 14b) at sendit+0x2bc sys_sendto(40009177690, 4000fab5db0, 4000fab5df0, cb70bfd1cc, 0, 14b) at sys_se ndto+0x68 syscall(4000fab5ed0, 485, cb70c04918, cb70c0491c, 0, 0) at syscall+0x34c softtrap(9, fffffffffffdfdbc, 30, 0, fffffffffffdfe20, 10) at softtrap+0x19c ddb> ps TID PPID PGRP UID S FLAGS WAIT COMMAND 18314 4143 18314 0 3 0x83 poll systat 4143 30172 4143 0 3 0x10008b pause ksh 30172 2827 30172 0 3 0x92 select sshd 10502 1 10502 0 3 0x100083 ttyin getty 16663 1 16663 0 3 0x100098 poll cron 10518 1 10518 110 3 0x100090 poll sndiod 27012 1 27012 99 3 0x100090 poll sndiod 7526 31292 31292 95 3 0x100090 kqread smtpd 15286 31292 31292 95 3 0x100090 kqread smtpd 7378 31292 31292 95 3 0x100090 kqread smtpd 813 31292 31292 95 3 0x100090 kqread smtpd 7395 31292 31292 95 3 0x100090 kqread smtpd 13554 31292 31292 103 3 0x100090 kqread smtpd 31292 1 31292 0 3 0x100080 kqread smtpd 2827 1 2827 0 3 0x80 select sshd 24960 16805 20173 83 3 0x100090 poll ntpd *16805 20173 20173 83 7 0x100010 ntpd 20173 1 20173 0 3 0x100080 poll ntpd 32115 28778 28778 74 3 0x100090 bpf pflogd 28778 1 28778 0 3 0x80 netio pflogd 25537 31067 31067 73 3 0x100090 kqread syslogd 31067 1 31067 0 3 0x100080 netio syslogd 32003 0 0 0 3 0x14200 pgzero zerothread 4880 0 0 0 3 0x14200 aiodoned aiodoned 2947 0 0 0 3 0x14200 syncer update 2812 0 0 0 3 0x14200 cleaner cleaner 24150 0 0 0 3 0x14200 reaper reaper 15162 0 0 0 3 0x14200 pgdaemon pagedaemon 24138 0 0 0 3 0x14200 bored crypto 22463 0 0 0 3 0x14200 pftm pfpurge 20852 0 0 0 3 0x14200 usbtsk usbtask 21476 0 0 0 3 0x14200 usbatsk usbatsk 11816 0 0 0 3 0x14200 bored sensors 16023 0 0 0 3 0x14200 bored softnet 22770 0 0 0 3 0x14200 bored systqmp 20377 0 0 0 3 0x14200 bored systq 16037 0 0 0 3 0x40014200 idle0 10456 0 0 0 3 0x14200 kmalloc kmthread 1 0 1 0 3 0x82 wait init 0 -1 0 0 3 0x10200 scheduler swapper ddb> Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. Copyright (c) 1995-2016 OpenBSD. All rights reserved. http://www.OpenBSD.org OpenBSD 5.9 (GENERIC) #914: Fri Feb 26 04:04:42 MST 2016 dera...@sparc64.openbsd.org:/usr/src/sys/arch/sparc64/compile/GENERIC real mem = 536870912 (512MB) avail mem = 511688704 (487MB) mpath0 at root scsibus0 at mpath0: 256 targets mainbus0 at root: Sun Fire V100 (UltraSPARC-IIe 500MHz) cpu0 at mainbus0: SUNW,UltraSPARC-IIe (rev 1.4) @ 500 MHz cpu0: physical 16K instruction (32 b/l), 16K data (32 b/l), 256K external (64 b/l) psycho0 at mainbus0: SUNW,sabre, impl 0, version 0, ign 7c0 psycho0: bus range 0-0, PCI bus 0 psycho0: dvma map 60000000-7fffffff pci0 at psycho0 ebus0 at pci0 dev 7 function 0 "Acer Labs M1533 ISA" rev 0x00 "dma" at ebus0 addr 0-ffff ivec 0x2a not configured rtc0 at ebus0 addr 70-71: m5819 power0 at ebus0 addr 2000-2007 ivec 0x23 lom0 at ebus0 addr 8010-8011 ivec 0x2a: LOMlite2 rev 3.11 com0 at ebus0 addr 3f8-3ff ivec 0x2b: ns16550a, 16 byte fifo com0: console com1 at ebus0 addr 2e8-2ef ivec 0x2b: ns16550a, 16 byte fifo "flashprom" at ebus0 addr 0-7ffff not configured alipm0 at pci0 dev 3 function 0 "Acer Labs M7101 Power" rev 0x00: 74KHz clock iic0 at alipm0 "max1617" at alipm0 addr 0x18 skipped due to alipm0 bugs spdmem0 at iic0 addr 0x56: 256MB SDRAM registered ECC PC133CL2 spdmem1 at iic0 addr 0x57: 256MB SDRAM registered ECC PC133CL2 dc0 at pci0 dev 12 function 0 "Davicom DM9102" rev 0x31: ivec 0x7c6, address 00:03:ba:14:db:a9 amphy0 at dc0 phy 1: DM9102 10/100 PHY, rev. 0 dc1 at pci0 dev 5 function 0 "Davicom DM9102" rev 0x31: ivec 0x7dc, address 00:03:ba:14:db:aa amphy1 at dc1 phy 1: DM9102 10/100 PHY, rev. 0 ohci0 at pci0 dev 10 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x7e4, version 1.0, legacy support pciide0 at pci0 dev 13 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc3: DMA, channel 0 configured to native-PCI, channel 1 configured to native-PCI pciide0: using ivec 0x7cc for native-PCI interrupt pciide0: channel 0 disabled (no drives) wd0 at pciide0 channel 1 drive 0: <ST340016A> wd0: 16-sector PIO, LBA, 38166MB, 78165360 sectors atapiscsi0 at pciide0 channel 1 drive 1 scsibus1 at atapiscsi0: 2 targets cd0 at scsibus1 targ 0 lun 0: <TEAC, CD-224E, 1.7A> ATAPI 5/cdrom removable wd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2 cd0(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 2 usb0 at ohci0: USB revision 1.0 uhub0 at usb0 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1 vscsi0 at root scsibus2 at vscsi0: 256 targets softraid0 at root scsibus3 at softraid0: 256 targets bootpath: /pci@1f,0/ide@d,0/disk@2,0 root on wd0a (c78b8744c5af954f.a) swap on wd0b dump on wd0b -- Matthieu Herrb
signature.asc
Description: PGP signature