Hi,

I upgraded yesterday a SunFire V100 which serves as an ntpd server. It
was running 5.5 before and had 330 days of uptime (which seems to
eliminaite hw failure).

Running 5.9 the box crashed twice already each time after ca 4 hours
of uptime with the same panic. Detais below.

Any idea/patch/things to look for under ddb next time it happens ?

 panic: psycho0: uncorrectable DMA error AFAR 6e866250 (pa=0 tte=0/69270012) 
AFSR 410000ff40800000
Stopped at      Debugger+0x8:   nop
   TID    PID    UID     PRFLAGS     PFLAGS  CPU  COMMAND
*16805  16805     83    0x100010          0    0  ntpd
psycho_ue(400008a3200, 5a, 2, 40000a1e050, 4000fab5700, 127) at psycho_ue+0x7c
intr_handler(e0017ec8, 400008a3300, 5facef, 800, e364, ffff) at intr_handler+0x
c
sparc_interrupt(18960f8, 400090cf900, 1695200, 0, 0, 400091005a0) at sparc_inte
rrupt+0x298
pool_put(18960f8, 400090cf900, 400090cf920, 0, 70197ff, 0) at pool_put+0x1dc
m_free(400090cf900, 9, 400090cf900, 0, 0, 1) at m_free+0x9c
m_freem(400090cf900, 4000fab5b78, 4000fab5b90, 0, 0, 0) at m_freem+0xc
sendit(0, 9, 0, 0, 4000fab5df0, 14b) at sendit+0x2bc
sys_sendto(40009177690, 4000fab5db0, 4000fab5df0, cb70bfd1cc, 0, 14b) at sys_se
ndto+0x68
syscall(4000fab5ed0, 485, cb70c04918, cb70c0491c, 0, 0) at syscall+0x34c
softtrap(9, fffffffffffdfdbc, 30, 0, fffffffffffdfe20, 10) at softtrap+0x19c
http://www.openbsd.org/ddb.html describes the minimum info required in bug
reports.  Insufficient info makes it difficult to find and fix bugs.
ddb>
LOM event: +330d+14h1m38s host FAULT: watchdog triggered
trace
psycho_ue(400008a3200, 5a, 2, 40000a1e050, 4000fab5700, 127) at psycho_ue+0x7c
intr_handler(e0017ec8, 400008a3300, 5facef, 800, e364, ffff) at intr_handler+0x
c
sparc_interrupt(18960f8, 400090cf900, 1695200, 0, 0, 400091005a0) at sparc_inte
rrupt+0x298
pool_put(18960f8, 400090cf900, 400090cf920, 0, 70197ff, 0) at pool_put+0x1dc
m_free(400090cf900, 9, 400090cf900, 0, 0, 1) at m_free+0x9c
m_freem(400090cf900, 4000fab5b78, 4000fab5b90, 0, 0, 0) at m_freem+0xc
sendit(0, 9, 0, 0, 4000fab5df0, 14b) at sendit+0x2bc
sys_sendto(40009177690, 4000fab5db0, 4000fab5df0, cb70bfd1cc, 0, 14b) at sys_se
ndto+0x68
syscall(4000fab5ed0, 485, cb70c04918, cb70c0491c, 0, 0) at syscall+0x34c
softtrap(9, fffffffffffdfdbc, 30, 0, fffffffffffdfe20, 10) at softtrap+0x19c
ddb> ps
   TID   PPID   PGRP    UID  S       FLAGS  WAIT          COMMAND
 18314   4143  18314      0  3        0x83  poll          systat
  4143  30172   4143      0  3    0x10008b  pause         ksh
 30172   2827  30172      0  3        0x92  select        sshd
 10502      1  10502      0  3    0x100083  ttyin         getty
 16663      1  16663      0  3    0x100098  poll          cron
 10518      1  10518    110  3    0x100090  poll          sndiod
 27012      1  27012     99  3    0x100090  poll          sndiod
  7526  31292  31292     95  3    0x100090  kqread        smtpd
 15286  31292  31292     95  3    0x100090  kqread        smtpd
  7378  31292  31292     95  3    0x100090  kqread        smtpd
   813  31292  31292     95  3    0x100090  kqread        smtpd
  7395  31292  31292     95  3    0x100090  kqread        smtpd
 13554  31292  31292    103  3    0x100090  kqread        smtpd
 31292      1  31292      0  3    0x100080  kqread        smtpd
  2827      1   2827      0  3        0x80  select        sshd
 24960  16805  20173     83  3    0x100090  poll          ntpd
*16805  20173  20173     83  7    0x100010                ntpd
 20173      1  20173      0  3    0x100080  poll          ntpd
 32115  28778  28778     74  3    0x100090  bpf           pflogd
 28778      1  28778      0  3        0x80  netio         pflogd
 25537  31067  31067     73  3    0x100090  kqread        syslogd
 31067      1  31067      0  3    0x100080  netio         syslogd
 32003      0      0      0  3     0x14200  pgzero        zerothread
  4880      0      0      0  3     0x14200  aiodoned      aiodoned
  2947      0      0      0  3     0x14200  syncer        update
  2812      0      0      0  3     0x14200  cleaner       cleaner
 24150      0      0      0  3     0x14200  reaper        reaper
 15162      0      0      0  3     0x14200  pgdaemon      pagedaemon
 24138      0      0      0  3     0x14200  bored         crypto
 22463      0      0      0  3     0x14200  pftm          pfpurge
 20852      0      0      0  3     0x14200  usbtsk        usbtask
 21476      0      0      0  3     0x14200  usbatsk       usbatsk
 11816      0      0      0  3     0x14200  bored         sensors
 16023      0      0      0  3     0x14200  bored         softnet
 22770      0      0      0  3     0x14200  bored         systqmp
 20377      0      0      0  3     0x14200  bored         systq
 16037      0      0      0  3  0x40014200                idle0
 10456      0      0      0  3     0x14200  kmalloc       kmthread
     1      0      1      0  3        0x82  wait          init
     0     -1      0      0  3     0x10200  scheduler     swapper
ddb>

Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California.  All rights reserved.
Copyright (c) 1995-2016 OpenBSD. All rights reserved.  http://www.OpenBSD.org

OpenBSD 5.9 (GENERIC) #914: Fri Feb 26 04:04:42 MST 2016
    dera...@sparc64.openbsd.org:/usr/src/sys/arch/sparc64/compile/GENERIC
real mem = 536870912 (512MB)
avail mem = 511688704 (487MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root: Sun Fire V100 (UltraSPARC-IIe 500MHz)
cpu0 at mainbus0: SUNW,UltraSPARC-IIe (rev 1.4) @ 500 MHz
cpu0: physical 16K instruction (32 b/l), 16K data (32 b/l), 256K external (64 
b/l)
psycho0 at mainbus0: SUNW,sabre, impl 0, version 0, ign 7c0
psycho0: bus range 0-0, PCI bus 0
psycho0: dvma map 60000000-7fffffff
pci0 at psycho0
ebus0 at pci0 dev 7 function 0 "Acer Labs M1533 ISA" rev 0x00
"dma" at ebus0 addr 0-ffff ivec 0x2a not configured
rtc0 at ebus0 addr 70-71: m5819
power0 at ebus0 addr 2000-2007 ivec 0x23
lom0 at ebus0 addr 8010-8011 ivec 0x2a: LOMlite2 rev 3.11
com0 at ebus0 addr 3f8-3ff ivec 0x2b: ns16550a, 16 byte fifo
com0: console
com1 at ebus0 addr 2e8-2ef ivec 0x2b: ns16550a, 16 byte fifo
"flashprom" at ebus0 addr 0-7ffff not configured
alipm0 at pci0 dev 3 function 0 "Acer Labs M7101 Power" rev 0x00: 74KHz clock
iic0 at alipm0
"max1617" at alipm0 addr 0x18 skipped due to alipm0 bugs
spdmem0 at iic0 addr 0x56: 256MB SDRAM registered ECC PC133CL2
spdmem1 at iic0 addr 0x57: 256MB SDRAM registered ECC PC133CL2
dc0 at pci0 dev 12 function 0 "Davicom DM9102" rev 0x31: ivec 0x7c6, address 
00:03:ba:14:db:a9
amphy0 at dc0 phy 1: DM9102 10/100 PHY, rev. 0
dc1 at pci0 dev 5 function 0 "Davicom DM9102" rev 0x31: ivec 0x7dc, address 
00:03:ba:14:db:aa
amphy1 at dc1 phy 1: DM9102 10/100 PHY, rev. 0
ohci0 at pci0 dev 10 function 0 "Acer Labs M5237 USB" rev 0x03: ivec 0x7e4, 
version 1.0, legacy support
pciide0 at pci0 dev 13 function 0 "Acer Labs M5229 UDMA IDE" rev 0xc3: DMA, 
channel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide0: using ivec 0x7cc for native-PCI interrupt
pciide0: channel 0 disabled (no drives)
wd0 at pciide0 channel 1 drive 0: <ST340016A>
wd0: 16-sector PIO, LBA, 38166MB, 78165360 sectors
atapiscsi0 at pciide0 channel 1 drive 1
scsibus1 at atapiscsi0: 2 targets
cd0 at scsibus1 targ 0 lun 0: <TEAC, CD-224E, 1.7A> ATAPI 5/cdrom removable
wd0(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
cd0(pciide0:1:1): using PIO mode 4, Ultra-DMA mode 2
usb0 at ohci0: USB revision 1.0
uhub0 at usb0 "Acer Labs OHCI root hub" rev 1.00/1.00 addr 1
vscsi0 at root
scsibus2 at vscsi0: 256 targets
softraid0 at root
scsibus3 at softraid0: 256 targets
bootpath: /pci@1f,0/ide@d,0/disk@2,0
root on wd0a (c78b8744c5af954f.a) swap on wd0b dump on wd0b

-- 
Matthieu Herrb

Attachment: signature.asc
Description: PGP signature

Reply via email to