Hello All, Synopsis: spurious crash in softnet Category: kernel sgi Environment:
System: OpenBSD 6.0 Details: OpenBSD 6.0-beta (GENERIC-IP22) #664: Sun Jul 10 00:31:39 MDT 2016 Architecture: SGI (MIPS64) Machine: Challenge S R5000 Description: Machine seems to hang at (seemingly) random intervals. This has occurred on several recent snapshots including 10-Jul. I have reproduced the issue on multiple systems, so it doesn't seem to be a hardware issue. It may not be relevant but the machines are running an MTU of 1454 on sq1. How-To-Repeat: Seems to repeat itself given enough time, but I've not been able to tie it to any specific sequence of events. I will say that the machine typically does not run longer than a day (though it has on occasion). Fix: Unknown sq1: receive FIFO overflow Trap cause = 4 Frame 0xffffffff91f439b0 Trap PC 0xffffffff888b2be0 RA 0xffffffff888b2dbc fault 0xd97d3b7057b9cf7b pool_put+0xa8 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d18f0 sp 0xffffffff91f43b08,0 m_extfree+0x110 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d1fa0 sp 0xffffffff91f43ba2 m_free+0x138 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff888d20b0 sp 0xffffffff91f43bc8, 8 m_freem+0x28 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff88961b88 sp 0xffffffff91f43bf8, 2 in_arpinput+0x88 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff8892169c sp 0xffffffff91f43c4 ether_input+0x334 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2358) ra 0xffffffff8891df38 sp 0xffffffff91f432 User-level: pid 34898 stopped on non ddb fault Stopped at pool_put+0xa8: ld v0,8(v1) ddb> trace pool_put+0xa8 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d2 358) ra 0xffffffff888d18f0 sp 0xffffffff91f43b08, sz 160 m_extfree+0x110 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888 d2358) ra 0xffffffff888d1fa0 sp 0xffffffff91f43ba8, sz 32 m_free+0x138 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d23 58) ra 0xffffffff888d20b0 sp 0xffffffff91f43bc8, sz 48 m_freem+0x28 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff888d23 58) ra 0xffffffff88961b88 sp 0xffffffff91f43bf8, sz 32 in_arpinput+0x88 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff88 8d2358) ra 0xffffffff8892169c sp 0xffffffff91f43c18, sz 144 ether_input+0x334 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff8 88d2358) ra 0xffffffff8891df38 sp 0xffffffff91f43ca8, sz 112 if_input_process+0xf8 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,fffff fff888d2358) ra 0xffffffff888a3968 sp 0xffffffff91f43d18, sz 80 taskq_thread+0xd0 (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffffff8 88d2358) ra 0xffffffff88a797fc sp 0xffffffff91f43d68, sz 80 proc_trampoline+0x1c (1becdf323dc0c775,c0000000030a2800,c0000000030f87e0,ffffff ff888d2358) ra 0x0 sp 0xffffffff91f43db8, sz 0 User-level: pid 34898 ddb> ps TID PPID PGRP UID S FLAGS WAIT COMMAND 74365 1 74365 0 3 0x100083 ttyin getty 2436 1 2436 0 3 0x100098 poll cron 35314 58087 58087 619 3 0x82 kqread bandb 40008 58087 58087 619 3 0x82 kqread ssld 55415 58087 58087 619 3 0x82 kqread resolver 58087 1 58087 619 3 0x90 kqread ircd 82599 1 82599 0 3 0x80 select sshd 65889 67599 99686 83 3 0x100090 poll ntpd 67599 99686 99686 83 3 0x100090 poll ntpd 99686 1 99686 0 3 0x80 poll ntpd 97827 46866 46866 74 3 0x100090 bpf pflogd 46866 1 46866 0 3 0x80 netio pflogd 94958 52247 52247 73 2 0x100090 syslogd 52247 1 52247 0 3 0x100080 netio syslogd 8668 0 0 0 3 0x14200 pgzero zerothread 60775 0 0 0 3 0x14200 aiodoned aiodoned 87475 0 0 0 3 0x14200 syncer update 81321 0 0 0 3 0x14200 cleaner cleaner 75445 0 0 0 3 0x14200 reaper reaper 67147 0 0 0 3 0x14200 pgdaemon pagedaemon 26226 0 0 0 3 0x14200 bored crynlk 85686 0 0 0 3 0x14200 bored crypto 71123 0 0 0 3 0x14200 pftm pfpurge *34898 0 0 0 7 0x14210 softnet 93986 0 0 0 3 0x14200 bored systqmp 52845 0 0 0 3 0x14200 bored systq 16345 0 0 0 3 0x40014200 idle0 69755 0 0 0 3 0x14200 kmalloc kmthread 1 0 1 0 3 0x82 wait init 0 -1 0 0 3 0x10200 scheduler swapper ddb> show panic the kernel did not panic ddb> show registers at 0xffffffff88b60000 sysent+0xec0 v0 0xd97d3b7057b9cf73 v1 0xd97d3b7057b9cf73 a0 0x1becdf323dc0c775 a1 0xc0000000030a2800 a2 0xc0000000030f87e0 a3 0xffffffff888d2358 m_extfree_pool a4 0xffffffff91f43be6 end+0x92e34b6 a5 0x14 a6 0x18 a7 0x8 t0 0x4 t1 0xffffffff88c0e2f0 kernel_pmap_store t2 0 t3 0xffffffff91f40000 end+0x92df8d0 s0 0xc0000000030f87e0 s1 0xc0000000030a2800 s2 0xffffffff88b88070 mclpools s3 0x1 s4 0xc0000000000de078 s5 0 s6 0xc0000000030a2818 s7 0xffffffff91f43c38 end+0x92e3508 t8 0x52f2c064 t9 0xffffffff88a95188 int2_splx k0 0xffffffff8894a114 rtable_match+0x84 k1 0xc000000002f40bc0 gp 0xffffffff88b64430 _gp sp 0xffffffff91f43b08 end+0x92e33d8 s8 0 ra 0xffffffff888b2dbc pool_put+0x284 sr 0x1000cfa3 lo 0x61861862 hi 0 bad 0xd97d3b7057b9cf7b cs 0x10 pc 0xffffffff888b2be0 pool_put+0xa8 pool_put+0xa8: ld v0,8(v1) ddb> continue panic: trap Stopped at Debugger+0x4: jr ra Debugger+0x8: nop TID PID UID PRFLAGS PFLAGS CPU COMMAND *34898 34898 0 0x14000 0x210 0 softnet Debugger+0x4 (73e2c57b1f779808,900000001fbd9880,900000001fbd9830,ffffffff91f438 30) ra 0xffffffff888b6040 sp 0xffffffff91f43868, sz 0 panic+0x100 (73e2c57b1f779808,ffffffff91f43af0,0,ffffffff88c0eb20) ra 0xffffff ff88a76aec sp 0xffffffff91f43868, sz 112 itsa+0xf4 (73e2c57b1f779808,ffffffff91f43af0,0,ffffffff88c0eb20) ra 0xffffffff 88a7a2fc sp 0xffffffff91f438d8, sz 176 k_general+0x114 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x0 s p 0xffffffff91f43988, sz 0 (KERNEL TRAP) pool_put+0xa8 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xffff ffff888d18f0 sp 0xffffffff91f43b08, sz 160 m_extfree+0x110 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xff ffffff888d1fa0 sp 0xffffffff91f43ba8, sz 32 m_free+0x138 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xfffff fff888d20b0 sp 0xffffffff91f43bc8, sz 48 m_freem+0x28 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xfffff fff88961b88 sp 0xffffffff91f43bf8, sz 32 in_arpinput+0x88 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0xf fffffff8892169c sp 0xffffffff91f43c18, sz 144 ether_input+0x334 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x ffffffff8891df38 sp 0xffffffff91f43ca8, sz 112 if_input_process+0xf8 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) r a 0xffffffff888a3968 sp 0xffffffff91f43d18, sz 80 taskq_thread+0xd0 (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x ffffffff88a797fc sp 0xffffffff91f43d68, sz 80 proc_trampoline+0x1c (ffffffff91f439b0,ffffffff91f43af0,0,ffffffff888b2be0) ra 0x0 sp 0xffffffff91f43db8, sz 0 User-level: pid 34898 http://www.openbsd.org/ddb.html describes the minimum info required in bug reports. Insufficient info makes it difficult to find and fix bugs. ddb> boot reboot panic: wd33c93_scsicmd: busy Stopped at Debugger+0x4: jr ra Debugger+0x8: nop Debugger+0x4 (73e2c57b1f779808,900000001fbd9880,900000001fbd9830,ffffffff91f42b e0) ra 0xffffffff888b6040 sp 0xffffffff91f42c18, sz 0 panic+0x100 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xfffff fff88806960 sp 0xffffffff91f42c18, sz 112 wd33c93_scsi_cmd+0x280 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) r a 0xffffffff88aa9e18 sp 0xffffffff91f42c88, sz 64 scsi_xs_sync+0xb8 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0 xffffffff88ab3454 sp 0xffffffff91f42cc8, sz 64 sd_flush+0x8c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xfff fffff88ab54d8 sp 0xffffffff91f42d08, sz 48 sdactivate+0x140 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0x ffffffff888a54dc sp 0xffffffff91f42d38, sz 48 config_suspend+0x3c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff88aae184 sp 0xffffffff91f42d68, sz 48 scsi_activate_target+0x54 (73e2c57b1f779808,35,c000000002b9c16b,c00000000000870 0) ra 0xffffffff88aae20c sp 0xffffffff91f42d98, sz 64 scsi_activate_bus+0x44 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) r a 0xffffffff888a54dc sp 0xffffffff91f42dd8, sz 64 config_suspend+0x3c (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra 0xffffffff888a5398 sp 0xffffffff91f42e18, sz 48 config_activate_children+0x78 (73e2c57b1f779808,35,c000000002b9c16b,c0000000000 08700) ra 0xffffffff888a5510 sp 0xffffffff91f42e48, sz 80 config_suspend+0x70 (73e2c57b1f779808,35,c000000002b9c16b,c000000000008700) ra ddb> boot reboot System restart. sc0,1,0: cmd=0x12 timeout after 2 sec. Resetting SCSI bus [ using 388944 bytes of bsd ELF symbol table ] Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. Copyright (c) 1995-2016 OpenBSD. All rights reserved. http://www.OpenBSD.org OpenBSD 6.0-beta (GENERIC-IP22) #664: Sun Jul 10 00:31:39 MDT 2016 dera...@sgi.openbsd.org:/usr/src/sys/arch/sgi/compile/GENERIC-IP22 real mem = 167772160 (160MB) rsvd mem = 802816 (1MB) avail mem = 160169984 (152MB) mainbus0 at root: Challenge S cpu0 at mainbus0: MIPS R5000 CPU rev 1.0 150 MHz, R5000 based FPC rev 1.0 cpu0: cache L1-I 32KB D 32KB 2 way, L2 512KB direct int0 at mainbus0 addr 0x1fbd9880 imc0 at mainbus0: revision 3 gio0 at imc0 hpc0 at gio0 addr 0x1fb80000: SGI HPC3 (onboard) zs0 at hpc0 offset 0x00059830 irq 29: 85230 zstty0 at zs0 channel 1: console zstty1 at zs0 channel 0 sq0 at hpc0 offset 0x00054000 irq 3: Seeq 80c03, address 08:00:69:0a:34:09 wdsc0 at hpc0 offset 0x00044000 irq 1: WD33C93B, 20.0 MHz, burst DMA wdsc0: microcode revision 0x0d, fast SCSI scsibus0 at wdsc0: 8 targets, initiator 0 sd0 at scsibus0 targ 1 lun 0: <SEAGATE, ST39103LCSUN9.0G, 034A> SCSI2 0/direct fixed serial.SEAGATE_ST39103LCSUN9.0GLS4557570000101519ZQ sd0: 8637MB, 512 bytes/sector, 17689267 sectors pione at hpc0 offset 0x00059800 irq 5 not configured panel0 at hpc0 offset 0x00059850 irq 9: power button dsclock0 at hpc0 offset 0x00060000 hpc1 at gio0 addr 0x1fb00000: SGI HPC3 (IO+ mezzanine) hpc1: using EXP1's DMA channel sq1 at hpc1 offset 0x00054000 irq 0: Seeq 80c03, address 08:00:69:02:64:d1 clock0 at mainbus0: int 5 vscsi0 at root scsibus1 at vscsi0: 256 targets softraid0 at root scsibus2 at softraid0: 256 targets boot device: sd0 root on sd0a (ffbd62fcf39fc195.a) swap on sd0b dump on sd0b WARNING: / was not properly unmounted