Hi *, I've got a system running -current that keeps crashing on me every couple of days. Output of ddb:
Connected to /dev/cuaU0 (speed 115200) ddb{0}> show panic the kernel did not panic ddb{0}> show uvm Current UVM status: pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12 482451 VM pages: 43158 active, 132795 inactive, 35 wired, 192336 free (24054 z ero) min 10% (25) anon, 10% (25) vnode, 5% (12) vtext freemin=16081, free-target=21441, inactive-target=0, wired-max=160817 faults=2487210, traps=2404140, intrs=211883, ctxswitch=1960560 fpuswitch=0 softint=3499069, syscalls=2015497, kmapent=9 fault counts: noram=0, noanon=0, noamap=0, pgwait=0, pgrele=0 ok relocks(total)=192470(193514), anget(retries)=603205(0), amapcopy=177151 neighbor anon/obj pg=82033/639788, gets(lock/unlock)=415897/193548 cases: anon=570367, anoncow=32838, obj=347149, prcopy=67670, przero=1469152 daemon and swap counts: woke=0, revs=0, scans=0, obscans=0, anscans=0 busy=0, freed=0, reactivate=0, deactivate=0 pageouts=0, pending=0, nswget=0 nswapdev=1 swpages=526020, swpginuse=0, swpgonly=0 paging=0 kernel pointers: objs(kern)=0xffffffff8238a038 ddb{0}> show trace No such command ddb{0}> trace icmp_mtudisc_timeout(fffffd807a50b070,0) at icmp_mtudisc_timeout+0x77 rt_timer_timer(ffffffff8235d668) at rt_timer_timer+0x1cc softclock_thread(ffff8000fffff260) at softclock_thread+0x13b end trace frame: 0x0, count: -3 ddb{0}> Output of a second crash: ddb{0}> show panic the kernel did not panic ddb{0}> trace icmp_mtudisc_timeout(fffffd8069f9f700,0) at icmp_mtudisc_timeout+0x77 rt_timer_timer(ffffffff8231bfc8) at rt_timer_timer+0x1cc softclock_thread(ffff8000fffff500) at softclock_thread+0x13b end trace frame: 0x0, count: -3 ddb{0}> show uvm Current UVM status: pagesize=4096 (0x1000), pagemask=0xfff, pageshift=12 482457 VM pages: 29240 active, 133535 inactive, 35 wired, 205028 free (25630 z ero) min 10% (25) anon, 10% (25) vnode, 5% (12) vtext freemin=16081, free-target=21441, inactive-target=0, wired-max=160819 faults=687274, traps=693441, intrs=75204, ctxswitch=381252 fpuswitch=0 softint=615411, syscalls=607703, kmapent=9 fault counts: noram=0, noanon=0, noamap=0, pgwait=0, pgrele=0 ok relocks(total)=185433(186477), anget(retries)=141598(0), amapcopy=75047 neighbor anon/obj pg=69895/201703, gets(lock/unlock)=256502/186509 cases: anon=114948, anoncow=26650, obj=237702, prcopy=17724, przero=290216 daemon and swap counts: woke=0, revs=0, scans=0, obscans=0, anscans=0 busy=0, freed=0, reactivate=0, deactivate=0 pageouts=0, pending=0, nswget=0 nswapdev=1 swpages=526020, swpginuse=0, swpgonly=0 paging=0 kernel pointers: objs(kern)=0xffffffff82317458 ddb{0}> show bcstats Current Buffer Cache status: numbufs 24114 busymapped 0, delwri 5 kvaslots 6030 avail kva slots 6030 bufpages 96426, dmapages 96426, dirtypages 20 pendingreads 0, pendingwrites 0 highflips 0, highflops 0, dmaflips 0 ddb{0}> mount No such command ddb{0}> trace icmp_mtudisc_timeout(fffffd8069f9f700,0) at icmp_mtudisc_timeout+0x77 rt_timer_timer(ffffffff8231bfc8) at rt_timer_timer+0x1cc softclock_thread(ffff8000fffff500) at softclock_thread+0x13b end trace frame: 0x0, count: -3 Especially the line stating "the kernel did not panic" surprises me, as I am greeted by the kernel debugger. Not sure how to interpret that. While looking for the reason behind these "crashes", I noticed that cron is constantly running at 99% cpu. As a first measure I commented out all cronjobs in place (except for daily weekly monthly as I figured these shouldnt pose a problem). But that did not remedy the problem. Right after startup cron starts eating away at the cpu. Does anybody have an idea how to further analyze the issue (apart from giving it a go by recompiling cron and using gdb) ? Best regards, Stephan