On 10/19/15 11:27 , Yuri Pankov wrote: > On Mon, 19 Oct 2015 20:23:28 +0200, Richard PALO wrote: >> Lately when doing a rather intensive build (clang3.7) I'm experiencing >> hangs and crashes. >> crashdump available (it's big!): >>> Oct 19 19:50:48 omnis genunix: [ID 918906 kern.notice] I/O to pool >>> 'rpool' appears to be hung. >>> 488 Oct 19 19:50:48 omnis unix: [ID 100000 kern.notice] >>> 489 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cba20 zfs:vdev_deadman+10b () >>> 490 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cba70 zfs:vdev_deadman+4a () >>> 491 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cbaa0 zfs:spa_deadman+ad () >>> 492 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cbb90 genunix:cyclic_softint+209 () >>> 493 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cbba0 unix:cbe_low_level+14 () >>> 494 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cbbf0 unix:av_dispatch_softvect+88 () >>> 495 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d0cbc20 unix:dispatch_softint+39 () >>> 496 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005a20 unix:switch_sp_and_call+13 () >>> 497 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005a60 unix:dosoftint+44 () >>> 498 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005ac0 unix:do_interrupt+10d () >>> 499 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005ad0 unix:cmnint+1e9 () >>> 500 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005bc0 unix:mach_cpu_idle+6 () >>> 501 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005bf0 unix:cpu_idle+11a () >>> 502 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005c00 unix:cpu_idle_adaptive+13 () >>> 503 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005c20 unix:idle+a7 () >>> 504 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice] >>> ffffff003d005c30 unix:thread_start+8 () >>> 505 Oct 19 19:50:48 omnis unix: [ID 100000 kern.notice] >>> 506 Oct 19 19:50:48 omnis genunix: [ID 672855 kern.notice] syncing >>> file systems... >> >> is this already known? > > Yes, and it's actually a feature, not a bug - > https://www.illumos.org/issues/3246
To phrase this more practically, this means that some I/O that was issued was not returned. This could be because of a hardware or software bug. Figuring out what issued I/Os it was and to what device would be a useful next step to make forward progress here. I would start looking at the outstanding zio's in mdb with, IIRC something like ::walk zio | ::zio -r. Robert ------------------------------------------- illumos-discuss Archives: https://www.listbox.com/member/archive/182180/=now RSS Feed: https://www.listbox.com/member/archive/rss/182180/21175430-2e6923be Modify Your Subscription: https://www.listbox.com/member/?member_id=21175430&id_secret=21175430-6a77cda4 Powered by Listbox: http://www.listbox.com
