On 10/19/15 11:27 , Yuri Pankov wrote:
> On Mon, 19 Oct 2015 20:23:28 +0200, Richard PALO wrote:
>> Lately when doing a rather intensive build (clang3.7) I'm experiencing
>> hangs and crashes.
>> crashdump available (it's big!):
>>> Oct 19 19:50:48 omnis genunix: [ID 918906 kern.notice] I/O to pool
>>> 'rpool' appears to be hung.
>>> 488 Oct 19 19:50:48 omnis unix: [ID 100000 kern.notice]
>>> 489 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cba20 zfs:vdev_deadman+10b ()
>>> 490 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cba70 zfs:vdev_deadman+4a ()
>>> 491 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cbaa0 zfs:spa_deadman+ad ()
>>> 492 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cbb90 genunix:cyclic_softint+209 ()
>>> 493 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cbba0 unix:cbe_low_level+14 ()
>>> 494 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cbbf0 unix:av_dispatch_softvect+88 ()
>>> 495 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d0cbc20 unix:dispatch_softint+39 ()
>>> 496 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005a20 unix:switch_sp_and_call+13 ()
>>> 497 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005a60 unix:dosoftint+44 ()
>>> 498 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005ac0 unix:do_interrupt+10d ()
>>> 499 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005ad0 unix:cmnint+1e9 ()
>>> 500 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005bc0 unix:mach_cpu_idle+6 ()
>>> 501 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005bf0 unix:cpu_idle+11a ()
>>> 502 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005c00 unix:cpu_idle_adaptive+13 ()
>>> 503 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005c20 unix:idle+a7 ()
>>> 504 Oct 19 19:50:48 omnis genunix: [ID 655072 kern.notice]
>>> ffffff003d005c30 unix:thread_start+8 ()
>>> 505 Oct 19 19:50:48 omnis unix: [ID 100000 kern.notice]
>>> 506 Oct 19 19:50:48 omnis genunix: [ID 672855 kern.notice] syncing
>>> file systems...
>>
>> is this already known?
> 
> Yes, and it's actually a feature, not a bug -
> https://www.illumos.org/issues/3246

To phrase this more practically, this means that some I/O that was
issued was not returned. This could be because of a hardware or software
bug. Figuring out what issued I/Os it was and to what device would be a
useful next step to make forward progress here. I would start looking at
the outstanding zio's in mdb with, IIRC something like ::walk zio |
::zio -r.

Robert


-------------------------------------------
illumos-discuss
Archives: https://www.listbox.com/member/archive/182180/=now
RSS Feed: https://www.listbox.com/member/archive/rss/182180/21175430-2e6923be
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=21175430&id_secret=21175430-6a77cda4
Powered by Listbox: http://www.listbox.com

Reply via email to