Hi,
It seems that after the migration, things stop working on the guest: I can't
shut down or reboot, and cron and ssh fail.
Further info (the guest's console during the migration process and the attempt
to migrate back):
-----------
migrate from server 1 to server 2
root@c1-sb-vs1:~# xl console c1-sb-test1
c1-sb-test1 login:
Debian GNU/Linux 8 c1-sb-test1 hvc0
c1-sb-test1 login: [ 31.750901] Freezing user space processes ... (elapsed 0.001 seconds) done.
[ 31.752377] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[ 31.753716] PM: freeze of devices complete after 0.077 msecs
[ 31.753748] PM: late freeze of devices complete after 0.023 msecs
[ 31.753785] PM: noirq freeze of devices complete after 0.032 msecs
<... disconnect ...>
root@c1-sb-vs2:~# xl console c1-sb-test1
[ 31.757439] xen:grant_table: Grant tables using version 1 layout
[ 31.757439] PM: noirq restore of devices complete after 0.036 msecs
[ 31.757439] PM: early restore of devices complete after 0.022 msecs
[ 31.761161] PM: restore of devices complete after 6.782 msecs
[ 31.761190] Restarting tasks ... done.
[ 31.778758] Setting capacity to 10485760
Debian GNU/Linux 8 c1-sb-test1 hvc0
c1-sb-test1 login:
-----------
attempt to migrate back from server 2 to server 1
root@c1-sb-vs2:~# xl console c1-sb-test1
Debian GNU/Linux 8 c1-sb-test1 hvc0
c1-sb-test1 login: [ 92.404108] random: nonblocking pool is initialized
[ 93.959766] Freezing user space processes ...
[ 113.960323] Freezing of tasks failed after 20.000 seconds (1 tasks refusing to freeze, wq_busy=0):
[ 113.960331] cron D ffff88001f815bc0 0 425 396 0x00000004
[ 113.960340] ffff88001a94e5c0 ffffffff81a0d500 ffff8800028fc000 ffff8800028fb9e8
[ 113.960346] ffff880002199598 ffff88001a94e5c0 ffff88001a93e240 ffff8800021994c0
[ 113.960360] ffffffff815c2a81 ffff8800021995a0 ffffffff815c5942 7fffffffffffffff
[ 113.960372] Call Trace:
[ 113.960416] [<ffffffff815c2a81>] ? schedule+0x31/0x80
[ 113.960424] [<ffffffff815c5942>] ? schedule_timeout+0x1b2/0x270
[ 113.960441] [<ffffffff812e4f77>] ? blk_finish_plug+0x27/0x40
[ 113.960516] [<ffffffffc007d8ef>] ? _xfs_buf_ioapply+0x35f/0x490 [xfs]
[ 113.960528] [<ffffffff815c34b1>] ? wait_for_completion+0xf1/0x130
[ 113.960545] [<ffffffff810a3790>] ? wake_up_q+0x60/0x60
[ 113.960591] [<ffffffffc007f61f>] ? xfs_buf_read_map+0xff/0x180 [xfs]
[ 113.960645] [<ffffffffc007f368>] ? xfs_buf_submit_wait+0x78/0x200 [xfs]
[ 113.960684] [<ffffffffc00ac3a8>] ? xfs_trans_read_buf_map+0xe8/0x330 [xfs]
[ 113.960728] [<ffffffffc007f61f>] ? xfs_buf_read_map+0xff/0x180 [xfs]
[ 113.960781] [<ffffffffc00ac3a8>] ? xfs_trans_read_buf_map+0xe8/0x330 [xfs]
[ 113.960825] [<ffffffffc0071f3e>] ? xfs_imap_to_bp+0x6e/0xf0 [xfs]
[ 113.960883] [<ffffffffc0072680>] ? xfs_iread+0x80/0x2c0 [xfs]
[ 113.960931] [<ffffffffc0087d62>] ? xfs_iget+0x332/0x840 [xfs]
[ 113.960982] [<ffffffffc0090950>] ? xfs_lookup+0x100/0x140 [xfs]
[ 113.961042] [<ffffffffc008d5a3>] ? xfs_vn_lookup+0x73/0xb0 [xfs]
[ 113.961071] [<ffffffff8120a376>] ? __d_alloc+0x116/0x170
[ 113.961094] [<ffffffff811faf89>] ? lookup_real+0x19/0x60
[ 113.961104] [<ffffffff811fb397>] ? lookup_slow+0x57/0xe0
[ 113.961113] [<ffffffff811fdcd3>] ? walk_component+0x1f3/0x470
[ 113.961132] [<ffffffff811fbcf9>] ? path_init+0x1d9/0x330
[ 113.961142] [<ffffffff811fe6bd>] ? path_lookupat+0x5d/0x110
[ 113.961151] [<ffffffff81200dc1>] ? filename_lookup+0xb1/0x180
[ 113.961161] [<ffffffff812009ff>] ? getname_flags+0x6f/0x1e0
[ 113.961181] [<ffffffff811f5ff9>] ? vfs_fstatat+0x59/0xb0
[ 113.961189] [<ffffffff811916cc>] ? vm_mmap_pgoff+0xbc/0xf0
[ 113.961195] [<ffffffff811f654a>] ? SYSC_newstat+0x2a/0x60
[ 113.961203] [<ffffffff810648de>] ? __do_page_fault+0x20e/0x500
[ 113.961211] [<ffffffff815c6776>] ? system_call_fast_compare_end+0xc/0x96
[ 113.961217]
[ 113.961225] Restarting tasks ... done.
[ 113.962782] xen:manage: do_suspend: freeze processes failed -16
[ 240.032030] INFO: task cron:425 blocked for more than 120 seconds.
[ 240.032043] Tainted: G W E 4.6.0-1-amd64 #1
[ 240.032049] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 240.032058] cron D ffff88001f815bc0 0 425 396 0x00000004
[ 240.032072] ffff88001a94e5c0 ffffffff81a0d500 ffff8800028fc000 ffff8800028fb9e8
[ 240.032084] ffff880002199598 ffff88001a94e5c0 ffff88001a93e240 ffff8800021994c0
[ 240.032100] ffffffff815c2a81 ffff8800021995a0 ffffffff815c5942 7fffffffffffffff
[ 240.032122] Call Trace:
[ 240.032147] [<ffffffff815c2a81>] ? schedule+0x31/0x80
[ 240.032159] [<ffffffff815c5942>] ? schedule_timeout+0x1b2/0x270
[ 240.032175] [<ffffffff812e4f77>] ? blk_finish_plug+0x27/0x40
[ 240.032235] [<ffffffffc007d8ef>] ? _xfs_buf_ioapply+0x35f/0x490 [xfs]
[ 240.032252] [<ffffffff815c34b1>] ? wait_for_completion+0xf1/0x130
[ 240.032274] [<ffffffff810a3790>] ? wake_up_q+0x60/0x60
[ 240.032306] [<ffffffffc007f61f>] ? xfs_buf_read_map+0xff/0x180 [xfs]
[ 240.032360] [<ffffffffc007f368>] ? xfs_buf_submit_wait+0x78/0x200 [xfs]
[ 240.032395] [<ffffffffc00ac3a8>] ? xfs_trans_read_buf_map+0xe8/0x330 [xfs]
[ 240.032444] [<ffffffffc007f61f>] ? xfs_buf_read_map+0xff/0x180 [xfs]
[ 240.032479] [<ffffffffc00ac3a8>] ? xfs_trans_read_buf_map+0xe8/0x330 [xfs]
[ 240.032520] [<ffffffffc0071f3e>] ? xfs_imap_to_bp+0x6e/0xf0 [xfs]
[ 240.032557] [<ffffffffc0072680>] ? xfs_iread+0x80/0x2c0 [xfs]
[ 240.032592] [<ffffffffc0087d62>] ? xfs_iget+0x332/0x840 [xfs]
[ 240.032629] [<ffffffffc0090950>] ? xfs_lookup+0x100/0x140 [xfs]
[ 240.032671] [<ffffffffc008d5a3>] ? xfs_vn_lookup+0x73/0xb0 [xfs]
[ 240.032681] [<ffffffff8120a376>] ? __d_alloc+0x116/0x170
[ 240.032688] [<ffffffff811faf89>] ? lookup_real+0x19/0x60
[ 240.032697] [<ffffffff811fb397>] ? lookup_slow+0x57/0xe0
[ 240.032713] [<ffffffff811fdcd3>] ? walk_component+0x1f3/0x470
[ 240.032719] [<ffffffff811fbcf9>] ? path_init+0x1d9/0x330
[ 240.032726] [<ffffffff811fe6bd>] ? path_lookupat+0x5d/0x110
[ 240.032742] [<ffffffff81200dc1>] ? filename_lookup+0xb1/0x180
[ 240.032749] [<ffffffff812009ff>] ? getname_flags+0x6f/0x1e0
[ 240.032756] [<ffffffff811f5ff9>] ? vfs_fstatat+0x59/0xb0
[ 240.032763] [<ffffffff811916cc>] ? vm_mmap_pgoff+0xbc/0xf0
[ 240.032775] [<ffffffff811f654a>] ? SYSC_newstat+0x2a/0x60
[ 240.032783] [<ffffffff810648de>] ? __do_page_fault+0x20e/0x500
[ 240.032790] [<ffffffff815c6776>] ? system_call_fast_compare_end+0xc/0x96