[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Martin Pitt
>- trusty and vivid kernels with nohz=off have survived for a full day
without any lockups.

They both hung last night.

So in summary: Neither nohz=off nor older kernels help here. This really
seems to be a matter of luck/what's going on on the host system.

** Summary changed:

- [arm64] lockups when idle if tickless (nohz=on) is used
+ [arm64] lockups some time after booting

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups some time after booting

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Junien Fridrick
Hi !

kvm cmdline : https://paste.ubuntu.com/17430254/
cpuinfo : https://paste.ubuntu.com/17430265/
dmesg: https://paste.ubuntu.com/17430277/

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Colin Ian King
I'd like to factor out if we are missing IRQs on the host and inside the
VM, so can both be booted with kernel parameter: irqpoll

unfortunately this can eat more cpu cycles, so I'm reluctant to ask for
this to be used, but I'm wondering of the host or VM are occasionally
missing timer wakeups

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Martin Pitt
Colin Ian King [2016-06-17 10:50 -]:
> I'm trying to get a reliable reproducer on a similarly sized aarch64
> host.  Just so that I'm not missing anything, what is the entire command
> line being used on the host to run the VM?

I can't determine this. I asked Junien on IRC to put it here.

> Also, what is /proc/cmdline on the VM?

Aside from the "nohz=off" it's rather unsurprising:

  BOOT_IMAGE=/boot/vmlinuz-4.2.0-38-generic root=UUID=b98e4d93-8d8f-4349
-a6ce-b5a87cdb2edd ro nohz=off

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Colin Ian King
And the /proc/cmdline info from the host would be of some use to see if
anything special there is being used.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-17 Thread Colin Ian King
I'm trying to get a reliable reproducer on a similarly sized aarch64
host.  Just so that I'm not missing anything, what is the entire command
line being used on the host to run the VM?

Also, what is /proc/cmdline on the VM?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-16 Thread Martin Pitt
Further notekeeping:
 - 4 dd's (xenial+nohz=off) has survived for half a day, then the instance 
crashed on something else.
 - trusty and vivid kernels with nohz=off have survived for a full day without 
any lockups. lxd on trusty kernel causes a lot of leaked "FREEZED/FREEZING" 
containers, but that's unrelated and does not happen with the vivid kernel. So 
it's unclear whether this combination is stable or the lockups are just 
reduced, or it just was lucky.
 - trusty kernel without the nohz option locked up a few minutes after reboot, 
without actually running any test.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-16 Thread Colin Ian King
If 4 dd's work OK, it may be worth running a minimal sleep loop:

while true; do sleep 0.5; done

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-16 Thread Martin Pitt
Hang still occurs with xenial kernel and one instance of

   nice -n 19 dd if=/dev/zero of=/dev/null bs=1024 &

I have now rebooted and started four dd's, so that all four CPUs should
remain busy constantly.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-16 Thread Martin Pitt
FTR, running the trusty kernel on xenial userspace does not work:
http://paste.ubuntu.com/17392362/

cking | pitti, syscall 384 on aarch64 is getrandom() and that does not
exist on trusty

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-15 Thread Martin Pitt
> It may be worth trying nohz=off on the host as well

Junien did that on the nova compute host, and no change. Processes in
the instance still freeze.

This is actually also consistent with the observation that this
apparently does not happen with the trusty kernel.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-14 Thread Martin Pitt
Oh noes! I'm still getting "task * blocked for more than 120 seconds"
hangs even with nohz=off :-( Is there another option which I could try?

** Changed in: auto-package-testing
   Status: Fix Released => Triaged

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-14 Thread Colin Ian King
It may be worth trying nohz=off on the host as well, just as an
experiment to see if this also improves things.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-14 Thread Paul Gear
We've also been running into this issue on ScalingStack instances
recently; I got this traceback which seems to strongly implicate nohz as
the problem area: https://pastebin.canonical.com/158640/  Presently
testing @pitti's workaround on a number of different sized instances to
confirm.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1531768] Re: [arm64] lockups when idle if tickless (nohz=on) is used

2016-06-13 Thread Colin Ian King
That's great news!  I'll try and figure out what the root cause is. Let
me know if there are other issues.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1531768

Title:
  [arm64] lockups when idle if tickless (nohz=on) is used

To manage notifications about this bug go to:
https://bugs.launchpad.net/auto-package-testing/+bug/1531768/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs