[Bug 999755] Re: Kernel crash in rb_next doing ohai loops

2012-07-31 Thread Karl Matthias
We verified the patched kernel does not exhibit the crashing behavior over a week-long run of Ohai loops on an EC2 large instance. It also behaves correctly on 2-core a VirtualBox running Ohai loops. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscrib

[Bug 999755] Re: Kernel crash in rb_next doing ohai loops

2012-07-31 Thread Karl Matthias
** Tags removed: verification-needed-precise ** Tags added: verification-done-precise -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/999755 Title: Kernel crash in rb_next doing ohai loops To manage

[Bug 999755] Re: Kernel crash in rb_next doing ohai loops

2012-07-03 Thread Karl Matthias
Awesome, thanks for the nice explanation and the dedication to working this out. We'll try out the patch on our several test setups as well. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/999755 Title

[Bug 999755] Re: Kernel crash in rb_next doing ohai loops

2012-05-30 Thread Karl Matthias
Stefan, our experience is that it also does not ever happen on just a single core so it's definitely some scheduler interaction on a multi- core box. You probably could get a 4 core to do it as well if the process list were more heavily loaded. -- You received this bug notification because you a

[Bug 999755] Re: Kernel crash in rb_next doing ohai loops

2012-05-29 Thread Karl Matthias
Ok, that is "good" news. Happy to help with further testing. ** Description changed: Testcase: 1. apt-get install build-essential ruby-1.9.3 screen 2. gem install chef - 3. in screen session: while true; oahi; done + 3. in screen session: while true; ohai; done --- We have a numbe

[Bug 999755] Re: Kernel crash on EC2 & VirtualBox

2012-05-28 Thread Karl Matthias
We have had a crash with Stefan's debugging kernel running. Here is the output (doesn't look like it contains any more information). [248587.286290] [ cut here ] [248587.286765] kernel BUG at /home/smb/precise-amd64/ubuntu-2.6/kernel/sched_fair.c:1239! [248587.286775] inv

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-25 Thread Karl Matthias
We have the debugging kernel installed on a box that has been happily crashing for the last few days. We'll see if we can get a good debug for you from it. What would you like us to gather the next time it crashes? -- You received this bug notification because you are a member of Ubuntu Bugs, w

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-24 Thread Karl Matthias
Excellent, thanks for that. I'll drop it on a box tomorrow (UK standard time). -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/999755 Title: Kernel crash on EC2 m1.large instances To manage notifica

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-23 Thread Karl Matthias
Stefan, if you have a kernel build with memory dumps to help debug this, I'm happy to try it. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/999755 Title: Kernel crash on EC2 m1.large instances To m

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-22 Thread Karl Matthias
Thanks, we're running tests on the 3.2.0-24.38 kernel now to see if we can get it to crash in the same way. Yes, this affected both the previous versions: -24.37 and -23.36. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-17 Thread Karl Matthias
Forgot to post this earlier. Re: Brad's request for crash logs. apport-cli says "No pending crash reports. Try --help for more information." ** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Ubuntu Bugs, which

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-17 Thread Karl Matthias
Here's another one if it helps: [848423.023156] BUG: unable to handle kernel NULL pointer dereference at 0010 [848423.023180] IP: [] rb_next+0x1/0x50 [848423.023194] PGD 18ad83067 PUD 18ad82067 PMD 0 [848423.023203] Oops: [#1] SMP [848423.023210] CPU 1 [848423.023213] Modules

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-17 Thread Karl Matthias
Hi Stefan, OK. Here's another stack trace: [47708.053788] BUG: unable to handle kernel NULL pointer dereference at 0010 [47708.053810] IP: [] rb_next+0x1/0x50 [47708.053824] PGD 1d0b83067 PUD 1d0e64067 PMD 0 [47708.053833] Oops: [#1] SMP [47708.053840] CPU 1 [47708.053843] Mo

[Bug 999755] [NEW] Kernel crash on EC2 m1.large instances

2012-05-15 Thread Karl Matthias
Public bug reported: We have a number of small and large instances running the release version of 12.04. The small instances have been completely stable. However, every large instance we have has crashed at a seemingly random interval. This is repeatable on individual systems, though not within

[Bug 999755] Re: Kernel crash on EC2 m1.large instances

2012-05-15 Thread Karl Matthias
-- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/999755 Title: Kernel crash on EC2 m1.large instances To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/