[Bug 1921355] Re: cgroups related kernel panics

2021-07-06 Thread Nikita Nedvetskiy
Hello!

Actually, we saw some surprising behavior.
Shortly after the discussion in this thread, the bug simply disappeared
for nearly two months.
We still had no luck reproducing it.

We used this opportunity to migrate and reboot some of our servers to
activate kdump on them, and decided to wait.
A couple of days ago one of our hypervisors hung, and we got our kernel
crash dump :)
The kernel version was 5.4.0-73-generic this time.

Now that we have it, could somebody please have a look at it?
The file is quite large: ~2.5 GB (3.2 GB unpacked).
https://drive.google.com/file/d/1JVMWJpXNeou06UxqJwl5wjbLKzcb2rOq/view?usp=sharing

** Changed in: linux (Ubuntu)
   Status: Incomplete => Confirmed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1921355

Title:
  cgroups related kernel panics

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1921355/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1921355] Re: cgroups related kernel panics

2021-04-16 Thread Nikita Nedvetskiy
Thank you all for your ideas!

Yes, we do have some out-of-tree modules: the Mellanox driver (for our
NICs) and Open vSwitch. We use them because we had some problems that
were fixed in the newer driver versions.

We don't have apport enabled, and the hypervisor nodes don't even have
direct access to the internet (only some of the VMs on them do).
I checked on a test VM what kind of info apport collects, and it seems to
be the architecture, the kernel version, and the stack trace. We attached
that kind of info manually; it was collected by netconsole, which we have
enabled.

When the issue started, it was even reproducible on the then-latest
kernel (5.4.0-66), so I'm not sure that simply upgrading can help.

Currently I'm working on integrating kdump into our infrastructure,
trying to reproduce again, and I'll also try to schedule migration +
upgrade for our hypervisor node (that's not fast though).

[Bug 1921355] Re: cgroups related kernel panics

2021-03-25 Thread Nikita Nedvetskiy
** Attachment added: "crash-160321.log"
   https://bugs.launchpad.net/ubuntu/+source/linux-hwe-5.4/+bug/1921355/+attachment/5480851/+files/crash-160321.log

[Bug 1921355] Re: cgroups related kernel panics

2021-03-25 Thread Nikita Nedvetskiy
** Attachment added: "crash-080321.log"
   https://bugs.launchpad.net/ubuntu/+source/linux-hwe-5.4/+bug/1921355/+attachment/5480849/+files/crash-080321.log

[Bug 1921355] Re: cgroups related kernel panics

2021-03-25 Thread Nikita Nedvetskiy
** Attachment added: "crash-110321.log"
   https://bugs.launchpad.net/ubuntu/+source/linux-hwe-5.4/+bug/1921355/+attachment/5480850/+files/crash-110321.log

[Bug 1921355] [NEW] cgroups related kernel panics

2021-03-25 Thread Nikita Nedvetskiy
Public bug reported:

Hi!

Over the last 6 months we've upgraded our hypervisor compute hosts from
the Ubuntu Bionic 4.15.* kernel to the Ubuntu Bionic HWE 5.4 kernel.

This month we noticed that several nodes failed due to bugs in cgroups.
The trace was different almost every time, but they all revolve around
cgroups: either NULL pointer dereferences or panics triggered by the
BUG_ON() macro. It looks as if some cgroup no longer existed but something
still tried to access it, causing a kernel panic.
Please find the logs attached.
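
To compare logs like these at a glance, the two failure types mentioned
above could be tallied with a small script. This is only a rough sketch:
the function name classify_crash_log and the category labels are
hypothetical, and the patterns assume the usual oops banner wording
("BUG: kernel NULL pointer dereference" or the older "BUG: unable to
handle kernel NULL pointer dereference", and "kernel BUG at <file>:<line>!"
for BUG_ON()). It does not parse the stack trace itself.

```python
import re
from collections import Counter

# Hypothetical labels mapped to patterns for the standard oops banners.
SIGNATURES = {
    "null_pointer": re.compile(
        r"BUG: (?:unable to handle )?kernel NULL pointer dereference"
    ),
    "bug_on": re.compile(r"kernel BUG at \S+:\d+!"),
}


def classify_crash_log(text):
    """Count how many lines of a log match each crash signature."""
    counts = Counter()
    for line in text.splitlines():
        for name, pattern in SIGNATURES.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


def summarize(paths):
    """Map each log path to its signature counts (assumes readable text files)."""
    result = {}
    for path in paths:
        with open(path, errors="replace") as handle:
            result[path] = classify_crash_log(handle.read())
    return result
```

For example, summarize(["crash-030321.log"]) would return the counts for
that one file, assuming it has been downloaded to the current directory.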

3 of 4 cases happened after a VM shutdown. We tried to spawn lots of VMs,
load them, and shut them down, but didn't manage to reproduce the behavior.
Every case is somewhat different: the kernel patch versions vary (5.4.0-42
to 5.4.0-66), and so does the uptime (from 1 day to about half a year).
There are also lots of hosts with several months of uptime and no issues.
On 4.15 we never saw this behavior at all.
That's quite disturbing, as I don't want dozens of VMs to crash (due to a
host outage) at random times for some vague reason...
I didn't manage to find any related bugs on the bug tracker, hence this
report.

I wonder if anybody in the community has come across something like this.
Could somebody advise how to debug this further, or where else to report
it or look for similar cases?

** Affects: linux-hwe-5.4 (Ubuntu)
 Importance: Undecided
 Status: New


** Tags: cgroups

** Attachment added: "crash-030321.log"
   https://bugs.launchpad.net/bugs/1921355/+attachment/5480836/+files/crash-030321.log

[Bug 877509] Re: [4286CTO, Creative CA0110-IBG, White Headphone Out, Front] Underruns, dropouts or crackling sound

2012-11-08 Thread Nikita Nedvetskiy
Very odd bug.
Appeared again for me >_>
So posting the temporary workaround I found here:
https://bugs.launchpad.net/ubuntu/+source/pulseaudio/+bug/751265/comments/23
I guess now I can sleep easy at night :D

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/877509

Title:
  [4286CTO, Creative CA0110-IBG, White Headphone Out, Front] Underruns,
  dropouts or crackling sound

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/alsa-driver/+bug/877509/+subscriptions
