Also worth mentioning. We are only seeing this on the A100. Neither
our automated testing or manual testing of ftrace saw any issues on
DGX2.
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1922387
Here are the steps I used to reproduce:
#if using proposed pocket kernel
https://wiki.ubuntu.com/Testing/EnableProposed
#Need to enable deb-src for proposed/updates for this work
sudo apt update
$ sudo apt-get source linux
#After source is pulled, build and run ftrace selftests
$ sudo make -C
I did some manual ubuntu_kernel_selftests ftrace testing on the
5.4.0-71.79-generic kernel. I was able to replicate the panic, but not
on every run, but even on runs with no panic dmesg would report several
soft lockups.
After removing the MOFED dkms, I was unable to replicate a panic or any
of
This panic occurred while running the ubuntu_kernel_selftests suite. The
last bit of logs are:
13:33:20 DEBUG| [stdout] # selftests: ftrace: ftracetest
13:33:20 DEBUG| [stdout] # === Ftrace unit tests ===
13:33:28 DEBUG| [stdout] # [1] Basic trace file check [PASS]
13:37:04 DEBUG| [stdout] # [2]
This panic occurred while running the ubuntu_kernel_selftests suite. The
last bit of logs are:
13:33:20 DEBUG| [stdout] # selftests: ftrace: ftracetest
13:33:20 DEBUG| [stdout] # === Ftrace unit tests ===
13:33:28 DEBUG| [stdout] # [1] Basic trace file check [PASS]
13:37:04 DEBUG| [stdout] #