Public bug reported:

System experiences repeated hard freezes (requires power button reset) under
Wayland with GNOME Shell 50 on kernel 7.0.0-10-generic. Audio continues
playing during freeze, indicating the kernel is still running but the
display compositor is deadlocked.

== System Information ==

Ubuntu 26.04 (Resolute Raccoon, development branch)
Kernel: 7.0.0-10-generic #10-Ubuntu SMP PREEMPT_DYNAMIC (7.0.0-rc4)
GNOME Shell: 50.0
Session: Wayland
CPU: AMD Ryzen 9 3900X 12-Core Processor
GPU: NVIDIA GeForce RTX 2070 SUPER (PCI ID: 10de:1e84)
NVIDIA Driver: 580.126.09 (tested both open and closed kernel modules)
Motherboard: ASRock B550 Steel Legend (BIOS P3.40)
RAM: 32GB (2/4 slots populated)

== Symptoms ==

1. Complete GUI freeze - mouse, keyboard, display all unresponsive
2. Audio continues playing (kernel still running)
3. No kernel panic/oops in logs
4. No NVIDIA Xid errors logged
5. Hard power reset required
6. Occurs approximately every 20-60 minutes under normal desktop use
7. Occurs with both nvidia-driver-580-open AND nvidia-driver-580 (closed)

== Freeze Pattern (consistent across all occurrences) ==

The following sequence appears in journalctl before every freeze:

1. Optional: gnome-shell logs "Value 9223372036854775807 cannot be safely
   stored in a JS Number and may be rounded" (INT64_MAX, likely invalid
   Wayland presentation timestamp)
2. Kernel logs "NOHZ tick-stop error: local softirq work is pending,
   handler #200!!!" (and sometimes #08, #01, #80)
3. NOHZ errors increase in frequency
4. All logging stops - system frozen

== Example logs from crash at 23:19 (Mar 31 2026) ==

Mar 31 23:19:58 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #200!!!
Mar 31 23:19:59 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #08!!!
Mar 31 23:20:02 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #200!!!
Mar 31 23:20:06 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #200!!!
Mar 31 23:20:06 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #200!!!
Mar 31 23:20:07 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #08!!!
Mar 31 23:20:18 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #01!!!
Mar 31 23:20:18 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #01!!!
Mar 31 23:20:26 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #08!!!
[no further log entries - system frozen]

== Example logs from crash at 22:49 (earlier same day) ==

Mar 31 22:49:23 gnome-shell[16774]: Value 9223372036854775807 cannot be safely 
stored in a JS Number
[repeated ~15 times]
Mar 31 22:49:34 kernel: NOHZ tick-stop error: local softirq work is pending, 
handler #200!!!
[system frozen shortly after]

== Additional observations ==

- nvidia-gpu i2c timeout error at boot: "nvidia-gpu 0000:04:00.3:
  i2c timeout error e0000000" (USB-C/VirtualLink port)
- nvidia-powerd reports unsupported system: "ERROR! Running on an
  unsupported system (PCI device Id: 0x1e84)"
- Snap applications (Firefox, Thunderbird) trigger AppArmor DENIED on
  NVIDIA device nodes (/dev/char/195:*) via glxtest before some freezes
- Edge Wayland "buggy presentation feedback" logged continuously
  (likely unrelated but contributes to log noise)

== Troubleshooting performed ==

All of the following were tested individually and in combination;
none resolved the freeze:

- Removed Discord Snap (was causing audit log flooding with LXD)
- Disabled [email protected] GNOME extension
- Switched from nvidia-driver-580-open to nvidia-driver-580 (closed)
- Disabled GPU acceleration in Microsoft Edge
- Disabled nvidia-powerd service
- Reset GNOME Mutter experimental features

== Versions ==

$ cat /proc/version_signature
Ubuntu 7.0.0-10.10-generic 7.0.0-rc4

$ apt-cache policy linux-image-7.0.0-10-generic
linux-image-7.0.0-10-generic:
  Installiert:           7.0.0-10.10
  Installationskandidat: 7.0.0-10.10
  Versionstabelle:
 *** 7.0.0-10.10 500
        500 http://archive.ubuntu.com/ubuntu resolute/main amd64 Packages
        100 /var/lib/dpkg/status


$ apt-cache policy nvidia-driver-580
nvidia-driver-580:
  Installiert:           580.126.09-0ubuntu9
  Installationskandidat: 580.126.09-0ubuntu9
  Versionstabelle:
 *** 580.126.09-0ubuntu9 500
        500 http://archive.ubuntu.com/ubuntu resolute/restricted amd64 Packages
        100 /var/lib/dpkg/status

ProblemType: Bug
DistroRelease: Ubuntu 26.04
Package: linux-image-7.0.0-10-generic 7.0.0-10.10
ProcVersionSignature: Ubuntu 7.0.0-10.10-generic 7.0.0-rc4
Uname: Linux 7.0.0-10-generic x86_64
NonfreeKernelModules: zfs nvidia_modeset nvidia
ApportVersion: 2.33.1-0ubuntu7
Architecture: amd64
AudioDevicesInUse:
 USER        PID ACCESS COMMAND
 /dev/snd/controlC2:  luis       5974 F.... wireplumber
 /dev/snd/controlC1:  luis       5974 F.... wireplumber
 /dev/snd/controlC0:  luis       5974 F.... wireplumber
 /dev/snd/seq:        luis       5959 F.... pipewire
CasperMD5CheckMismatches: ./boot/grub/i386-pc/eltorito.img
CasperMD5CheckResult: fail
CurrentDesktop: ubuntu:GNOME
Date: Tue Mar 31 23:24:00 2026
InstallationDate: Installed on 2026-03-28 (3 days ago)
InstallationMedia: Ubuntu 26.04 "Resolute Raccoon" - Daily amd64 (20260325)
MachineType: To Be Filled By O.E.M. B550 Steel Legend
ProcEnviron:
 LANG=de_DE.UTF-8
 PATH=(custom, no user)
 SHELL=/bin/bash
 TERM=xterm-256color
 XDG_RUNTIME_DIR=<set>
ProcFB: 0 nvidia-drmdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-7.0.0-10-generic 
root=UUID=2c6e0aea-b2a9-4318-87f5-8e06a9447fb0 ro quiet splash 
crashkernel=2G-4G:320M,4G-32G:512M,32G-64G:1024M,64G-128G:2048M,128G-:4096M
RfKill:
 
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 01/18/2024
dmi.bios.release: 5.17
dmi.bios.vendor: American Megatrends International, LLC.
dmi.bios.version: P3.40
dmi.board.name: B550 Steel Legend
dmi.board.vendor: ASRock
dmi.chassis.asset.tag: To Be Filled By O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: To Be Filled By O.E.M.
dmi.chassis.version: To Be Filled By O.E.M.
dmi.modalias: 
dmi:bvnAmericanMegatrendsInternational,LLC.:bvrP3.40:bd01/18/2024:br5.17:svnToBeFilledByO.E.M.:pnB550SteelLegend:pvrToBeFilledByO.E.M.:rvnASRock:rnB550SteelLegend:rvr:cvnToBeFilledByO.E.M.:ct3:cvrToBeFilledByO.E.M.:skuToBeFilledByO.E.M.:pfaToBeFilledByO.E.M.:
dmi.product.family: To Be Filled By O.E.M.
dmi.product.name: B550 Steel Legend
dmi.product.sku: To Be Filled By O.E.M.
dmi.product.version: To Be Filled By O.E.M.
dmi.sys.vendor: To Be Filled By O.E.M.

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New


** Tags: amd64 apport-bug resolute wayland-session

** Attachment added: "bugreport.zip"
   
https://bugs.launchpad.net/bugs/2146965/+attachment/5957367/+files/bugreport.zip

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2146965

Title:
  7.0.0-10-generic] Hard freeze with NOHZ tick-stop errors

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2146965/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to