Processed: Re: Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]
Processing control commands:

> severity -1 important
Bug #1033862 [src:linux] nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]
Severity set to 'important' from 'critical'
> tags -1 + moreinfo
Bug #1033862 [src:linux] nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]
Added tag(s) moreinfo.

-- 
1033862: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1033862
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems
Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]
Control: severity -1 important
Control: tags -1 + moreinfo

Hi,

On Sun, Apr 02, 2023 at 09:56:52PM -0400, A. F. Cano wrote:
> Package: src:linux
> Version: 6.1.20-1
> Severity: critical
> File: nouveau
> Justification: breaks the whole system
> X-Debbugs-Cc: af...@comcast.net
>
> When the above message occurs, the system becomes totally unresponsive and
> the only way to recover is a hard power-off via the power button held for
> about 5 seconds. Upon boot, the sddm login screen appears but at 1024x768,
> which is much less than the monitor is capable of: 1920x1200.
>
> xrandr
> Screen 0: minimum 16 x 16, current 1024 x 768, maximum 32767 x 32767
> XWAYLAND0 connected primary 1024x768+0+0 (normal left inverted right x axis y axis) 0mm x 0mm
>    1024x768      59.92*+
>    800x600       59.86
>    640x480       59.38
>    320x240       59.52
>    720x480       59.71
>    640x400       59.95
>    320x200       58.96
>    1024x576      59.90
>    864x486       59.92
>    720x400       59.55
>    640x350       59.77
>
> After login sometimes the screen goes blank (but the backlight remains on).
> Hard power off required.
> Sometimes the gear wheel stops turning and the system freezes. Hard power
> off required.
>
> I have tried to install the nvidia proprietary driver 304
> (NVIDIA-Linux-x86_64-304.117.run) which is what this old chip needs but it
> fails to install. No matter what I do the nouveau driver is in use and
> cannot be removed.
>
> If it were possible for nouveau and/or X/Wayland to access the whole set of
> resolutions of the system without hard freezes, I'd be happy. Any tricks?
> Any specific things I could try to figure out the issue?
>
> Obviously, in this particular boot the hard freeze did not happen.
>
> These lines seem to be relevant (from the logs below):
>
> [ 47.892652] nouveau :00:0d.0: bus: MMIO write of 00340001 FAULT at 00b000
> [ 64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
> [ 64.114792] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
> [ 69.614326] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b020
> [ 69.614542] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
> [ 69.615432] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
> [ 70.336843] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b020
> [ 70.337057] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
> [ 70.337684] nouveau :00:0d.0: bus: MMIO write of 00660001 FAULT at 00b020
> [ 70.357387] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b010
> [ 89.666120] nouveau :00:0d.0: bus: MMIO write of 00ca0001 FAULT at 00b010
> [ 97.330127] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b010
> [ 104.590842] traps: light-locker[4745] trap int3 ip:7f59b65be7d7 sp:7fff472f8690 error:0 in libglib-2.0.so.0.7400.6[7f59b658+8d000]

Can you clarify, is this a regression from 6.1.15-1 previously in
testing, and now happening first with 6.1.20-1?

Looking for reports about the same and similar effects, it looks like
issues with nouveau and the old GeForce 6150SE nForce 430 go back
several years.

Can you please clarify if this is a new regression though between
6.1.15 and 6.1.20.

Regards,
Salvatore
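
When comparing logs from two kernel versions (for example 6.1.15 and 6.1.20), it can help to tally the nouveau MMIO fault lines per register address rather than read them by eye. The following is only a minimal sketch, assuming the log lines have the shape quoted in the report; the names FAULT_RE and count_mmio_faults are made up for illustration and are not part of any Debian or nouveau tooling.

```python
import re
from collections import Counter

# Matches kernel log lines of the shape quoted in the report, e.g.
#   [ 64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
# The written value is optional because some quoted lines do not show one.
FAULT_RE = re.compile(
    r"nouveau\s+\S+:\s+bus:\s+MMIO\s+(read|write)\s+of\s+"
    r"(?:([0-9a-fA-F]+)\s+)?FAULT\s+at\s+([0-9a-fA-F]+)"
)


def count_mmio_faults(log_text):
    """Return a Counter mapping (operation, address) to the number of faults."""
    counts = Counter()
    for match in FAULT_RE.finditer(log_text):
        operation, _value, address = match.groups()
        counts[(operation, address)] += 1
    return counts


# Small demonstration using three lines copied from the report.
sample = """\
[ 64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[ 69.614326] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b020
[ 69.614542] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
"""

for (operation, address), n in sorted(count_mmio_faults(sample).items()):
    print(f"{operation} fault at {address}: {n}")
```

Running the same function over kernel log text saved from each kernel version would show whether the set of faulting addresses, or their frequency, differs between the two.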
Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]
Package: src:linux
Version: 6.1.20-1
Severity: critical
File: nouveau
Justification: breaks the whole system
X-Debbugs-Cc: af...@comcast.net

When the above message occurs, the system becomes totally unresponsive and
the only way to recover is a hard power-off via the power button held for
about 5 seconds. Upon boot, the sddm login screen appears but at 1024x768,
which is much less than the monitor is capable of: 1920x1200.

xrandr
Screen 0: minimum 16 x 16, current 1024 x 768, maximum 32767 x 32767
XWAYLAND0 connected primary 1024x768+0+0 (normal left inverted right x axis y axis) 0mm x 0mm
   1024x768      59.92*+
   800x600       59.86
   640x480       59.38
   320x240       59.52
   720x480       59.71
   640x400       59.95
   320x200       58.96
   1024x576      59.90
   864x486       59.92
   720x400       59.55
   640x350       59.77

After login sometimes the screen goes blank (but the backlight remains on).
Hard power off required.
Sometimes the gear wheel stops turning and the system freezes. Hard power
off required.

I have tried to install the nvidia proprietary driver 304
(NVIDIA-Linux-x86_64-304.117.run) which is what this old chip needs but it
fails to install. No matter what I do the nouveau driver is in use and
cannot be removed.

If it were possible for nouveau and/or X/Wayland to access the whole set of
resolutions of the system without hard freezes, I'd be happy. Any tricks?
Any specific things I could try to figure out the issue?

Obviously, in this particular boot the hard freeze did not happen.

These lines seem to be relevant (from the logs below):

[ 47.892652] nouveau :00:0d.0: bus: MMIO write of 00340001 FAULT at 00b000
[ 64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[ 64.114792] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
[ 69.614326] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b020
[ 69.614542] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[ 69.615432] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
[ 70.336843] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b020
[ 70.337057] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[ 70.337684] nouveau :00:0d.0: bus: MMIO write of 00660001 FAULT at 00b020
[ 70.357387] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b010
[ 89.666120] nouveau :00:0d.0: bus: MMIO write of 00ca0001 FAULT at 00b010
[ 97.330127] nouveau :00:0d.0: bus: MMIO write of FAULT at 00b010
[ 104.590842] traps: light-locker[4745] trap int3 ip:7f59b65be7d7 sp:7fff472f8690 error:0 in libglib-2.0.so.0.7400.6[7f59b658+8d000]

-- Package-specific info:
** Version:
Linux version 6.1.0-7-amd64 (debian-ker...@lists.debian.org) (gcc-12 (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP PREEMPT_DYNAMIC Debian 6.1.20-1 (2023-03-19)

** Command line:
BOOT_IMAGE=/boot/vmlinuz-6.1.0-7-amd64 root=UUID=28bd15dd-cd17-45c8-93e0-65decc995980 ro quiet

** Not tainted

** Kernel log:
[6.707012] systemd[1]: Starting systemd-journald.service - Journal Service...
[6.760881] systemd[1]: Starting systemd-modules-load.service - Load Kernel Modules...
[6.762291] systemd[1]: Starting systemd-remount-fs.service - Remount Root and Kernel File Systems...
[6.763752] systemd[1]: Starting systemd-udev-trigger.service - Coldplug All udev Devices...
[6.766133] systemd[1]: Finished kmod-static-nodes.service - Create List of Static Device Nodes.
[6.766733] systemd[1]: modprobe@configfs.service: Deactivated successfully.
[6.766949] systemd[1]: Finished modprobe@configfs.service - Load Kernel Module configfs.
[6.767339] systemd[1]: modprobe@drm.service: Deactivated successfully.
[6.767583] systemd[1]: Finished modprobe@drm.service - Load Kernel Module drm.
[6.769231] systemd[1]: Mounting sys-kernel-config.mount - Kernel Configuration File System...
[6.778991] systemd[1]: modprobe@efi_pstore.service: Deactivated successfully.
[6.779202] systemd[1]: Finished modprobe@efi_pstore.service - Load Kernel Module efi_pstore.
[6.791834] systemd[1]: Mounted dev-mqueue.mount - POSIX Message Queue File System.
[6.792129] systemd[1]: Mounted sys-kernel-debug.mount - Kernel Debug File System.
[6.792350] systemd[1]: Mounted sys-kernel-tracing.mount - Kernel Trace File System.
[6.792608] systemd[1]: Mounted sys-kernel-config.mount - Kernel Configuration File System.
[6.829931] systemd[1]: Mounted dev-hugepages.mount - Huge Pages File System.
[6.853427] loop: module loaded
[6.854583] systemd[1]: modprobe@loop.service: Deactivated successfully.
[6.854794] systemd[1]: Finished modprobe@loop.service - Load Kernel Module loop.
[6.857921] fuse: init (API version 7.37)
[6.858998] systemd[1]: modprobe@fuse.service: Deactivated successfully.
[6.859199] systemd[1]: