Processed: Re: Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]

2023-04-04 Thread Debian Bug Tracking System
Processing control commands:

> severity -1 important
Bug #1033862 [src:linux] nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 
548s! [kscreenlocker_g:19260]
Severity set to 'important' from 'critical'
> tags -1 + moreinfo
Bug #1033862 [src:linux] nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 
548s! [kscreenlocker_g:19260]
Added tag(s) moreinfo.

-- 
1033862: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1033862
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems



Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]

2023-04-04 Thread Salvatore Bonaccorso
Control: severity -1 important
Control: tags -1 + moreinfo

Hi,

On Sun, Apr 02, 2023 at 09:56:52PM -0400, A. F. Cano wrote:
> Package: src:linux
> Version: 6.1.20-1
> Severity: critical
> File: nouveau
> Justification: breaks the whole system
> X-Debbugs-Cc: af...@comcast.net
>
> When the above message occurs, the system becomes totally unresponsive and 
> the only way to recover is
> a hard power-off via the power button held for about 5 seconds.  Upon boot, 
> the sddm login screen appears
> but at 1024x768, which is much less than the monitor is capable of: 1920x1200.
> 
> xrandr
> Screen 0: minimum 16 x 16, current 1024 x 768, maximum 32767 x 32767
> XWAYLAND0 connected primary 1024x768+0+0 (normal left inverted right x axis y 
> axis) 0mm x 0mm
>1024x768  59.92*+
>800x600   59.86  
>640x480   59.38  
>320x240   59.52  
>720x480   59.71  
>640x400   59.95  
>320x200   58.96  
>1024x576  59.90  
>864x486   59.92  
>720x400   59.55  
>640x350   59.77
> 
> After login sometimes the screen goes blank (but the backlight remains on). 
> Hard power off required.
> Sometimes the gear wheel stops turning and the system freezes.  Hard power 
> off required.
> 
> I have tried to install the nvidia proprietary driver 304 
> (NVIDIA-Linux-x86_64-304.117.run) which is what
> this old chip needs but it fails to install.  No matter what I do the nouveau 
> driver is in use and
> cannot be removed.
> 
> If it were possible for nouveau and/or X/Wayland to access the whole set of 
> resolutions of the system
> without hard freezes, I'd be happy.  Any tricks?  Any specific things I could 
> try to figure out the issue?
> 
> Obviously, in this particular boot the hard freeze did not happen.
> 
> These lines seem to be relevant (from the logs below):
> 
> [   47.892652] nouveau :00:0
> d.0: bus: MMIO write of 00340001 FAULT at 00b000
> [   64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 
> 00b010
> [   64.114792] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 
> 00b020
> [   69.614326] nouveau :00:0d.0: bus: MMIO write of  FAULT at 
> 00b020
> [   69.614542] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 
> 00b010
> [   69.615432] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 
> 00b020
> [   70.336843] nouveau :00:0d.0: bus: MMIO write of  FAULT at 
> 00b020
> [   70.337057] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 
> 00b010
> [   70.337684] nouveau :00:0d.0: bus: MMIO write of 00660001 FAULT at 
> 00b020
> [   70.357387] nouveau :00:0d.0: bus: MMIO write of  FAULT at 
> 00b010
> [   89.666120] nouveau :00:0d.0: bus: MMIO write of 00ca0001 FAULT at 
> 00b010
> [   97.330127] nouveau :00:0d.0: bus: MMIO write of  FAULT at 
> 00b010
> [  104.590842] traps: light-locker[4745] trap int3 ip:7f59b65be7d7 
> sp:7fff472f8690 error:0 in libglib-2.0.so.0.7400.6[7f59b658+8d000]

Can you clarify, is this a regression from 6.1.15-1 previously in
testing, and now happening first with 6.1.20-1? 

Looking for reports about the same and similar effects, it looks
issues with nouveau and the old eForce 6150SE nForce 430 goes way back
several years. 

Can you please clarify if this is a new regression though between
6.1.15 and 6.1.20.

Regards,
Salvatore



Bug#1033862: nouveau: watchdog: BUG: soft lockup - CPU#0 stuck for 548s! [kscreenlocker_g:19260]

2023-04-02 Thread A. F. Cano
Package: src:linux
Version: 6.1.20-1
Severity: critical
File: nouveau
Justification: breaks the whole system
X-Debbugs-Cc: af...@comcast.net

When the above message occurs, the system becomes totally unresponsive and the 
only way to recover is
a hard power-off via the power button held for about 5 seconds.  Upon boot, the 
sddm login screen appears
but at 1024x768, which is much less than the monitor is capable of: 1920x1200.

xrandr
Screen 0: minimum 16 x 16, current 1024 x 768, maximum 32767 x 32767
XWAYLAND0 connected primary 1024x768+0+0 (normal left inverted right x axis y 
axis) 0mm x 0mm
   1024x768  59.92*+
   800x600   59.86  
   640x480   59.38  
   320x240   59.52  
   720x480   59.71  
   640x400   59.95  
   320x200   58.96  
   1024x576  59.90  
   864x486   59.92  
   720x400   59.55  
   640x350   59.77

After login sometimes the screen goes blank (but the backlight remains on). 
Hard power off required.
Sometimes the gear wheel stops turning and the system freezes.  Hard power off 
required.

I have tried to install the nvidia proprietary driver 304 
(NVIDIA-Linux-x86_64-304.117.run) which is what
this old chip needs but it fails to install.  No matter what I do the nouveau 
driver is in use and
cannot be removed.

If it were possible for nouveau and/or X/Wayland to access the whole set of 
resolutions of the system
without hard freezes, I'd be happy.  Any tricks?  Any specific things I could 
try to figure out the issue?

Obviously, in this particular boot the hard freeze did not happen.

These lines seem to be relevant (from the logs below):

[   47.892652] nouveau :00:0
d.0: bus: MMIO write of 00340001 FAULT at 00b000
[   64.113759] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[   64.114792] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
[   69.614326] nouveau :00:0d.0: bus: MMIO write of  FAULT at 00b020
[   69.614542] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[   69.615432] nouveau :00:0d.0: bus: MMIO write of 00310001 FAULT at 00b020
[   70.336843] nouveau :00:0d.0: bus: MMIO write of  FAULT at 00b020
[   70.337057] nouveau :00:0d.0: bus: MMIO write of 00640001 FAULT at 00b010
[   70.337684] nouveau :00:0d.0: bus: MMIO write of 00660001 FAULT at 00b020
[   70.357387] nouveau :00:0d.0: bus: MMIO write of  FAULT at 00b010
[   89.666120] nouveau :00:0d.0: bus: MMIO write of 00ca0001 FAULT at 00b010
[   97.330127] nouveau :00:0d.0: bus: MMIO write of  FAULT at 00b010
[  104.590842] traps: light-locker[4745] trap int3 ip:7f59b65be7d7 
sp:7fff472f8690 error:0 in libglib-2.0.so.0.7400.6[7f59b658+8d000]



-- Package-specific info:
** Version:
Linux version 6.1.0-7-amd64 (debian-ker...@lists.debian.org) (gcc-12 (Debian 
12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP 
PREEMPT_DYNAMIC Debian 6.1.20-1 (2023-03-19)

** Command line:
BOOT_IMAGE=/boot/vmlinuz-6.1.0-7-amd64 
root=UUID=28bd15dd-cd17-45c8-93e0-65decc995980 ro quiet

** Not tainted

** Kernel log:
[6.707012] systemd[1]: Starting systemd-journald.service - Journal 
Service...
[6.760881] systemd[1]: Starting systemd-modules-load.service - Load Kernel 
Modules...
[6.762291] systemd[1]: Starting systemd-remount-fs.service - Remount Root 
and Kernel File Systems...
[6.763752] systemd[1]: Starting systemd-udev-trigger.service - Coldplug All 
udev Devices...
[6.766133] systemd[1]: Finished kmod-static-nodes.service - Create List of 
Static Device Nodes.
[6.766733] systemd[1]: modprobe@configfs.service: Deactivated successfully.
[6.766949] systemd[1]: Finished modprobe@configfs.service - Load Kernel 
Module configfs.
[6.767339] systemd[1]: modprobe@drm.service: Deactivated successfully.
[6.767583] systemd[1]: Finished modprobe@drm.service - Load Kernel Module 
drm.
[6.769231] systemd[1]: Mounting sys-kernel-config.mount - Kernel 
Configuration File System...
[6.778991] systemd[1]: modprobe@efi_pstore.service: Deactivated 
successfully.
[6.779202] systemd[1]: Finished modprobe@efi_pstore.service - Load Kernel 
Module efi_pstore.
[6.791834] systemd[1]: Mounted dev-mqueue.mount - POSIX Message Queue File 
System.
[6.792129] systemd[1]: Mounted sys-kernel-debug.mount - Kernel Debug File 
System.
[6.792350] systemd[1]: Mounted sys-kernel-tracing.mount - Kernel Trace File 
System.
[6.792608] systemd[1]: Mounted sys-kernel-config.mount - Kernel 
Configuration File System.
[6.829931] systemd[1]: Mounted dev-hugepages.mount - Huge Pages File System.
[6.853427] loop: module loaded
[6.854583] systemd[1]: modprobe@loop.service: Deactivated successfully.
[6.854794] systemd[1]: Finished modprobe@loop.service - Load Kernel Module 
loop.
[6.857921] fuse: init (API version 7.37)
[6.858998] systemd[1]: modprobe@fuse.service: Deactivated successfully.
[6.859199] systemd[1]: