[Kernel-packages] [Bug 1788044] Re: task kworker blocked for more than 120 seconds nouveau

2024-04-21 Thread Rick Sayre
This is the closest, most recent report I could find that also appears
kernel related.
Kernel 6.5.0-27  -- works fine
Kernel 6.5.0-28  -- graphics hard-hang: sddm never displays, virtual
   terminals can't be activated
   ssh works, but sddm and X11 processes cannot be killed, reboot hangs,
   and a hard power cycle is required


I have seen reports like this going back literally years, yet seemingly
with no resolution.  Now I've been bitten too, in a way that looks
suspiciously kernel-version related.
The common internet wisdom of "use the nvidia drivers" is unworkable, as
old NVIDIA hardware is not supported on 22.04:
04:00.0 VGA compatible controller: NVIDIA Corporation MCP89 [GeForce 320M] (rev a2)

Again, for me, this is with 22.04, kernel 6.5.0-28

None of the various combinations of nouveau kernel boot flags that have
previously helped reduce display garbage or sddm troubles make any
difference.

Here's a relevant boot log:
Apr 20 21:06:29 Konnekt sddm-greeter[922]: Adding view for "LVDS-1" QRect(0,0 
1280x800)
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 216004 put 216088 ib_get 0007 ib_put 
0007 state 800081a4 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 21608c put 217b7c ib_get 0009 ib_put 
000a state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 218ff0 put 21932c ib_get 000d ib_put 
0012 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219a48 put 219a64 ib_get 001f ib_put 
0024 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219a64 put 219a7c ib_get 0021 ib_put 
0024 state 8024 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219a7c put 219a8c ib_get 0023 ib_put 
0024 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219a8c put 219ccc ib_get 0025 ib_put 
002a state 8024 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219de8 put 219df8 ib_get 002f ib_put 
0030 state 4004 (err: INVALID_MTHD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: CACHE_ERROR - ch 3 
[sddm-greeter[922]] subc 0 mthd  data 2000
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 219ef0 put 21a2f0 ib_get 0033 ib_put 
0038 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 0020b10960 put 0020b10ac8 ib_get 0042 ib_put 
0044 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 21aff8 put 21bdf4 ib_get 004b ib_put 
004c state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 21c248 put 21c620 ib_get 004d ib_put 
004e state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 417a40 put 417a4c ib_get 0056 ib_put 
0058 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 417b54 put 417b5c ib_get 005c ib_put 
005e state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 4d3370 put 4d33f4 ib_get 0062 ib_put 
0064 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 4db3c0 put 4db450 ib_get 0068 ib_put 
006a state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 21d000 put 21d150 ib_get 006b ib_put 
006c state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 417c50 put 417c5c ib_get 006e ib_put 
0070 state 8000 (err: INVALID_CMD) push 00400040
Apr 20 21:06:29 Konnekt kernel: nouveau :04:00.0: fifo: DMA_PUSHER - ch 3 
[sddm-greeter[922]] get 4d3630 put 4d3684 ib_get 007a 
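
For anyone comparing their own logs against the one above, here is a
minimal sketch (Python, not part of the original report) that tallies the
"(err: ...)" names in nouveau DMA_PUSHER lines. The function name
tally_pusher_errors is made up for illustration, and the pattern is based
only on the lines quoted here, so treat it as an assumption rather than a
complete description of what nouveau can print.

```python
import re
from collections import Counter

# Matches the "(err: NAME)" field seen in the DMA_PUSHER lines quoted above.
ERR_FIELD = re.compile(r"\(err:\s*([A-Z_]+)\)")


def tally_pusher_errors(log_text):
    """Count error names such as INVALID_CMD in journal text.

    Mail clients wrap long journal lines, so all whitespace (including
    newlines) is collapsed to single spaces before matching.
    """
    flat = " ".join(log_text.split())
    return Counter(ERR_FIELD.findall(flat))
```

It could be fed the text of the kernel log for the current boot (for
example the output of "journalctl -k -b"); the resulting Counter shows
which error names dominate.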

[Kernel-packages] [Bug 1788044] Re: task kworker blocked for more than 120 seconds nouveau

2024-04-21 Thread Rick Sayre
I have found a fix.
First I tried adding "nouveau.modeset=0" to the boot flags, which got the
machine to boot, but with no greeter.
sddm appeared to be running happily, but it failed to start the display
server, and restarting it did not help.
"startx" worked, which gave me hope.

It appeared that systemd had determined the session "seat" to have
"CanGraphical" set to false.

Then I found this:
https://github.com/mikhailnov/systemd/commit/c00bf275fdfbad3a9db8934b5e266b6abbdb8443

There's a heuristic hack which sets CanGraphical when the kernel starts
without graphics, but only if this is done via "nomodeset" rather than
the driver-specific ".modeset" flags.
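
The CanGraphical state mentioned above can be read back from logind. The
following is a rough Python sketch, not something from the original
report: it assumes the seat is named "seat0" (the usual default, but an
assumption here) and that "loginctl show-seat" prints the property as
"CanGraphical=yes" or "CanGraphical=no". The helper names are hypothetical.

```python
import subprocess


def parse_can_graphical(show_seat_output):
    """Return True if the text contains the line 'CanGraphical=yes'."""
    for line in show_seat_output.splitlines():
        key, sep, value = line.strip().partition("=")
        if sep and key == "CanGraphical":
            return value == "yes"
    raise ValueError("CanGraphical property not found")


def seat_can_graphical(seat="seat0"):
    """Ask logind for the seat's CanGraphical property (seat name assumed)."""
    out = subprocess.run(
        ["loginctl", "show-seat", seat, "--property=CanGraphical"],
        check=True, capture_output=True, text=True,
    ).stdout
    return parse_can_graphical(out)
```

If seat_can_graphical() returns False while a display server can still be
started by hand, that would match the situation described here.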

So I just added "nomodeset" to the boot flags, and now not only does
6.5.0-28 boot and land at a working greeter again, but all the weird
sddm graphics bugs I'd been encountering since 6.5 are gone.
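
Since the difference between the generic flag and the driver-specific one
turned out to matter, here is a small Python sketch for checking which of
the two a running system was actually booted with. It only classifies the
kernel command line; it does not reproduce whatever systemd does
internally. The function names are invented for this example.

```python
def modeset_flags(cmdline):
    """Classify how (if at all) kernel mode setting was disabled.

    "generic" is the bare nomodeset flag; "driver_specific" lists
    parameters of the form <driver>.modeset=0, e.g. nouveau.modeset=0.
    """
    params = cmdline.split()
    return {
        "generic": "nomodeset" in params,
        "driver_specific": [p for p in params if p.endswith(".modeset=0")],
    }


def read_cmdline(path="/proc/cmdline"):
    """Read the command line of the running kernel."""
    with open(path, encoding="ascii", errors="replace") as f:
        return f.read()
```

Calling modeset_flags(read_cmdline()) on the affected machine would be
expected to report "generic" as True once "nomodeset" has been added.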

Hope this helps someone else...   And maybe this kernel issue will get
fixed some day.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1788044

Title:
  task kworker blocked for more than 120 seconds nouveau

Status in linux package in Ubuntu:
  Incomplete
Status in xserver-xorg-video-nouveau package in Ubuntu:
  Confirmed

Bug description:
  I was using chromium when the whole system GUI stopped responding at
  15:42. This corresponds to the system journal at that point:

  Aug 20 15:42:56 dh3930 kernel: nouveau :01:00.0: fifo: SCHED_ERROR 0a 
[CTXSW_TIMEOUT]
  Aug 20 15:42:56 dh3930 kernel: nouveau :01:00.0: fifo: runlist 0: 
scheduled for recovery
  Aug 20 15:42:56 dh3930 kernel: nouveau :01:00.0: fifo: channel 15: killed
  Aug 20 15:42:56 dh3930 kernel: nouveau :01:00.0: fifo: engine 0: 
scheduled for recovery
  Aug 20 15:42:56 dh3930 kernel: nouveau :01:00.0: compiz[7682]: channel 15 
killed!
  Aug 20 15:45:50 dh3930 kernel: INFO: task kworker/u24:4:14623 blocked for 
more than 120 seconds.
  Aug 20 15:45:50 dh3930 kernel:   Not tainted 4.15.0-32-generic #35-Ubuntu
  Aug 20 15:45:50 dh3930 kernel: "echo 0 > 
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
  Aug 20 15:45:50 dh3930 kernel: kworker/u24:4   D0 14623  2 0x8000
  Aug 20 15:45:50 dh3930 kernel: Workqueue: events_unbound 
nv50_disp_atomic_commit_work [nouveau]
  Aug 20 15:45:50 dh3930 kernel: Call Trace:
  Aug 20 15:45:50 dh3930 kernel:  __schedule+0x291/0x8a0
  Aug 20 15:45:50 dh3930 kernel:  schedule+0x2c/0x80
  Aug 20 15:45:50 dh3930 kernel:  schedule_timeout+0x1cf/0x350
  Aug 20 15:45:50 dh3930 kernel:  ? nvif_object_ioctl+0x47/0x50 [nouveau]
  Aug 20 15:45:50 dh3930 kernel:  ? nouveau_bo_rd32+0x2a/0x30 [nouveau]
  Aug 20 15:45:50 dh3930 kernel:  ? nv84_fence_read+0x2e/0x30 [nouveau]
  Aug 20 15:45:50 dh3930 kernel:  ? nouveau_fence_no_signaling+0x2a/0x80 
[nouveau]
  Aug 20 15:45:50 dh3930 kernel:  dma_fence_default_wait+0x1c7/0x260
  Aug 20 15:45:50 dh3930 kernel:  ? dma_fence_release+0xa0/0xa0
  Aug 20 15:45:50 dh3930 kernel:  dma_fence_wait_timeout+0x3e/0xf0
  Aug 20 15:45:50 dh3930 kernel:  drm_atomic_helper_wait_for_fences+0x63/0xc0 
[drm_kms_helper]
  Aug 20 15:45:50 dh3930 kernel:  nv50_disp_atomic_commit_tail+0x55/0x3b10 
[nouveau]
  Aug 20 15:45:50 dh3930 kernel:  nv50_disp_atomic_commit_work+0x12/0x20 
[nouveau]
  Aug 20 15:45:50 dh3930 kernel:  process_one_work+0x1de/0x410
  Aug 20 15:45:50 dh3930 kernel:  worker_thread+0x32/0x410
  Aug 20 15:45:50 dh3930 kernel:  kthread+0x121/0x140
  Aug 20 15:45:50 dh3930 kernel:  ? process_one_work+0x410/0x410
  Aug 20 15:45:50 dh3930 kernel:  ? kthread_create_worker_on_cpu+0x70/0x70
  Aug 20 15:45:50 dh3930 kernel:  ret_from_fork+0x35/0x40

  The 'blocked for more than 120 seconds' message and call trace
  repeated every ~121 seconds until I rebooted. At that point, the
  following additional line appeared with the 'blocked for more than 120
  seconds' message:

  Aug 20 16:01:51 dh3930 kernel: nouveau :01:00.0: chromium-
  browse[14187]: failed to idle channel 20 [chromium-browse[14187]]

  ProblemType: Bug
  DistroRelease: Ubuntu 18.04
  Package: linux-image-4.15.0-32-generic 4.15.0-32.35
  ProcVersionSignature: Ubuntu 4.15.0-32.35-generic 4.15.18
  Uname: Linux 4.15.0-32-generic x86_64
  ApportVersion: 2.20.9-0ubuntu7.2
  Architecture: amd64
  AudioDevicesInUse:
   USERPID ACCESS COMMAND
   /dev/snd/controlC1:  david  2460 F pulseaudio
   /dev/snd/controlC0:  david  2460 F pulseaudio
  CurrentDesktop: Unity:Unity7:ubuntu
  Date: Mon Aug 20 16:07:40 2018
  EcryptfsInUse: Yes
  HibernationDevice: RESUME=UUID=9e4f3d6a-f1b3-40c0-8c97-97d861a7ce11
  InstallationDate: Installed on 2016-10-24 (664 days ago)
  InstallationMedia: Ubuntu 16.10 "Yakkety Yak" - Release amd64 (20161012.2)
  MachineType: System manufacturer System Product Name
  ProcFB: 0 nouveaufb
  ProcKernelCmdLine: BOOT_IMAGE=/@/boot/vmlinuz-4.15.0-32-generic 
root=UUID=311cc681-166d-47bd-847d-f41c81578c1a ro rootflags=subvol=@ quiet

[Kernel-packages] [Bug 2063254] [NEW] firewire_ohci stops under moderate load

2024-04-23 Thread Rick Sayre
Public bug reported:

This began in 6.5.0-27 as far as I can tell, and is very noticeable in
6.5.0-28.
While using snd_dice to play back audio, doing anything else, even just
scrolling in Firefox or launching this bug report, causes the firewire
driver to quit:

Apr 23 12:48:53 Konnekt kernel: firewire_ohci :01:00.0: DMA context
IT0 has stopped, error code: evt_timeout
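
To spot further occurrences in a longer journal, something like the
following Python sketch could be used. It is an illustration only: the
pattern is derived from the single line quoted above, and the function
name find_stopped_contexts is hypothetical.

```python
import re

# Pattern taken from the journal line quoted above. The PCI address is
# skipped rather than parsed, since it is shown abbreviated in the report.
STOPPED = re.compile(
    r"firewire_ohci\b.*?DMA\s+context\s+(\w+)\s+has\s+stopped,"
    r"\s+error\s+code:\s+(\w+)"
)


def find_stopped_contexts(log_text):
    """Return (context, error_code) pairs for each 'has stopped' message.

    Whitespace is collapsed first so that lines wrapped by a mail client
    still match.
    """
    flat = " ".join(log_text.split())
    return STOPPED.findall(flat)
```

For the line above this yields the context name IT0 and the error code
evt_timeout.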

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: linux-modules-extra-6.5.0-28-generic 6.5.0-28.29~22.04.1
ProcVersionSignature: Ubuntu 6.5.0-28.29~22.04.1-generic 6.5.13
Uname: Linux 6.5.0-28-generic x86_64
NonfreeKernelModules: wl
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
CurrentDesktop: LXQt
Date: Tue Apr 23 13:08:53 2024
Dependencies:
 linux-modules-6.5.0-28-generic 6.5.0-28.29~22.04.1
 wireless-regdb 2022.06.06-0ubuntu1~22.04.1
InstallationDate: Installed on 2023-10-31 (175 days ago)
InstallationMedia: Lubuntu 22.04.1 LTS "Jammy Jellyfish" - Release amd64 
(20220809)
SourcePackage: linux-hwe-6.5
UpgradeStatus: No upgrade log present (probably fresh install)

** Affects: linux-hwe-6.5 (Ubuntu)
 Importance: Undecided
 Status: New


** Tags: amd64 apport-bug jammy

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-hwe-6.5 in Ubuntu.
https://bugs.launchpad.net/bugs/2063254

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-hwe-6.5/+bug/2063254/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 2063254] Re: firewire_ohci stops under moderate load

2024-04-27 Thread Rick Sayre
Update: this is strangely related to nouveau.
This machine has an "NVIDIA MCP89 [GeForce 320M]", for which the nvidia
drivers have been removed from 22.04.  nouveau, when run normally, now
causes a hang [see other reports if curious].

The described "firewire_ohci quits when using firefox" behaviour happens
when the kernel is launched with the kernel flag
nomodeset

When it is launched instead with
nouveau.noaccel=1

...normal usage does not affect firewire_ohci, as expected and desired.
