Hello folks,

I wonder if any of you who is using an AMD GPU (especially newer ones) has encountered the same problem as I do.

My problem is that sometimes the screen just freezes entirely, and I have to switch to another TTY and back in order to get it unstuck. But the same freeze will usually happen again after I get the things unstuck. Restart my PC doesn't fix the problem.

Here are some informations about my system:

- CPU/GPU: AMD Ryzen 7 PRO 7840U w/ Radeon 780M Graphics
- vainfo output: in attachment `vainfo-output`
- system log: in attachment `sys-log`

I do find a workaround online somewhere, that is to set `amdgpu.vm_update_mode=3` kernel parameter, and it seems to fix the problem.

I never have this kind of problem on Linux distros I used before Debian (Void, OpenSUSE,) so if anyone has some idea why this happens, please give me some advice, thanks!

--
Best,

ID
vainfo: VA-API version: 1.17 (libva 2.12.0)
vainfo: Driver version: Mesa Gallium driver 22.3.6 for AMD Radeon Graphics 
(gfx1103_r1, LLVM 15.0.6, DRM 3.54, 6.6.13+bpo-amd64)
vainfo: Supported profile and entrypoints
      VAProfileH264ConstrainedBaseline: VAEntrypointVLD
      VAProfileH264ConstrainedBaseline: VAEntrypointEncSlice
      VAProfileH264Main               : VAEntrypointVLD
      VAProfileH264Main               : VAEntrypointEncSlice
      VAProfileH264High               : VAEntrypointVLD
      VAProfileH264High               : VAEntrypointEncSlice
      VAProfileHEVCMain               : VAEntrypointVLD
      VAProfileHEVCMain               : VAEntrypointEncSlice
      VAProfileHEVCMain10             : VAEntrypointVLD
      VAProfileHEVCMain10             : VAEntrypointEncSlice
      VAProfileJPEGBaseline           : VAEntrypointVLD
      VAProfileVP9Profile0            : VAEntrypointVLD
      VAProfileVP9Profile2            : VAEntrypointVLD
      VAProfileAV1Profile0            : VAEntrypointVLD
      VAProfileNone                   : VAEntrypointVideoProc
Jul 02 15:05:02 thinkpad-btw kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* 
ring gfx_0.0.0 timeout, signaled seq=4611833, emitted seq=4611836
Jul 02 15:05:02 thinkpad-btw kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* 
Process information: process gnome-shell pid 1731 thread gnome-shel:cs0 pid 1756
Jul 02 15:05:02 thinkpad-btw kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset 
begin!
Jul 02 15:05:02 thinkpad-btw gnome-shell[1731]: amdgpu: The CS has been 
rejected (-125), but the context isn't robust.
Jul 02 15:05:02 thinkpad-btw gnome-shell[1731]: amdgpu: The process will be 
terminated.
Jul 02 15:05:02 thinkpad-btw kernel: amdgpu_cs_ioctl: 12 callbacks suppressed
Jul 02 15:05:02 thinkpad-btw kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* 
Failed to initialize parser -125!
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:03 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:03 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:04 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:04 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:04 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:04 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:04 thinkpad-btw kernel: 
[drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES 
failed to response msg=3
Jul 02 15:05:04 thinkpad-btw kernel: [drm:amdgpu_mes_unmap_legacy_queue 
[amdgpu]] *ERROR* failed to unmap legacy queue
Jul 02 15:05:04 thinkpad-btw kernel: [drm:gfx_v11_0_hw_fini [amdgpu]] *ERROR* 
failed to halt cp gfx
Jul 02 15:05:04 thinkpad-btw kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 02 15:05:04 thinkpad-btw kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset 
succeeded, trying to resume

Reply via email to