After running ereything with AMD_DEBUG=nongg, the "Waiting for fences" seems to be mostly gone, and I now get sdma0 timeouts. So this seems to be part of the gereral cluster of failures that seem to plague the linux navi drivers since the beginning.
Jun 22 07:24:43 alhazen kernel: [ 748.740480] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=134116, emitted seq=134118 Jun 22 07:24:43 alhazen kernel: [ 748.740549] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 3589 thread Xorg:cs0 pid 3591 Jun 22 07:24:43 alhazen kernel: [ 748.740552] [drm] GPU recovery disabled. Jun 22 07:25:28 alhazen kernel: [ 794.386634] GpuWatchdog[5797]: segfault at 0 ip 0000556cdabbccb9 sp 00007f6a540a06c0 error 6 in chrome[556cd6a4e000+7095000] Jun 22 07:25:28 alhazen kernel: [ 794.386642] Code: 00 79 09 48 8b 7d c0 e8 d5 14 2b fc c7 45 c0 aa aa aa aa 0f ae f0 41 8b 84 24 e0 00 00 00 89 45 c0 48 8d 7d c0 e8 b7 31 e9 fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 5d 41 5e Jun 22 07:25:38 alhazen kernel: [ 804.548718] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=29322, emitted seq=29326 Jun 22 07:25:38 alhazen kernel: [ 804.548788] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0 Jun 22 07:25:38 alhazen kernel: [ 804.548791] [drm] GPU recovery disabled. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1883493 Title: amdgpu hangs from time to time with *ERROR* Waiting for fences timed out! To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1883493/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs