I've gotten the same errors on one of my SL 7 systems since the updates on Monday & Tuesday last week. I thought it was a hardware failure of the on-board GPU (an ASUS M5A78L-M/USB3 motherboard with on-board Radeon 3000), so I put in a PCIe graphics card (Radeon X1300/X1550) and didn't have further problems. I have 6 other SL 7 systems with a similar m/b and GPU, and didn't notice the same problem with any of the others, but now that I scrutinize the logs I see that one other of these systems got the same errors last Tuesday. It would seem that it doesn't affect all Radeons equally, and it's inconsistent even on the 3000 chipset, but it was happening quite frequently on one of my systems.

Looks like I need to do more testing on all these systems, and try backing out kernel and/or Xorg updates to see which one brought in this error.

On 2018-12-03 09:16, Andreas Nowack wrote:
Hello,

since the update of the Xorg packages last week I see the problems with the graphics card in /var/log/messages (see below). Every now and then, the graphics output is delayed. From time to time, the screen gets distorted during login or the computer does not accept any keyboard or mouse input.

Is this a known problem?


Dec  3 13:39:30 lxcip04 kernel: [  282.721762] radeon 0000:01:05.0: ring 0 stalled for more than 27178msec Dec  3 13:39:30 lxcip04 kernel: [  282.721771] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:30 lxcip04 kernel: [  283.222747] radeon 0000:01:05.0: ring 0 stalled for more than 27679msec Dec  3 13:39:30 lxcip04 kernel: [  283.222757] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:31 lxcip04 kernel: [  283.723741] radeon 0000:01:05.0: ring 0 stalled for more than 28180msec Dec  3 13:39:31 lxcip04 kernel: [  283.723751] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:31 lxcip04 kernel: [  284.224735] radeon 0000:01:05.0: ring 0 stalled for more than 28681msec Dec  3 13:39:31 lxcip04 kernel: [  284.224744] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:32 lxcip04 kernel: [  284.725725] radeon 0000:01:05.0: ring 0 stalled for more than 29182msec Dec  3 13:39:32 lxcip04 kernel: [  284.725735] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:32 lxcip04 kernel: [  285.226720] radeon 0000:01:05.0: ring 0 stalled for more than 29683msec Dec  3 13:39:32 lxcip04 kernel: [  285.226729] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d2 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:33 lxcip04 kernel: [  285.544838] radeon 0000:01:05.0: Saved 89 dwords of commands on ring 0. Dec  3 13:39:33 lxcip04 kernel: [  285.544851] radeon 0000:01:05.0: GPU softreset: 0x00000009 Dec  3 13:39:33 lxcip04 kernel: [  285.544857] radeon 0000:01:05.0:   R_008010_GRBM_STATUS      = 0xE4703030 Dec  3 13:39:33 lxcip04 kernel: [  285.544862] radeon 0000:01:05.0:   R_008014_GRBM_STATUS2     = 0x00110103 Dec  3 13:39:33 lxcip04 kernel: [  285.544866] radeon 0000:01:05.0:   R_000E50_SRBM_STATUS      = 0x20000040 Dec  3 13:39:33 lxcip04 kernel: [  285.544871] radeon 0000:01:05.0:   R_008674_CP_STALLED_STAT1 = 0x00000000 Dec  3 13:39:33 lxcip04 kernel: [  285.544875] radeon 0000:01:05.0:   R_008678_CP_STALLED_STAT2 = 0x00008002 Dec  3 13:39:33 lxcip04 kernel: [  285.544880] radeon 0000:01:05.0:   R_00867C_CP_BUSY_STAT     = 0x00008084 Dec  3 13:39:33 lxcip04 kernel: [  285.544884] radeon 0000:01:05.0:   R_008680_CP_STAT          = 0x80018645 Dec  3 13:39:33 lxcip04 kernel: [  285.544889] radeon 0000:01:05.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57 Dec  3 13:39:33 lxcip04 kernel: [  285.599358] radeon 0000:01:05.0: R_008020_GRBM_SOFT_RESET=0x00007FEF Dec  3 13:39:33 lxcip04 kernel: [  285.599415] radeon 0000:01:05.0: SRBM_SOFT_RESET=0x00000100 Dec  3 13:39:33 lxcip04 kernel: [  285.601524] radeon 0000:01:05.0:   R_008010_GRBM_STATUS      = 0xA0003030 Dec  3 13:39:33 lxcip04 kernel: [  285.601530] radeon 0000:01:05.0:   R_008014_GRBM_STATUS2     = 0x00000003 Dec  3 13:39:33 lxcip04 kernel: [  285.601534] radeon 0000:01:05.0:   R_000E50_SRBM_STATUS      = 0x20008040 Dec  3 13:39:33 lxcip04 kernel: [  285.601539] radeon 0000:01:05.0:   R_008674_CP_STALLED_STAT1 = 0x00000000 Dec  3 13:39:33 lxcip04 kernel: [  285.601544] radeon 0000:01:05.0:   R_008678_CP_STALLED_STAT2 = 0x00000000 Dec  3 13:39:33 lxcip04 kernel: [  285.601548] radeon 0000:01:05.0:   R_00867C_CP_BUSY_STAT     = 0x00000000 Dec  3 13:39:33 lxcip04 kernel: [  285.601553] radeon 0000:01:05.0:   R_008680_CP_STAT          = 0x80100000 Dec  3 13:39:33 lxcip04 kernel: [  285.601557] radeon 0000:01:05.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57 Dec  3 13:39:33 lxcip04 kernel: [  285.601565] radeon 0000:01:05.0: GPU reset succeeded, trying to resume Dec  3 13:39:33 lxcip04 kernel: [  285.621315] [drm] PCIE GART of 512M enabled (table at 0x00000000C0040000). Dec  3 13:39:33 lxcip04 kernel: [  285.621336] radeon 0000:01:05.0: WB enabled Dec  3 13:39:33 lxcip04 kernel: [  285.621344] radeon 0000:01:05.0: fence driver on ring 0 use gpu addr 0x00000000a0000c00 and cpu addr 0xffffa0f72f3adc00 Dec  3 13:39:33 lxcip04 kernel: [  285.652545] [drm] ring test on 0 succeeded in 1 usecs Dec  3 13:39:43 lxcip04 kernel: [  295.747591] radeon 0000:01:05.0: ring 0 stalled for more than 10020msec Dec  3 13:39:43 lxcip04 kernel: [  295.747602] radeon 0000:01:05.0: GPU lockup (current fence id 0x00000000000006d4 last fence id 0x00000000000006d5 on ring 0) Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 kernel: [  295.749700] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait failed (-35). Dec  3 13:39:43 lxcip04 kernel: [  295.749748] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-35). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16). Dec  3 13:39:43 lxcip04 org.gnome.Shell.desktop[10501]: radeon: The kernel rejected CS, see dmesg for more information (-16).

Here is a list of installed relevant packages:

kernel-3.10.0-957.1.3.el7.x86_64
xorg-x11-apps-7.7-7.el7.x86_64
xorg-x11-docs-1.6-7.el7.noarch
xorg-x11-drivers-7.7-6.el7.x86_64
xorg-x11-drv-ati-18.0.1-1.el7.x86_64
xorg-x11-drv-dummy-0.3.7-1.el7.1.x86_64
xorg-x11-drv-evdev-2.10.6-1.el7.x86_64
xorg-x11-drv-fbdev-0.5.0-1.el7.x86_64
xorg-x11-drv-intel-2.99.917-28.20180530.el7.x86_64
xorg-x11-drv-keyboard-1.9.0-1.el7.x86_64
xorg-x11-drv-libinput-0.27.1-2.el7.x86_64
xorg-x11-drv-mouse-1.9.2-2.el7.x86_64
xorg-x11-drv-nouveau-1.0.15-1.el7.x86_64
xorg-x11-drv-openchrome-0.5.0-3.el7.1.x86_64
xorg-x11-drv-qxl-0.1.5-4.el7.1.x86_64
xorg-x11-drv-synaptics-1.9.0-2.el7.x86_64
xorg-x11-drv-v4l-0.2.0-49.el7.x86_64
xorg-x11-drv-vesa-2.4.0-1.el7.x86_64
xorg-x11-drv-vmmouse-13.1.0-1.el7.1.x86_64
xorg-x11-drv-vmware-13.2.1-1.el7.1.x86_64
xorg-x11-drv-void-1.4.1-2.el7.1.x86_64
xorg-x11-drv-wacom-0.36.1-1.el7.x86_64
xorg-x11-fonts-ISO8859-1-100dpi-7.5-9.el7.noarch
xorg-x11-fonts-ISO8859-1-75dpi-7.5-9.el7.noarch
xorg-x11-fonts-misc-7.5-9.el7.noarch
xorg-x11-fonts-Type1-7.5-9.el7.noarch
xorg-x11-font-utils-7.5-21.el7.x86_64
xorg-x11-proto-devel-2018.4-1.el7.noarch
xorg-x11-server-common-1.20.1-5.1.sl7.x86_64
xorg-x11-server-utils-7.7-20.el7.x86_64
xorg-x11-server-Xorg-1.20.1-5.1.sl7.x86_64
xorg-x11-utils-7.5-23.el7.x86_64
xorg-x11-xauth-1.0.9-1.el7.x86_64
xorg-x11-xbitmaps-1.1.1-6.el7.noarch
xorg-x11-xinit-1.3.4-2.el7.x86_64
xorg-x11-xkb-utils-7.7-14.el7.x86_64


Best regards,
  Andreas

------------------------------------------------------------------------
  Dr. Andreas Nowack               email: now...@physik.rwth-aachen.de
  RWTH Aachen
  III. Phys. Institut B
  Sommerfeldstr. / Physikzentrum   phone: +49 241 80-27282
  D-52056 Aachen                     fax: +49 241 80-22244
  Germany



--
Gilles R. Detillieux              E-mail: <grde...@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    
https://urldefense.proofpoint.com/v2/url?u=http-3A__www.scrc.umanitoba.ca_&d=DwIDbA&c=gRgGjJ3BkIsb5y6s49QqsA&r=gd8BzeSQcySVxr0gDWSEbN-P-pgDXkdyCtaMqdCgPPdW1cyL5RIpaIYrCn8C5x2A&m=63PoGckGjmKhHxdx62gqtp6MuytQnlk2lCpyEl9rai4&s=0kgvU4tW7HeYQVWQyJ1tXTI8l-j2kg1VtAta9SVmkOk&e=
Dept. of Physiology and Pathophysiology, Faculty of Health Sciences,
Univ. of Manitoba  Winnipeg, MB  R3E 0J9  (Canada)

Reply via email to