[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
16.04 is EOL; if you can reproduce this on a newer release, please open a new bug. Thanks.

** Changed in: xorg-server-hwe-16.04 (Ubuntu)
       Status: Confirmed => Won't Fix

-- 
You received this bug notification because you are a member of Ubuntu-X, which is subscribed to xorg-server-hwe-16.04 in Ubuntu.
https://bugs.launchpad.net/bugs/1724047

Title:
  xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/xorg-server-hwe-16.04/+bug/1724047/+subscriptions

___
Mailing list: https://launchpad.net/~ubuntu-x-swat
Post to     : ubuntu-x-swat@lists.launchpad.net
Unsubscribe : https://launchpad.net/~ubuntu-x-swat
More help   : https://help.launchpad.net/ListHelp
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
That didn't work either. Using the "intel" driver with:

    Option "AccelMethod" "UXA"

still resulted in the following GPU HANG:

    [drm] GPU HANG: ecode 9:0:0x85da, in compiz [3398], reason: Hang on rcs0, action: reset
    i915 :00:02.0: Resetting rcs0 after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    i915 :00:02.0: Resetting chip after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -5

After all these tests, my current guess is that the hardware acceleration in my Web browsers (Firefox and Chromium) is triggering the error in the graphics driver or firmware.

I disabled the hardware acceleration in Firefox and Chromium. I also now avoid running anything that uses Electron, such as the Skype and Signal desktop snap packages, because I don't know how to disable the hardware acceleration in an Electron app.

After about a week, the problem now occurs only if I forget and leave Skype running for an afternoon. I'm hoping this can continue to function as a simple workaround.
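One way to check the guess that particular applications trigger the hang is to tally the GPU HANG lines by the process named in each one. The following is a minimal Python sketch, assuming the message format quoted in this thread; the function name count_hangs_by_process and the regular expression are illustrative and not part of any i915 tooling.

```python
import re
from collections import Counter

# Matches the i915 hang line as quoted in this thread, for example:
# "[drm] GPU HANG: ecode 9:0:0x85da, in compiz [3398], reason: Hang on rcs0, action: reset"
# Assumption: the process name contains no spaces.
HANG_RE = re.compile(
    r"GPU HANG: ecode (?P<ecode>[^,\s]+), in (?P<proc>\S+) \[(?P<pid>\d+)\]"
)


def count_hangs_by_process(lines):
    """Return a Counter mapping process name -> number of GPU HANG lines."""
    counts = Counter()
    for line in lines:
        match = HANG_RE.search(line)
        if match:
            counts[match.group("proc")] += 1
    return counts


# Sample lines copied from the reports in this bug.
sample = [
    "[drm] GPU HANG: ecode 9:0:0x85da, in compiz [3398], reason: Hang on rcs0, action: reset",
    "[drm] GPU HANG: ecode 9:0:0x85da, in Xorg [1784], reason: Hang on rcs0, action: reset",
    "[drm] GPU HANG: ecode 9:0:0x85da, in skypeforlinux [17067], reason: Hang on rcs0, action: reset",
    "i915 :00:02.0: Resetting chip after gpu hang",
]

hang_counts = count_hangs_by_process(sample)
for proc, n in hang_counts.most_common():
    print(proc, n)
```

To run it against a real log, pass an open file instead of the sample list, for example the kernel log at /var/log/kern.log. The process named in the message is only the one whose work was on the GPU when the hang was detected, so the tally is a hint rather than proof of the cause.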
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
Well, that didn't work for very long. Disabling the acceleration of the "intel" driver did not solve the problem for me. I tried three more changes, described below.

(1) Next I tried going back to the "modesetting" driver but with the acceleration disabled:

/etc/X11/xorg.conf

    Section "Device"
        Identifier "Intel Graphics"
        Driver "modesetting"
        Option "AccelMethod" "none"
    EndSection

Eventually, the screen went into some kind of infinite-updating loop among all the windows and was completely unresponsive. The kernel log file (/var/log/kern.log) contained the messages:

    Apr 12 10:00:25 tower kernel: [ 4941.921796] show_signal_msg: 21 callbacks suppressed
    Apr 12 10:00:25 tower kernel: [ 4941.921808] GpuWatchdog[7196]: segfault at 0 ip 55f89a514727 sp 7f8ec6d966d0 error 6 in signal-desktop[55f89733a000+53d7000]

I'm not sure whether the Signal desktop snap application triggered the problem, but those were the last messages before I had to power off the system.

(2) Next I removed the "/etc/X11/xorg.conf" file and reverted to the original Ubuntu 16.04 kernel version 4.4 instead of the Hardware Enablement (HWE) version 4.15. That eventually encountered the same infinite-updating loop problem.

(3) So a couple of days ago I switched back to the HWE kernel version 4.15 and updated the Xorg configuration to use UXA instead of the default SNA acceleration method. Several posts on the Internet suggested that it might solve problems with Intel Skylake graphics. (I have Intel HD Graphics P530 on a 2015 "Skylake" Xeon E3-1225v5 processor.)

/etc/X11/xorg.conf

    Section "Device"
        Identifier "Intel Graphics"
        Driver "intel"
        # Acceleration method: UXA (UMA Acceleration Architecture)
        Option "AccelMethod" "UXA"
    EndSection

So far, so good. It has been only a couple of days, though, and sometimes the problem takes several days to occur.
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
I just hit another GPU HANG, this time with the "intel" driver instead of the "modesetting" driver. The kernel log file (/var/log/kern.log) shows:

    Apr 6 08:59:36 tower kernel: [44387.837794] [drm] GPU HANG: ecode 9:0:0x85da, in Xorg [1751], reason: Hang on rcs0, action: reset
    i915 :00:02.0: Resetting rcs0 after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    i915 :00:02.0: Resetting chip after gpu hang
    asynchronous wait on fence i915:compiz[3304]/1:56366 timed out
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -5

Now I will try with the hardware acceleration disabled, still using the "intel" driver, as follows (/etc/X11/xorg.conf):

---
# Tries "intel" instead of "modesetting" driver to fix GPU HANG
# https://bugs.launchpad.net/ubuntu/+source/xorg-server-hwe-16.04/+bug/1724047
Section "Device"
    Identifier "Intel Graphics"
    Driver "intel"
    # Disables acceleration
    Option "NoAccel" "True"
    # Disables only 3D acceleration (Direct Rendering Infrastructure)
    # Option "DRI" "False"
EndSection
---
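The bracketed number in a kernel log line, such as [44387.837794] above, is the kernel's timestamp in seconds since boot, so it shows how long the system had been running when the hang was logged. The following is a small Python sketch of that conversion; the function name uptime_at is illustrative, and the stamp may not match wall-clock time if the machine was suspended in between.

```python
import re
from datetime import timedelta

# Kernel log lines carry a "[seconds.microseconds]" stamp counted from boot.
# The pattern requires a decimal point, so "[drm]" and "[1751]" do not match.
STAMP_RE = re.compile(r"\[\s*(\d+\.\d+)\]")


def uptime_at(line):
    """Return the time since boot for a kernel log line, or None if unstamped."""
    match = STAMP_RE.search(line)
    if match is None:
        return None
    return timedelta(seconds=float(match.group(1)))


# Line copied from the kernel log quoted in this message.
line = (
    "Apr 6 08:59:36 tower kernel: [44387.837794] [drm] GPU HANG: "
    "ecode 9:0:0x85da, in Xorg [1751], reason: Hang on rcs0, action: reset"
)

hang_uptime = uptime_at(line)
print(hang_uptime)
```

For the line above this gives a little over 12 hours of uptime, which is consistent with the hang appearing only after the system has been in use for a long while.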
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
I started to see the same GPU HANG error sometime this year (2020). I am using the default "modesetting" driver with automatic configuration and no "xorg.conf" file (/etc/X11/xorg.conf). The Xorg log file (/var/log/Xorg.1.log) is attached.

The following error occurred on April 1, 2020, at 18:36:30.

    [drm] GPU HANG: ecode 9:0:0x85da, in Xorg [1784], reason: Hang on rcs0, action: reset
    i915 :00:02.0: Resetting rcs0 after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    i915 :00:02.0: Resetting chip after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -5
    [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.

The next error occurred on April 4, 2020, at 12:15:13.

    [drm] GPU HANG: ecode 9:0:0x85da, in skypeforlinux [17067], reason: Hang on rcs0, action: reset
    i915 :00:02.0: Resetting rcs0 after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    i915 :00:02.0: Resetting chip after gpu hang
    [drm:gen8_reset_engines [i915]] *ERROR* rcs0: reset request timeout
    [drm:i915_reset [i915]] *ERROR* Failed to reset chip: -5
    [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.

I have now switched to the older "intel" driver to see whether that might solve the problem, using the following "xorg.conf" file:

    # Try "intel" instead of "modesetting" driver to fix GPU HANG.
    Section "Device"
        Identifier "Intel Graphics"
        Driver "intel"
        # Disables acceleration
        # Option "NoAccel" "True"
        # Disables only 3D acceleration (Direct Rendering Infrastructure)
        # Option "DRI" "False"
    EndSection

My system information is listed below (now showing the "intel" graphics driver).

    $ uname -a
    Linux tower 4.15.0-91-generic #92~16.04.1-Ubuntu SMP Fri Feb 28 14:57:22 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

    $ lsb_release -a
    No LSB modules are available.
    Distributor ID: Ubuntu
    Description:    Ubuntu 16.04.6 LTS
    Release:        16.04
    Codename:       xenial

    $ inxi -S -M -C -G -y 80
    System:    Host: tower Kernel: 4.15.0-91-generic x86_64 (64 bit)
               Desktop: Unity 7.4.5 Distro: Ubuntu 16.04 xenial
    Machine:   System: Dell product: Precision Tower 3420
               Mobo: Dell model: 08K0X7 v: A01 Bios: Dell v: 2.13.1 date: 06/14/2019
    CPU:       Quad core Intel Xeon E3-1225 v5 (-HT-MCP-) cache: 8192 KB
               clock speeds: max: 3300 MHz 1: 917 MHz 2: 1962 MHz 3: 1896 MHz 4: 1349 MHz
    Graphics:  Card: Intel HD Graphics P530
               Display Server: X.Org 1.19.6 driver: intel Resolution: 2560x1440@59.95hz
               GLX Renderer: Mesa DRI Intel HD Graphics P530 (Skylake GT2)
               GLX Version: 3.0 Mesa 18.0.5

** Attachment added: "Matched modesetting as autoconfigured driver 0"
   https://bugs.launchpad.net/ubuntu/+source/xorg-server-hwe-16.04/+bug/1724047/+attachment/5347151/+files/Xorg.1.log
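When no xorg.conf is present, the Xorg log records which drivers were matched automatically, as the attachment title "Matched modesetting as autoconfigured driver 0" shows. The following Python sketch pulls those entries out of a log; it assumes only the wording seen in that title, and the function name autoconfigured_drivers is illustrative.

```python
import re

# Pattern based solely on the line quoted in the attachment title above.
MATCHED_RE = re.compile(r"Matched (\S+) as autoconfigured driver (\d+)")


def autoconfigured_drivers(lines):
    """Return a list of (index, driver) pairs sorted by the index in the log."""
    found = []
    for line in lines:
        match = MATCHED_RE.search(line)
        if match:
            found.append((int(match.group(2)), match.group(1)))
    return sorted(found)


# Sample line taken from the attachment title in this message.
sample_log = ["Matched modesetting as autoconfigured driver 0"]

drivers = autoconfigured_drivers(sample_log)
print(drivers)
```

Passing an open file such as /var/log/Xorg.1.log in place of the sample list would list every driver that X matched. With an explicit "Device" section in xorg.conf the log may contain no such lines, so an empty result does not mean that no driver was loaded.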
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: xorg-server-hwe-16.04 (Ubuntu)
       Status: New => Confirmed
[Ubuntu-x-swat] [Bug 1724047] Re: xserver terminates frequently, journal says drm/i915: Resetting chip after gpu hang
that hang comes from the kernel