Bug#805579: nouveau: lockup with "[TTM] Buffer eviction failed"

2016-09-26 Thread Eric Cooper
Package: src:linux
Version: 4.6.4-1
Followup-For: Bug #805579

For a long time I didn't experience these lockups, but in the past
week or so they have started occurring again, about once a day:

[397467.956723] nouveau :01:00.0: fifo: read fault at 956000 engine 15 
[PCE0] client 01 [PCOPY0] reason 02 [PAGE_NOT_PRESENT] on channel 0 [001fe74000 
DRM]
[397467.956728] nouveau :01:00.0: fifo: ce0 engine fault on channel 0, 
recovering...
[397497.955174] [TTM] Buffer eviction failed
[397527.955845] [TTM] Buffer eviction failed
[397542.956067] [TTM] Buffer eviction failed

I haven't changed any of the hardware since my original report.
The bug seems to occur most often when I have lots of tabs open in
chromium.

-- Package-specific info:
** Version:
Linux version 4.6.0-1-amd64 (debian-ker...@lists.debian.org) (gcc version 5.4.0 
20160609 (Debian 5.4.0-6) ) #1 SMP Debian 4.6.4-1 (2016-07-18)

** Command line:
BOOT_IMAGE=/boot/vmlinuz-4.6.0-1-amd64 
root=UUID=1bd202e7-8d8c-4170-8388-abddc49cd075 ro quiet

** Tainted: E (8192)
 * Unsigned module has been loaded (currently expected).

** Kernel log:
[6.167715] systemd[1]: Started Remount Root and Kernel File Systems.
[6.183580] systemd[1]: Starting Load/Save Random Seed...
[6.192558] systemd[1]: Starting udev Coldplug all Devices...
[6.193067] systemd[1]: Activating swap /swap...
[6.242335] systemd[1]: Started udev Coldplug all Devices.
[6.360002] systemd[1]: Started Create list of required static device nodes 
for the current kernel.
[6.375615] systemd[1]: Starting Create Static Device Nodes in /dev...
[6.424481] systemd[1]: Started Set the console keyboard layout.
[6.433829] systemd[1]: Started Journal Service.
[6.725839] systemd-journald[1008]: Received request to flush runtime 
journal from PID 1
[6.781160] lp: driver loaded but no devices found
[6.812496] ppdev: user-space parallel port driver
[6.846108] parport_pc 00:07: reported by Plug and Play ACPI
[6.846173] parport0: PC-style at 0x378, irq 7 [PCSPP]
[6.850215] RPC: Registered named UNIX socket transport module.
[6.850217] RPC: Registered udp transport module.
[6.850218] RPC: Registered tcp transport module.
[6.850219] RPC: Registered tcp NFSv4.1 backchannel transport module.
[6.935616] lp0: using parport0 (interrupt-driven).
[6.994592] Installing knfsd (copyright (C) 1996 o...@monad.swb.de).
[7.442457] Adding 34179680k swap on /swap.  Priority:-1 extents:30 
across:38783584k FS
[8.463735] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
[8.467167] ACPI Warning: SystemIO range 
0x0400-0x041F conflicts with OpRegion 
0x0400-0x040F (\SMRG) (20160108/utaddress-255)
[8.467172] ACPI: If an ACPI driver is available for this device, you should 
use it instead of the native driver
[8.475249] input: Power Button as 
/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0C:00/input/input7
[8.475253] ACPI: Power Button [PWRB]
[8.475290] input: Power Button as 
/devices/LNXSYSTM:00/LNXPWRBN:00/input/input8
[8.475292] ACPI: Power Button [PWRF]
[8.540063] ACPI Warning: SystemIO range 
0x0828-0x082F conflicts with OpRegion 
0x0800-0x084F (\PMRG) (20160108/utaddress-255)
[8.540070] ACPI: If an ACPI driver is available for this device, you should 
use it instead of the native driver
[8.540074] ACPI Warning: SystemIO range 
0x0530-0x053F conflicts with OpRegion 
0x0500-0x053F (\GPS0) (20160108/utaddress-255)
[8.540077] ACPI: If an ACPI driver is available for this device, you should 
use it instead of the native driver
[8.540079] ACPI Warning: SystemIO range 
0x0500-0x052F conflicts with OpRegion 
0x0500-0x053F (\GPS0) (20160108/utaddress-255)
[8.540082] ACPI: If an ACPI driver is available for this device, you should 
use it instead of the native driver
[8.540117] lpc_ich: Resource conflict(s) found affecting gpio_ich
[8.634159] [drm] Initialized drm 1.1.0 20060810
[8.653154] wmi: Mapper loaded
[9.533626] EDAC MC: Ver: 3.0.0
[9.607130] EDAC MC0: Giving out device to module i7core_edac.c controller 
i7 core #0: DEV :3f:03.0 (POLLED)
[9.607166] EDAC PCI0: Giving out device to module i7core_edac controller 
EDAC PCI controller: DEV :3f:03.0 (POLLED)
[9.607172] EDAC i7core: Driver loaded, 1 memory controller(s) found.
[9.624166] sd 2:0:0:0: Attached scsi generic sg0 type 0
[9.624197] sr 3:0:0:0: Attached scsi generic sg1 type 5
[9.697431] nouveau :01:00.0: NVIDIA GF108 (0c1000a1)
[9.706592] input: Griffin PowerMate as 
/devices/pci:00/:00:1d.0/usb2/2-1/2-1.8/2-1.8.3/2-1.8.3:1.0/input/input9
[9.706643] usbcore: registered new interface driver powermate
[9.789145] iTCO_vendor_support: vendor-support=0
[9.790831] iTCO_wdt: 

Bug#805579: nouveau: lockup with "[TTM] Buffer eviction failed"

2015-11-20 Thread Andreas Boll
Control: reassign -1 src:linux

This is likely a problem in the nouveau kernel driver or TTM.
Reassigning to src:linux.

Thanks,
Andreas

On Thu, Nov 19, 2015 at 01:46:36PM -0500, Eric Cooper wrote:
> Package: libdrm-nouveau2
> Version: 2.4.65-3
> Severity: normal
> 
> My graphics system occasionally locks up, with error messages like the
> following in the log:
> 
> [163467.868937] nouveau E[   PFIFO][:01:00.0] read fault at 0xe4f000 
> [PAGE_NOT_PRESENT] from PCE0/PCOPY0 on channel 0x001fdb2000 [DRM]
> [163467.868945] nouveau E[   PFIFO][:01:00.0] PCE0 engine fault on 
> channel 0, recovering...
> [163507.492514] [TTM] Buffer eviction failed
> [163537.456185] [TTM] Buffer eviction failed
> [163567.483936] [TTM] Buffer eviction failed
> 
> When it happens, I am still able to ssh in from another machine, but
> the shutdown also hangs and I have to manually power cycle it.
> 
> lspci says my video card is: NVIDIA Corporation GF108 [GeForce GT 440] (rev 
> a1)
> Here are the nouveau messages during startup:
> 
> [7.485439] nouveau  [  DEVICE][:01:00.0] BOOT0  : 0x0c1000a1
> [7.485442] nouveau  [  DEVICE][:01:00.0] Chipset: GF108 (NVC1)
> [7.485444] nouveau  [  DEVICE][:01:00.0] Family : NVC0
> [7.617965] nouveau  [   VBIOS][:01:00.0] using image from PRAMIN
> [7.618041] nouveau  [   VBIOS][:01:00.0] BIT signature found
> [7.618042] nouveau  [   VBIOS][:01:00.0] version 70.08.4d.00.00
> [7.618357] nouveau  [ PMC][:01:00.0] MSI interrupts enabled
> [7.618387] nouveau W[ PFB][:01:00.0][0x] reclocking of 
> this ram type unsupported
> [7.618389] nouveau  [ PFB][:01:00.0] RAM type: DDR3
> [7.618390] nouveau  [ PFB][:01:00.0] RAM size: 512 MiB
> [7.618391] nouveau  [ PFB][:01:00.0]ZCOMP: 0 tags
> [7.650175] nouveau  [  PTHERM][:01:00.0] FAN control: none / external
> [7.650189] nouveau  [  PTHERM][:01:00.0] fan management: automatic
> [7.650212] nouveau  [  PTHERM][:01:00.0] internal sensor: yes
> [7.670155] nouveau  [ CLK][:01:00.0] 03: core 50 MHz memory 135 
> MHz
> [7.670159] nouveau  [ CLK][:01:00.0] 07: core 405 MHz memory 324 
> MHz
> [7.670162] nouveau  [ CLK][:01:00.0] 0f: core 810 MHz memory 900 
> MHz
> [7.670310] nouveau  [ CLK][:01:00.0] --: core 405 MHz memory 324 
> MHz
> [7.679396] nouveau  [ DRM] VRAM: 512 MiB
> [7.679398] nouveau  [ DRM] GART: 1048576 MiB
> [7.679404] nouveau  [ DRM] TMDS table version 2.0
> [7.679407] nouveau  [ DRM] DCB version 4.0
> [7.679410] nouveau  [ DRM] DCB outp 00: 01000302 00020030
> [7.679414] nouveau  [ DRM] DCB outp 01: 02000300 
> [7.679417] nouveau  [ DRM] DCB outp 02: 08011392 00020020
> [7.679420] nouveau  [ DRM] DCB outp 03: 04022310 
> [7.679423] nouveau  [ DRM] DCB conn 00: 1030
> [7.679426] nouveau  [ DRM] DCB conn 01: 2161
> [7.679429] nouveau  [ DRM] DCB conn 02: 0200
> [7.694215] nouveau  [ DRM] MM: using COPY0 for buffer copies
> [7.852772] nouveau  [ DRM] allocated 1920x1200 fb: 0x6, bo 
> 8807fa975000
> [7.852873] fbcon: nouveaufb (fb0) is primary device
> [7.948094] nouveau :01:00.0: fb0: nouveaufb frame buffer device
> [7.948096] nouveau :01:00.0: registered panic notifier
> [7.965941] [drm] Initialized nouveau 1.2.2 20120801 for :01:00.0 on 
> minor 0
> 
> -- System Information:
> Debian Release: stretch/sid
>   APT prefers testing
>   APT policy: (500, 'testing'), (400, 'unstable'), (1, 'experimental')
> Architecture: amd64 (x86_64)
> 
> Kernel: Linux 4.2.0-1-amd64 (SMP w/8 CPU cores)
> Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
> Shell: /bin/sh linked to /bin/dash
> Init: systemd (via /run/systemd/system)
> 
> Versions of packages libdrm-nouveau2 depends on:
> ii  libc62.19-22
> ii  libdrm2  2.4.65-3
> 
> libdrm-nouveau2 recommends no packages.
> 
> libdrm-nouveau2 suggests no packages.
> 
> -- no debconf information


signature.asc
Description: Digital signature


Bug#805579: nouveau: lockup with "[TTM] Buffer eviction failed"

2015-11-19 Thread Eric Cooper
Package: libdrm-nouveau2
Version: 2.4.65-3
Severity: normal

My graphics system occasionally locks up, with error messages like the
following in the log:

[163467.868937] nouveau E[   PFIFO][:01:00.0] read fault at 0xe4f000 
[PAGE_NOT_PRESENT] from PCE0/PCOPY0 on channel 0x001fdb2000 [DRM]
[163467.868945] nouveau E[   PFIFO][:01:00.0] PCE0 engine fault on channel 
0, recovering...
[163507.492514] [TTM] Buffer eviction failed
[163537.456185] [TTM] Buffer eviction failed
[163567.483936] [TTM] Buffer eviction failed

When it happens, I am still able to ssh in from another machine, but
the shutdown also hangs and I have to manually power cycle it.

lspci says my video card is: NVIDIA Corporation GF108 [GeForce GT 440] (rev a1)
Here are the nouveau messages during startup:

[7.485439] nouveau  [  DEVICE][:01:00.0] BOOT0  : 0x0c1000a1
[7.485442] nouveau  [  DEVICE][:01:00.0] Chipset: GF108 (NVC1)
[7.485444] nouveau  [  DEVICE][:01:00.0] Family : NVC0
[7.617965] nouveau  [   VBIOS][:01:00.0] using image from PRAMIN
[7.618041] nouveau  [   VBIOS][:01:00.0] BIT signature found
[7.618042] nouveau  [   VBIOS][:01:00.0] version 70.08.4d.00.00
[7.618357] nouveau  [ PMC][:01:00.0] MSI interrupts enabled
[7.618387] nouveau W[ PFB][:01:00.0][0x] reclocking of this 
ram type unsupported
[7.618389] nouveau  [ PFB][:01:00.0] RAM type: DDR3
[7.618390] nouveau  [ PFB][:01:00.0] RAM size: 512 MiB
[7.618391] nouveau  [ PFB][:01:00.0]ZCOMP: 0 tags
[7.650175] nouveau  [  PTHERM][:01:00.0] FAN control: none / external
[7.650189] nouveau  [  PTHERM][:01:00.0] fan management: automatic
[7.650212] nouveau  [  PTHERM][:01:00.0] internal sensor: yes
[7.670155] nouveau  [ CLK][:01:00.0] 03: core 50 MHz memory 135 MHz
[7.670159] nouveau  [ CLK][:01:00.0] 07: core 405 MHz memory 324 MHz
[7.670162] nouveau  [ CLK][:01:00.0] 0f: core 810 MHz memory 900 MHz
[7.670310] nouveau  [ CLK][:01:00.0] --: core 405 MHz memory 324 MHz
[7.679396] nouveau  [ DRM] VRAM: 512 MiB
[7.679398] nouveau  [ DRM] GART: 1048576 MiB
[7.679404] nouveau  [ DRM] TMDS table version 2.0
[7.679407] nouveau  [ DRM] DCB version 4.0
[7.679410] nouveau  [ DRM] DCB outp 00: 01000302 00020030
[7.679414] nouveau  [ DRM] DCB outp 01: 02000300 
[7.679417] nouveau  [ DRM] DCB outp 02: 08011392 00020020
[7.679420] nouveau  [ DRM] DCB outp 03: 04022310 
[7.679423] nouveau  [ DRM] DCB conn 00: 1030
[7.679426] nouveau  [ DRM] DCB conn 01: 2161
[7.679429] nouveau  [ DRM] DCB conn 02: 0200
[7.694215] nouveau  [ DRM] MM: using COPY0 for buffer copies
[7.852772] nouveau  [ DRM] allocated 1920x1200 fb: 0x6, bo 
8807fa975000
[7.852873] fbcon: nouveaufb (fb0) is primary device
[7.948094] nouveau :01:00.0: fb0: nouveaufb frame buffer device
[7.948096] nouveau :01:00.0: registered panic notifier
[7.965941] [drm] Initialized nouveau 1.2.2 20120801 for :01:00.0 on 
minor 0

-- System Information:
Debian Release: stretch/sid
  APT prefers testing
  APT policy: (500, 'testing'), (400, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 4.2.0-1-amd64 (SMP w/8 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages libdrm-nouveau2 depends on:
ii  libc62.19-22
ii  libdrm2  2.4.65-3

libdrm-nouveau2 recommends no packages.

libdrm-nouveau2 suggests no packages.

-- no debconf information