[RESEND] nouveau regression 3.19: unable to load BIOS from ACPI
Hi Ben,

Since 3.19 the NV BIOS can no longer be loaded via ACPI. This breaks my HP laptop.

Looking at the recent changes (ad4a3626 split out shadow methods) in the bios shadow code, I think this happens:
- nvbios_shadow loops over all possible bios sources
- shadow_method
- shadow_score
- shadow_image tries to validate the image contents *before* loading it via ACPI calls
- nvbios_imagen calls nv_ro16 on the bios object, which tries to read a 16-bit value directly from memory.

Before the change, the code was:
- mthd->shadow(bios); which for ACPI calls nouveau_bios_shadow_acpi, which doesn't try to validate the image
- mthd->score = nouveau_bios_score(bios, mthd->rw); which validates the image

So shadowing always happened *before* trying to look at the bios data.

The relevant log is below.

Ortwin

3.18:
Feb 15 11:28:50 localhost kernel: nouveau [ DEVICE][:01:00.0] BOOT0 : 0x0e63c0a1
Feb 15 11:28:50 localhost kernel: nouveau [ DEVICE][:01:00.0] Chipset: GK106 (NVE6)
Feb 15 11:28:50 localhost kernel: nouveau [ DEVICE][:01:00.0] Family : NVE0
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] checking PRAMIN for image...
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] ... signature not found
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] checking PROM for image...
Feb 15 11:28:50 localhost kernel: fbcon: inteldrmfb (fb0) is primary device
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] ... signature not found
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] checking ACPI for image...
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] ... appears to be valid
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] using image from ACPI
Feb 15 11:28:50 localhost kernel: nouveau [ VBIOS][:01:00.0] BIT signature found

3.19:
Feb 15 11:30:40 localhost kernel: VGA switcheroo: detected Optimus DSM method \_SB_.PCI0.PEGP.DGFX handle
Feb 15 11:30:40 localhost kernel: nouveau :01:00.0: enabling device (0004 -> 0007)
Feb 15 11:30:40 localhost kernel: nouveau [ DEVICE][:01:00.0] BOOT0 : 0x0e63c0a1
Feb 15 11:30:40 localhost kernel: nouveau [ DEVICE][:01:00.0] Chipset: GK106 (NVE6)
Feb 15 11:30:40 localhost kernel: nouveau [ DEVICE][:01:00.0] Family : NVE0
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying ACPI...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : type 00, 65536 bytes
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : fetch failed
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] scored 0
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying ACPI...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : type 00, 65536 bytes
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : fetch failed
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] scored 0
Feb 15 11:30:40 localhost kernel: nouveau E[ VBIOS][:01:00.0] ACPI invalid
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking (null) for image...
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking PRAMIN for image...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying PRAMIN...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] ... not enabled
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking PROM for image...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying PROM...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : ROM signature () unknown
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] image 0 invalid
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] scored 0
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking ACPI for image...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying ACPI...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : type 00, 65536 bytes
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : fetch failed
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] scored 0
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking ACPI for image...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] trying ACPI...
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : type 00, 65536 bytes
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] : fetch failed
Feb 15 11:30:40 localhost kernel: nouveau D[ VBIOS][:01:00.0] scored 0
Feb 15 11:30:40 localhost kernel: nouveau [ VBIOS][:01:00.0] checking PCIROM for image...
Feb 15 11:30:40
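The ordering problem described in the message above can be illustrated outside the kernel. The following is a minimal standalone sketch, not the nouveau code: `struct source`, `struct shadow`, `fetch()`, `validate()` and both `score_*()` functions are hypothetical names invented for this example. It assumes a source (such as ACPI) whose bytes only become visible once they have been fetched into a shadow buffer, and it checks nothing beyond the 0x55 0xAA option ROM signature.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical BIOS source (think ACPI): its bytes are only reachable
 * through fetch(), never by reading the shadow buffer directly. */
struct source {
	const uint8_t *rom;
	size_t size;
};

/* Hypothetical shadow buffer; 'size' counts the bytes fetched so far. */
struct shadow {
	uint8_t data[16];
	size_t size;
};

/* Copy up to 'len' bytes from the source into the shadow buffer. */
static size_t fetch(const struct source *src, struct shadow *sh, size_t len)
{
	assert(src != NULL && sh != NULL);
	if (len > src->size)
		len = src->size;
	if (len > sizeof(sh->data))
		len = sizeof(sh->data);
	memcpy(sh->data, src->rom, len);
	sh->size = len;
	return len;
}

/* Check the 0x55 0xAA option ROM signature in the shadowed bytes. */
static int validate(const struct shadow *sh)
{
	return sh->size >= 2 && sh->data[0] == 0x55 && sh->data[1] == 0xaa;
}

/* Broken order: looks at the shadow buffer before anything was fetched. */
static int score_validate_first(const struct source *src, struct shadow *sh)
{
	if (!validate(sh))
		return 0;
	fetch(src, sh, 2);
	return 1;
}

/* Working order: fetch first, then validate what was fetched. */
static int score_fetch_first(const struct source *src, struct shadow *sh)
{
	fetch(src, sh, 2);
	return validate(sh);
}
```

Starting from an empty shadow buffer and a source that holds a valid signature, `score_validate_first()` gives 0 while `score_fetch_first()` gives 1. Whether the real 3.19 failure follows exactly this path is an assumption; the sketch only shows why the order of the two steps matters.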
[BUG] nouveau regression: ext monitor dead after resume
On 19.09.2014 19:01, Ilia Mirkin wrote:
> Try booting with log_buf_len=100M

Done, and the slightly lengthy files are attached to the Bugzilla entry.

Ortwin
[BUG] nouveau regression: ext monitor dead after resume
On 09/19/2014 07:01 PM, Ilia Mirkin wrote:
> Thanks! Hopefully you still have those kernels handy, as your logs got
> cut off.

Yeah, I noticed and hoped it wouldn't matter, as it is mostly the boot log that got cut off, up to the point where syslog came up (it's from /var/log/messages). So the suspend/resume cycle should be complete. But I can give it another try on Monday when I'm back in the office.

Ortwin
[BUG] nouveau regression: ext monitor dead after resume
On 19.09.2014 17:58, Ilia Mirkin wrote:
> git checkout 415f12efc1b2308411b2cbc3e82666b3db8a7758^

Thanks again. I confirm that Bugzilla 83550 is the same issue. I have attached the captured logs there for reference.

Ortwin
[BUG] nouveau regression: ext monitor dead after resume
On 18.09.2014 16:58, Ilia Mirkin wrote:
> This has been reported a few times already -- probably the same thing
> as bug https://bugs.freedesktop.org/show_bug.cgi?id=83550

Ah, thanks. I would like to try with that commit reverted, but unfortunately it no longer reverts cleanly, and my attempts to make a sensible merge were futile. If you can send me a patch that reverts the changes on 3.17-rc5 or 3.16, I will gladly try it out and give you the requested feedback.

Ortwin
[BUG] nouveau regression: ext monitor dead after resume
I have tried reverting these commits, but to no avail:

028791bb7d6 drm/nouveau/kms: restore fbcon after display has been resumed
456b0579fb0 drm/nouveau: use connector events for HPD instead of GPIO watching
[BUG] nouveau regression: ext monitor dead after resume
Hi,

Since 3.16 an external monitor stays dark after resume from sleep. I didn't manage to activate it again with xrandr. According to xrandr it is "connected" and configured with a mode, but I get no signal. This has happened since 3.16 and is still broken with 3.17-rc5.

Hardware: HP EliteBook 8540w
01:00.0 VGA compatible controller: NVIDIA Corporation GT215GLM [Quadro FX 1800M] (rev a2)

External monitor connected via DVI on the docking station.

XRandR output before and after suspend looks the same:

$ xrandr
Screen 0: minimum 320 x 200, current 1920 x 1080, maximum 8192 x 8192
LVDS-1 connected 1920x1080+0+0 (normal left inverted right x axis y axis) 345mm x 194mm
   1920x1080 59.9*+ 60.0
   1680x1050 60.0
   1400x1050 60.0
   1280x1024 59.9
   1280x960 59.9
   1152x864 60.0
   1024x768 59.9
   800x600 59.9
   640x480 59.4
   720x400 59.6
   640x400 60.0
   640x350 59.8
DP-1 disconnected (normal left inverted right x axis y axis)
DP-2 disconnected (normal left inverted right x axis y axis)
eDP-1 disconnected (normal left inverted right x axis y axis)
DP-3 connected 1920x1080+0+0 (normal left inverted right x axis y axis) 598mm x 336mm
   1920x1080 60.0*+
   1680x1050 59.9
   1280x1024 75.0 60.0
   1440x900 75.0 59.9
   1024x768 75.1 60.0
   800x600 75.0 60.3
   640x480 75.0 72.8 66.7 60.0
   720x400 70.1
VGA-1 disconnected (normal left inverted right x axis y axis)

dmesg output of a suspend/resume cycle attached.
-- next part --
[0.00] Initializing cgroup subsys cpu
[0.00] Linux version 3.17.0-rc5 (root@ortwin-hp) (gcc version 4.7.3 (Gentoo 4.7.3-r1 p1.3, pie-0.5.5) ) #2 SMP PREEMPT Thu Sep 18 12:27:26 CEST 2014
[0.00] Command line: BOOT_IMAGE=/boot/linux-3.17.0-rc5 root=/dev/sda1 rootfstype=ext4 nouveau.noaccel=1 net.ifnames=0 pcie_aspm=force
[0.00] e820: BIOS-provided physical RAM map:
[0.00] BIOS-e820: [mem 0x-0x0009fbff] usable
[0.00] BIOS-e820: [mem 0x0009fc00-0x0009] reserved
[0.00] BIOS-e820: [mem 0x000e-0x000f] reserved
[0.00] BIOS-e820: [mem 0x0010-0xbefc1fff] usable
[0.00] BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
[0.00] BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
[0.00] BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
[0.00] BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
[0.00] BIOS-e820: [mem 0xbf80-0xbfff] reserved
[0.00] BIOS-e820: [mem 0xe000-0xefff] reserved
[0.00] BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
[0.00] BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
[0.00] BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
[0.00] BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
[0.00] BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
[0.00] BIOS-e820: [mem 0xffd0-0x] reserved
[0.00] BIOS-e820: [mem 0x0001-0x0001fbff] usable
[0.00] BIOS-e820: [mem 0x0001fc00-0x0001] reserved
[0.00] BIOS-e820: [mem 0x0002-0x00023bff] usable
[0.00] NX (Execute Disable) protection: active
[0.00] SMBIOS 2.6 present.
[0.00] DMI: Hewlett-Packard HP EliteBook 8540w/1521, BIOS 68CVD Ver. F.0E 11/25/2010
[0.00] e820: update [mem 0x-0x0fff] usable ==> reserved
[0.00] e820: remove [mem 0x000a-0x000f] usable
[0.00] e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
[0.00] MTRR default type: uncachable
[0.00] MTRR fixed ranges enabled:
[0.00] 0-9 write-back
[0.00] A-B uncachable
[0.00] C-F write-protect
[0.00] MTRR variable ranges enabled:
[0.00] 0 base 0FFC0 mask FFFC0 write-protect
[0.00] 1 base 0 mask F8000 write-back
[0.00] 2 base 08000 mask FC000 write-back
[0.00] 3 base 1 mask F write-back
[0.00] 4 base 2 mask FC000 write-back
[0.00] 5 base 23C00 mask FFC00 uncachable
[0.00] 6 disabled
[0.00] 7 disabled
[0.00] x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
[0.00] e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
[0.00] Base memory trampoline at [88099000] 99000 size 24576
[0.00] init_memory_mapping: [mem 0x-0x000f]
[0.00] [mem 0x-0x000f] page 4k
[0.00] BRK [0x02371000, 0x02371fff] PGTABLE
[0.00] BRK
[BUG] 3.7-rc3: nouveau: NULL pointer dereference at nouveau_channel_new
This is a regression relative to 3.6. It occurs when starting X. I had already reported this before -rc1 and -rc2, on 10 and 16 October.

This is with noaccel=1. Without the option, the machine hangs on a black screen as soon as X starts up (which has always been like that).

Tell me if you need more information.

Ortwin
-- next part --
Oct 30 18:44:30 localhost kernel: Initializing cgroup subsys cpu
Oct 30 18:44:30 localhost kernel: Linux version 3.7.0-rc3 (root@ortwin-hp) (gcc version 4.5.4 (Gentoo 4.5.4 p1.0, pie-0.4.7) ) #2 SMP PREEMPT Tue Oct 30 18:41:04 CET 2012
Oct 30 18:44:30 localhost kernel: Command line: root=/dev/sda1 rootfstype=ext4 nouveau.noaccel=1
Oct 30 18:44:30 localhost kernel: e820: BIOS-provided physical RAM map:
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x-0x0009fbff] usable
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x0009fc00-0x0009] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x000e-0x000f] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x0010-0xbefc1fff] usable
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xbf80-0xbfff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xe000-0xefff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0xffd0-0x] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x0001-0x0001fbff] usable
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x0001fc00-0x0001] reserved
Oct 30 18:44:30 localhost kernel: BIOS-e820: [mem 0x0002-0x00023bff] usable
Oct 30 18:44:30 localhost kernel: NX (Execute Disable) protection: active
Oct 30 18:44:30 localhost kernel: DMI 2.6 present.
Oct 30 18:44:30 localhost kernel: DMI: Hewlett-Packard HP EliteBook 8540w/1521, BIOS 68CVD Ver. F.0E 11/25/2010
Oct 30 18:44:30 localhost kernel: e820: update [mem 0x-0x] usable ==> reserved
Oct 30 18:44:30 localhost kernel: e820: remove [mem 0x000a-0x000f] usable
Oct 30 18:44:30 localhost kernel: No AGP bridge found
Oct 30 18:44:30 localhost kernel: e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
Oct 30 18:44:30 localhost kernel: MTRR default type: uncachable
Oct 30 18:44:30 localhost kernel: MTRR fixed ranges enabled:
Oct 30 18:44:30 localhost kernel: 0-9 write-back
Oct 30 18:44:30 localhost kernel: A-B uncachable
Oct 30 18:44:30 localhost kernel: C-F write-protect
Oct 30 18:44:30 localhost kernel: MTRR variable ranges enabled:
Oct 30 18:44:30 localhost kernel: 0 base 0FFC0 mask FFFC0 write-protect
Oct 30 18:44:30 localhost kernel: 1 base 0 mask F8000 write-back
Oct 30 18:44:30 localhost kernel: 2 base 08000 mask FC000 write-back
Oct 30 18:44:30 localhost kernel: 3 base 1 mask F write-back
Oct 30 18:44:30 localhost kernel: 4 base 2 mask FC000 write-back
Oct 30 18:44:30 localhost kernel: 5 base 23C00 mask FFC00 uncachable
Oct 30 18:44:30 localhost kernel: 6 disabled
Oct 30 18:44:30 localhost kernel: 7 disabled
Oct 30 18:44:30 localhost kernel: x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
Oct 30 18:44:30 localhost kernel: e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
Oct 30 18:44:30 localhost kernel: initial memory mapped: [mem 0x-0x1fff]
Oct 30 18:44:30 localhost kernel: Base memory trampoline at [88099000] 99000 size 24576
Oct 30 18:44:30 localhost kernel: init_memory_mapping: [mem 0x-0xbf7f]
Oct 30 18:44:30 localhost kernel: [mem 0x-0xbf7f] page 2M
Oct 30 18:44:30 localhost kernel: kernel direct mapping tables up to 0xbf7f @ [mem 0x1fffc000-0x1fff]
Oct 30 18:44:30 localhost kernel: init_memory_mapping: [mem 0x1-0x1fbff]
Oct 30 18:44:30 localhost kernel: [mem 0x1-0x1fbff] page 2M
Oct 30 18:44:30 localhost kernel: kernel direct mapping tables up to 0x1fbff @
[BUG] 3.7-rc1: nouveau: NULL pointer dereference at nouveau_channel_new
This is a regression relative to 3.6. It occurs when starting X. I had already reported this before -rc1, but nobody listened.
-- next part --
Oct 16 18:27:04 localhost kernel: Initializing cgroup subsys cpu
Oct 16 18:27:04 localhost kernel: Linux version 3.7.0-rc1 (root@ortwin-hp) (gcc version 4.5.4 (Gentoo 4.5.4 p1.0, pie-0.4.7) ) #1 SMP PREEMPT Tue Oct 16 18:24:16 CEST 2012
Oct 16 18:27:04 localhost kernel: Command line: root=/dev/sda1 rootfstype=ext4 nouveau.noaccel=1
Oct 16 18:27:04 localhost kernel: e820: BIOS-provided physical RAM map:
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x-0x0009fbff] usable
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x0009fc00-0x0009] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x000e-0x000f] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x0010-0xbefc1fff] usable
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xbf80-0xbfff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xe000-0xefff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0xffd0-0x] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x0001-0x0001fbff] usable
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x0001fc00-0x0001] reserved
Oct 16 18:27:04 localhost kernel: BIOS-e820: [mem 0x0002-0x00023bff] usable
Oct 16 18:27:04 localhost kernel: NX (Execute Disable) protection: active
Oct 16 18:27:04 localhost kernel: DMI 2.6 present.
Oct 16 18:27:04 localhost kernel: DMI: Hewlett-Packard HP EliteBook 8540w/1521, BIOS 68CVD Ver. F.0E 11/25/2010
Oct 16 18:27:04 localhost kernel: e820: update [mem 0x-0x] usable ==> reserved
Oct 16 18:27:04 localhost kernel: e820: remove [mem 0x000a-0x000f] usable
Oct 16 18:27:04 localhost kernel: No AGP bridge found
Oct 16 18:27:04 localhost kernel: e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
Oct 16 18:27:04 localhost kernel: MTRR default type: uncachable
Oct 16 18:27:04 localhost kernel: MTRR fixed ranges enabled:
Oct 16 18:27:04 localhost kernel: 0-9 write-back
Oct 16 18:27:04 localhost kernel: A-B uncachable
Oct 16 18:27:04 localhost kernel: C-F write-protect
Oct 16 18:27:04 localhost kernel: MTRR variable ranges enabled:
Oct 16 18:27:04 localhost kernel: 0 base 0FFC0 mask FFFC0 write-protect
Oct 16 18:27:04 localhost kernel: 1 base 0 mask F8000 write-back
Oct 16 18:27:04 localhost kernel: 2 base 08000 mask FC000 write-back
Oct 16 18:27:04 localhost kernel: 3 base 1 mask F write-back
Oct 16 18:27:04 localhost kernel: 4 base 2 mask FC000 write-back
Oct 16 18:27:04 localhost kernel: 5 base 23C00 mask FFC00 uncachable
Oct 16 18:27:04 localhost kernel: 6 disabled
Oct 16 18:27:04 localhost kernel: 7 disabled
Oct 16 18:27:04 localhost kernel: x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
Oct 16 18:27:04 localhost kernel: e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
Oct 16 18:27:04 localhost kernel: initial memory mapped: [mem 0x-0x1fff]
Oct 16 18:27:04 localhost kernel: Base memory trampoline at [88099000] 99000 size 24576
Oct 16 18:27:04 localhost kernel: init_memory_mapping: [mem 0x-0xbf7f]
Oct 16 18:27:04 localhost kernel: [mem 0x-0xbf7f] page 2M
Oct 16 18:27:04 localhost kernel: kernel direct mapping tables up to 0xbf7f @ [mem 0x1fa0-0x1fff]
Oct 16 18:27:04 localhost kernel: init_memory_mapping: [mem 0x1-0x23bff]
Oct 16 18:27:04 localhost kernel: [mem 0x1-0x23bff] page 2M
Oct 16 18:27:04 localhost kernel: kernel direct mapping tables up to 0x23bff @ [mem 0xbefb8000-0xbefc1fff]
Oct 16 18:27:04 localhost kernel: ACPI: RSDP 000fddc0 00024 (v02 HPQOEM)
Oct 16 18:27:04 localhost kernel: ACPI: XSDT bf7fe120 00094 (v01 HPQOEM SLIC-MPC
[BUG] drm/nouveau: NULL pointer dereference in nouveau_channel_new()
Hi,

Current nouveau code in Linus' tree oopses with noaccel. Didn't happen in 3.6. Full dmesg attached.

Ortwin

BUG: unable to handle kernel NULL pointer dereference at 0018
Oct 10 18:05:49 localhost kernel: IP: [] nouveau_channel_new+0x41e/0x670
Oct 10 18:05:49 localhost kernel: PGD 22ddd7067 PUD 22f76f067 PMD 0
Oct 10 18:05:49 localhost kernel: Oops: [#1] PREEMPT SMP
Oct 10 18:05:49 localhost kernel: Modules linked in:
Oct 10 18:05:49 localhost kernel: CPU 0
Oct 10 18:05:49 localhost kernel: Pid: 2723, comm: X Not tainted 3.7.0-pre #1 Hewlett-Packard HP EliteBook 8540w/1521
Oct 10 18:05:49 localhost kernel: RIP: 0010:[] [] nouveau_channel_new+0x41e/0x670
Oct 10 18:05:49 localhost kernel: RSP: 0018:88023026bc68 EFLAGS: 00010202
Oct 10 18:05:49 localhost kernel: RAX: RBX: 88022ceeca80 RCX: 0088
Oct 10 18:05:49 localhost kernel: RDX: c900127e408c RSI: c900127e408c RDI: 88022ceeca80
Oct 10 18:05:49 localhost kernel: RBP: 88023026bce8 R08: R09:
Oct 10 18:05:49 localhost kernel: R10: 0010 R11: 000f R12:
Oct 10 18:05:49 localhost kernel: R13: 880231f36800 R14: 88022f68ca00 R15: 88022f68ca00
Oct 10 18:05:49 localhost kernel: FS: 7f129a597880() GS:88023bc0() knlGS:
Oct 10 18:05:49 localhost kernel: CS: 0010 DS: ES: CR0: 80050033
Oct 10 18:05:49 localhost kernel: CR2: 0018 CR3: 0002300a1000 CR4: 07f0
Oct 10 18:05:49 localhost kernel: DR0: DR1: DR2:
Oct 10 18:05:49 localhost kernel: DR3: DR6: 0ff0 DR7: 0400
Oct 10 18:05:49 localhost kernel: Process X (pid: 2723, threadinfo 88023026a000, task 880231edbde0)
Oct 10 18:05:49 localhost kernel: Stack:
Oct 10 18:05:49 localhost kernel: 88023026bcb0 88022f68ca00 88022f68caa8 8802311be9c0
Oct 10 18:05:49 localhost kernel: d801d802 88023026bc98 2000
Oct 10 18:05:49 localhost kernel: 00ff 880231f9afa0 8802323c0840
Oct 10 18:05:49 localhost kernel: Call Trace:
Oct 10 18:05:49 localhost kernel: [] nouveau_abi16_ioctl_channel_alloc+0x15c/0x360
Oct 10 18:05:49 localhost kernel: [] drm_ioctl+0x2a0/0x4f0
Oct 10 18:05:49 localhost kernel: [] ? nouveau_abi16_ioctl_setparam+0x10/0x10
Oct 10 18:05:49 localhost kernel: [] ? __do_page_fault+0x1c4/0x490
Oct 10 18:05:49 localhost kernel: [] do_vfs_ioctl+0x97/0x4f0
Oct 10 18:05:49 localhost kernel: [] sys_ioctl+0x57/0x90
Oct 10 18:05:49 localhost kernel: [] system_call_fastpath+0x16/0x1b
Oct 10 18:05:49 localhost kernel: Code: e2 02 48 89 df e8 d3 02 00 00 8b 83 80 00 00 00 89 83 84 00 00 00 0f 1f 80 00 00 00 00 48 8b 43 08 48 89 df 48 8b 80 f8 06 00 00 50 18 41 89 c4 85 c0 0f 84 65 fd ff ff 0f 1f 40 00 4c 89 ff
Oct 10 18:05:49 localhost kernel: RIP [] nouveau_channel_new+0x41e/0x670
Oct 10 18:05:49 localhost kernel: RSP
Oct 10 18:05:49 localhost kernel: CR2: 0018
Oct 10 18:05:49 localhost kernel: ---[ end trace e23ae1067c361c88 ]---
-- next part --
Oct 10 18:05:42 localhost kernel: Initializing cgroup subsys cpu
Oct 10 18:05:42 localhost kernel: Linux version 3.7.0-pre (root@localhost) (gcc version 4.5.4 (Gentoo 4.5.4 p1.0, pie-0.4.7) ) #1 SMP PREEMPT Wed Oct 10 17:35:48 CEST 2012
Oct 10 18:05:42 localhost kernel: Command line: root=/dev/sda1 rootfstype=ext4 nouveau.noaccel=1
Oct 10 18:05:42 localhost kernel: e820: BIOS-provided physical RAM map:
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0x-0x0009fbff] usable
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0x0009fc00-0x0009] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0x000e-0x000f] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0x0010-0xbefc1fff] usable
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xbf80-0xbfff] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xe000-0xefff] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
Oct 10 18:05:42 localhost kernel: BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
Oct 10 18:05:42 localhost kernel:
[PATCH] drm/nouveau: fix crash regression
Work around a crash during boot if noaccel is set.

NB: still broken in 3.5 as well, used to work in 3.4.
Why are people ignoring this? It's a regression!

Signed-off-by: Ortwin Glück
---
diff --git a/drivers/gpu/drm/nouveau/nv50_display.c b/drivers/gpu/drm/nouveau/nv50_display.c
index b244d99..c7ffa63 100644
--- a/drivers/gpu/drm/nouveau/nv50_display.c
+++ b/drivers/gpu/drm/nouveau/nv50_display.c
@@ -650,6 +650,12 @@ nv50_display_vblank_crtc_handler(struct drm_device *dev, int crtc)
 	struct nouveau_software_priv *psw = nv_engine(dev, NVOBJ_ENGINE_SW);
 	struct nouveau_software_chan *pch, *tmp;
 
+	if (!psw) {
+		WARN_ON_ONCE(1);
+		printk(KERN_ERR "NULL software engine\n");
+		return;
+	}
+
 	list_for_each_entry_safe(pch, tmp, &psw->vblank, vblank.list) {
 		if (pch->vblank.head != crtc)
 			continue;
drm/nouveau: Work around a crash during boot if noaccel is set.
NB: still broken in 3.5 as well.

Signed-off-by: Ortwin Glück
---
diff --git a/drivers/gpu/drm/nouveau/nv50_display.c b/drivers/gpu/drm/nouveau/nv50_display.c
index b244d99..c7ffa63 100644
--- a/drivers/gpu/drm/nouveau/nv50_display.c
+++ b/drivers/gpu/drm/nouveau/nv50_display.c
@@ -650,6 +650,12 @@ nv50_display_vblank_crtc_handler(struct drm_device *dev, int crtc)
 	struct nouveau_software_priv *psw = nv_engine(dev, NVOBJ_ENGINE_SW);
 	struct nouveau_software_chan *pch, *tmp;
 
+	if (!psw) {
+		WARN_ON_ONCE(1);
+		printk(KERN_ERR "NULL software engine\n");
+		return;
+	}
+
 	list_for_each_entry_safe(pch, tmp, &psw->vblank, vblank.list) {
 		if (pch->vblank.head != crtc)
 			continue;
drm/nouveau: crash regression in 3.5
I have managed to turn the crash into a WARN_ON by adding this to the beginning of nouveau_software_vblank():

    if (!psw) {
        WARN_ON(1);
        return;
    }

I have also managed to load the module manually instead of via udev, so I am happy to attach a full log of what's going on here. See also my added printk's starting with XXX, which mark some interesting points in the initialization. This should give you enough information to track down the cause of the problem.

To my non-expert eyes it looks like "noaccel" prevents registration of NVOBJ_ENGINE_SW, or at least delays it too much.

Thanks,
Ortwin

Aug 2 13:03:22 localhost kernel: Linux version 3.5.0 (root@ortwin-hp) (gcc version 4.5.3 (Gentoo 4.5.3-r2 p1.1, pie-0.4.7) ) #5 SMP PREEMPT Thu Aug 2 13:01:46 CEST 2012
Aug 2 13:03:22 localhost kernel: Command line: root=/dev/sda5 rootfstype=ext4 pciehp_force=1 nouveau.modeset=1 nouveau.noaccel=1 netconsole=@10.11.1.234/eth0,@10.11.1.19/00:17:f2:c7:5f:06 drm.debug=15 debug nox
Aug 2 13:03:22 localhost kernel: e820: BIOS-provided physical RAM map:
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x-0x0009fbff] usable
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x0009fc00-0x0009] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x000e-0x000f] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x0010-0xbefc1fff] usable
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xbf80-0xbfff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xe000-0xefff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0xffd0-0x] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x0001-0x0001fbff] usable
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x0001fc00-0x0001] reserved
Aug 2 13:03:22 localhost kernel: BIOS-e820: [mem 0x0002-0x00023bff] usable
Aug 2 13:03:22 localhost kernel: NX (Execute Disable) protection: active
Aug 2 13:03:22 localhost kernel: DMI 2.6 present.
Aug 2 13:03:22 localhost kernel: DMI: Hewlett-Packard HP EliteBook 8540w/1521, BIOS 68CVD Ver. F.0E 11/25/2010
Aug 2 13:03:22 localhost kernel: e820: update [mem 0x-0x] usable ==> reserved
Aug 2 13:03:22 localhost kernel: e820: remove [mem 0x000a-0x000f] usable
Aug 2 13:03:22 localhost kernel: No AGP bridge found
Aug 2 13:03:22 localhost kernel: e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
Aug 2 13:03:22 localhost start-stop-daemon: pam_unix(start-stop-daemon:session): session opened for user nobody by (uid=0)
Aug 2 13:03:22 localhost kernel: MTRR default type: uncachable
Aug 2 13:03:22 localhost kernel: MTRR fixed ranges enabled:
Aug 2 13:03:22 localhost kernel: 0-9 write-back
Aug 2 13:03:22 localhost kernel: A-B uncachable
Aug 2 13:03:22 localhost kernel: C-F write-protect
Aug 2 13:03:22 localhost kernel: MTRR variable ranges enabled:
Aug 2 13:03:22 localhost kernel: 0 base 0FFC0 mask FFFC0 write-protect
Aug 2 13:03:22 localhost kernel: 1 base 0 mask F8000 write-back
Aug 2 13:03:22 localhost kernel: 2 base 08000 mask FC000 write-back
Aug 2 13:03:22 localhost kernel: 3 base 1 mask F write-back
Aug 2 13:03:22 localhost kernel: 4 base 2 mask FC000 write-back
Aug 2 13:03:22 localhost kernel: 5 base 23C00 mask FFC00 uncachable
Aug 2 13:03:22 localhost kernel: 6 disabled
Aug 2 13:03:22 localhost kernel: 7 disabled
Aug 2 13:03:22 localhost kernel: x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
Aug 2 13:03:22 localhost kernel: e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
Aug 2 13:03:22 localhost kernel: initial memory mapped: [mem 0x-0x1fff]
Aug 2 13:03:22 localhost kernel: Base memory trampoline at [88099000] 99000
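The guard quoted in this message can be tried outside the kernel. What follows is a minimal user-space sketch, not the nouveau code: `struct sw_engine`, `sw_vblank()` and `warn_count` are hypothetical stand-ins, and the `fprintf` call only imitates what WARN_ON(1) would report.

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical stand-in for the per-device software engine state. */
struct sw_engine {
    int vblank_events;   /* number of vblank events handled so far */
};

/* Counts how often the guard fired; a stand-in for the kernel's WARN_ON(). */
static int warn_count;

/*
 * Hypothetical vblank handler. `psw` may be NULL when the software engine
 * was never registered (for example with acceleration disabled).
 * Returns 0 if the event was handled, -1 if it was dropped.
 */
static int sw_vblank(struct sw_engine *psw)
{
    if (!psw) {
        /* Report and bail out instead of dereferencing NULL. */
        warn_count++;
        fprintf(stderr, "warning: vblank without software engine\n");
        return -1;
    }
    psw->vblank_events++;
    return 0;
}
```

Such a guard only hides the symptom: the interrupt is dropped, but the engine is still missing, which matches the observation that the crash turns into a warning.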
drm/nouveau: crash regression in 3.5
On 30.07.2012 19:16, Marcin Slusarz wrote:
> Are you sure you boot the correct kernel? I'm asking because your panic says
> its version is "3.5.0 #3" - exactly the same as in previous crash log.

I am using the correct kernel, don't worry. (.version is not necessarily incremented on each build.)

I am still seeing the same crash with all your patches, but only with "noaccel=1". I have added some printk's (starting with XXX), but cannot obtain them via netconsole. I think netconsole may have been disabled and replaced with the real console at that point already? When I boot with noaccel I can see the printk's in the log on disk. That log is attached.

I am sorry that I cannot provide better information.

Thanks,
Ortwin

Aug 1 17:34:31 localhost kernel: console [netcon0] enabled
Aug 1 17:34:31 localhost kernel: netconsole: network logging started
Aug 1 17:34:31 localhost kernel: ALSA device list:
Aug 1 17:34:31 localhost kernel: #0: HDA Intel at 0xd752 irq 54
Aug 1 17:34:31 localhost kernel: #1: HDA NVidia at 0xd300 irq 17
Aug 1 17:34:31 localhost kernel: hub 2-0:1.0: hub_suspend
Aug 1 17:34:31 localhost kernel: usb usb2: bus auto-suspend, wakeup 1
Aug 1 17:34:31 localhost kernel: ehci_hcd :00:1d.0: suspend root hub
Aug 1 17:34:31 localhost kernel: EXT4-fs (sda5): mounted filesystem with ordered data mode. Opts: (null)
Aug 1 17:34:31 localhost kernel: VFS: Mounted root (ext4 filesystem) readonly on device 8:5.
Aug 1 17:34:31 localhost kernel: devtmpfs: mounted
Aug 1 17:34:31 localhost kernel: Freeing unused kernel memory: 540k freed
Aug 1 17:34:31 localhost kernel: Write protecting the kernel read-only data: 12288k
Aug 1 17:34:31 localhost kernel: Freeing unused kernel memory: 476k freed
Aug 1 17:34:31 localhost kernel: Freeing unused kernel memory: 1052k freed
Aug 1 17:34:31 localhost udevd[1411]: starting version 171
Aug 1 17:34:31 localhost kernel: usb 1-1.5: usb auto-resume
Aug 1 17:34:31 localhost kernel: usb 1-1.5: finish resume
Aug 1 17:34:31 localhost kernel: usb 1-1.5: disable remote wakeup, status -32
Aug 1 17:34:31 localhost kernel: usb 1-1.5: link qh4-0001/8802288bb080 start 3 [1/0 us]
Aug 1 17:34:31 localhost kernel: usb 1-1.5: unlink qh4-0001/8802288bb080 start 3 [1/0 us]
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: loaded firmware version 9.221.4.1 build 25532
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: CONFIG_IWLWIFI_DEBUG disabled
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: CONFIG_IWLWIFI_DEBUGFS disabled
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: CONFIG_IWLWIFI_DEVICE_TRACING disabled
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: CONFIG_IWLWIFI_DEVICE_TESTMODE disabled
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: CONFIG_IWLWIFI_P2P disabled
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: Detected Intel(R) Centrino(R) Ultimate-N 6300 AGN, REV=0x74
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: L1 Disabled; Enabling L0S
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: device EEPROM VER=0x436, CALIB=0x6
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: Device SKU: 0x1F0
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: Valid Tx ant: 0x7, Valid Rx ant: 0x7
Aug 1 17:34:31 localhost kernel: iwlwifi :44:00.0: Tunable channels: 13 802.11bg, 24 802.11a channels
Aug 1 17:34:31 localhost kernel: Registered led device: phy0-led
Aug 1 17:34:31 localhost kernel: ieee80211 phy0: Selected rate control algorithm 'iwl-agn-rs'
Aug 1 17:34:31 localhost kernel: pcmcia_socket pcmcia_socket0: cs: memory probe 0x0c-0x0f:
Aug 1 17:34:31 localhost kernel: excluding 0xc-0xf
Aug 1 17:34:31 localhost kernel: pcmcia_socket pcmcia_socket0: cs: memory probe 0xa000-0xa0ff:
Aug 1 17:34:31 localhost kernel: excluding 0xa000-0xa0ff
Aug 1 17:34:31 localhost kernel: pcmcia_socket pcmcia_socket0: cs: memory probe 0x6000-0x60ff:
Aug 1 17:34:31 localhost kernel: excluding 0x6000-0x60ff
Aug 1 17:34:31 localhost kernel: MXM: GUID detected in BIOS
Aug 1 17:34:31 localhost kernel: [drm:drm_pci_init],
Aug 1 17:34:31 localhost kernel: [drm:drm_get_pci_dev],
Aug 1 17:34:31 localhost kernel: [drm:drm_get_minor],
Aug 1 17:34:31 localhost kernel: [drm:drm_get_minor], new minor assigned 64
Aug 1 17:34:31 localhost kernel: [drm:drm_get_minor],
Aug 1 17:34:31 localhost kernel: [drm:drm_get_minor], new minor assigned 0
Aug 1 17:34:31 localhost kernel: [drm] nouveau :01:00.0: nouveau_load:1045 - vendor: 0x10DE device: 0xCBC class: 0x3
Aug 1 17:34:31 localhost kernel: [drm] nouveau :01:00.0: Detected an NV50 generation card (0x0a3e00a2)
Aug 1 17:34:31 localhost NetworkManager[2346]:SCPlugin-Ifnet: Initializing!
Aug 1 17:34:31 localhost NetworkManager[2346]:SCPlugin-Ifnet: management mode: managed
Aug 1 17:34:31 localhost
drm/nouveau: crash regression in 3.5
Yes, as far as I can tell. I didn't do anything different this time. The date on the kernel file looks ok. Just did a fresh make && make install again, and got the same behaviour.

When is that number after the hash sign upped?

Marcin Slusarz wrote:
> Are you sure you boot the correct kernel? I'm asking because your panic
> says its version is "3.5.0 #3" - exactly the same as in previous crash log.
>
> Marcin
drm/nouveau: crash regression in 3.5
On 29.07.2012 22:15, Marcin Slusarz wrote:
> No, the real problem is: with "noaccel" we don't register "software engine",
> but vblank ISR relies on its existence and happily dereferences NULL pointer.
>
> Now, this patch should fix it for real...

Unfortunately I am still seeing the crash. Without "noaccel" it works though (until X crashes the machine, but that is a different thing).

Thanks,
Ortwin

Initializing cgroup subsys cpu
Linux version 3.5.0 (root@ortwin-hp) (gcc version 4.5.3 (Gentoo 4.5.3-r2 p1.1, pie-0.4.7) ) #3 SMP PREEMPT Thu Jul 26 14:42:43 CEST 2012
Command line: root=/dev/sda5 rootfstype=ext4 pciehp_force=1 nouveau.modeset=1 nouveau.noaccel=1 netconsole=@10.11.1.234/eth0,@10.11.1.19/00:17:f2:c7:5f:06 drm.debug=15
e820: BIOS-provided physical RAM map:
BIOS-e820: [mem 0x-0x0009fbff] usable
BIOS-e820: [mem 0x0009fc00-0x0009] reserved
BIOS-e820: [mem 0x000e-0x000f] reserved
BIOS-e820: [mem 0x0010-0xbefc1fff] usable
BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
BIOS-e820: [mem 0xbf80-0xbfff] reserved
BIOS-e820: [mem 0xe000-0xefff] reserved
BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
BIOS-e820: [mem 0xffd0-0x] reserved
BIOS-e820: [mem 0x0001-0x0001fbff] usable
BIOS-e820: [mem 0x0001fc00-0x0001] reserved
BIOS-e820: [mem 0x0002-0x00023bff] usable
NX (Execute Disable) protection: active
DMI 2.6 present.
No AGP bridge found
e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
init_memory_mapping: [mem 0x-0xbf7f]
init_memory_mapping: [mem 0x1-0x23bff]
ACPI: RSDP 000fddc0 00024 (v02 HPQOEM)
ACPI: XSDT bf7fe120 00094 (v01 HPQOEM SLIC-MPC 000F 0113)
ACPI: FACP bf7fc000 000F4 (v03 HPQOEM 1521 000F HP 0001)
ACPI: DSDT bf7da000 1C77F (v02 HPQOEM 1521 0001 INTL 20060912)
ACPI: FACS bf76 00040
ACPI: HPET bf7fb000 00038 (v01 HPQOEM 1521 0001 HP 0001)
ACPI: APIC bf7fa000 000BC (v01 HPQOEM 1521 0001 HP 0001)
ACPI: MCFG bf7f9000 0003C (v01 HPQOEM 1521 0001 HP 0001)
ACPI: TCPA bf7f7000 00032 (v02 HPQOEM 1521 HP 0001)
ACPI: SSDT bf7d7000 00135 (v01 HPQOEM SataAhci 1000 INTL 20060912)
ACPI: SSDT bf7d6000 00314 (v01 HPQOEM PtidDevc 1000 INTL 20060912)
ACPI: SLIC bf7d5000 00176 (v01 HPQOEM SLIC-MPC 0001 HP 0001)
ACPI: SSDT bf7d1000 02576 (v01 HPQOEM NVIDIAGF 0001 INTL 20060912)
ACPI: DMAR bf7d 00080 (v01 INTEL CP_DALE 0001 INTL 0001)
ACPI: SSDT bf7cf000 00A10 (v01 PmRefCpuPm 3000 INTL 20060912)
ACPI: SSDT bf7ce000 00288 (v01 PmRef Cpu0Tst 3000 INTL 20060912)
ACPI: SSDT bf7cd000 00225 (v01 PmRefApTst 3000 INTL 20060912)
ACPI: ASF! bf7f8000 000A0 (v32 HPQOEM 1521 0001 HP 0001)
Zone ranges:
DMA [mem 0x0001-0x00ff]
DMA32 [mem 0x0100-0x]
Normal [mem 0x1-0x23bff]
Movable zone start for each node
Early memory node ranges
node 0: [mem 0x0001-0x0009efff]
node 0: [mem 0x0010-0xbefc1fff]
node 0: [mem 0xbf7ff000-0xbf7f]
node 0: [mem 0x1-0x1fbff]
node 0: [mem 0x2-0x23bff]
ACPI: PM-Timer IO Port: 0x408
ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] enabled)
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x05] enabled)
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x05] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x06] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x07] lapic_id[0x00] disabled)
ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x02] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x03] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x04] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x05] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x06] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x07] high edge lint[0x1])
ACPI:
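The failure mode Marcin describes (an engine that is never registered when acceleration is off, and interrupt code that looks it up anyway) can be sketched in user-space C. All names below (`struct device`, `device_init`, `device_engine`, the `ENGINE_*` ids) are hypothetical and do not mirror nouveau's real data structures; the sketch simply assumes that `noaccel` skips registration entirely.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical engine ids; ENGINE_SW plays the role of the software engine. */
enum engine_id { ENGINE_GR, ENGINE_SW, ENGINE_COUNT };

struct engine {
    const char *name;
};

struct device {
    struct engine *engines[ENGINE_COUNT];  /* NULL means "not registered" */
};

static struct engine gr_engine = { "graphics" };
static struct engine sw_engine = { "software" };

/* Register engines; with noaccel set, no engine is registered at all. */
static void device_init(struct device *dev, bool noaccel)
{
    for (int i = 0; i < ENGINE_COUNT; i++)
        dev->engines[i] = NULL;
    if (noaccel)
        return;
    dev->engines[ENGINE_GR] = &gr_engine;
    dev->engines[ENGINE_SW] = &sw_engine;
}

/* Lookup used by interrupt code; callers must handle a NULL result. */
static struct engine *device_engine(struct device *dev, enum engine_id id)
{
    return dev->engines[id];
}
```

With `noaccel` set, `device_engine(&dev, ENGINE_SW)` returns NULL, so any caller that dereferences the result without checking would fault in the way the quoted explanation describes.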
drm/nouveau: crash regression in 3.5
On 25.07.2012 20:42, Marcin Slusarz wrote:
> Good, below patch should fix this panic.
>
> Note that you can hit an oops in drm_handle_vblank because patch from
> http://lists.freedesktop.org/archives/dri-devel/2012-May/023498.html
> has not been applied (yet?).

After applying your patch, it still crashes, although with a slightly different stack trace. I then also applied the second patch, but that doesn't make any difference. New log attached.

Looks like interrupt occurs before nouveau_software_context_new() is called? Shouldn't the initialization be done from nouveau_irq_preinstall() so it is available when the irq occurs?

Again, I am not an expert here. Just guessing...

Thanks,
Ortwin

Initializing cgroup subsys cpu
Linux version 3.5.0 (root@ortwin-hp) (gcc version 4.5.3 (Gentoo 4.5.3-r2 p1.1, pie-0.4.7) ) #3 SMP PREEMPT Thu Jul 26 14:42:43 CEST 2012
Command line: root=/dev/sda5 rootfstype=ext4 pciehp_force=1 nouveau.modeset=1 nouveau.noaccel=1 netconsole=@10.11.1.234/eth0,@10.11.1.19/00:1a:64:89:71:b8 drm.debug=1
e820: BIOS-provided physical RAM map:
BIOS-e820: [mem 0x-0x0009fbff] usable
BIOS-e820: [mem 0x0009fc00-0x0009] reserved
BIOS-e820: [mem 0x000e-0x000f] reserved
BIOS-e820: [mem 0x0010-0xbefc1fff] usable
BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
BIOS-e820: [mem 0xbf80-0xbfff] reserved
BIOS-e820: [mem 0xe000-0xefff] reserved
BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
BIOS-e820: [mem 0xffd0-0x] reserved
BIOS-e820: [mem 0x0001-0x0001fbff] usable
BIOS-e820: [mem 0x0001fc00-0x0001] reserved
BIOS-e820: [mem 0x0002-0x00023bff] usable
NX (Execute Disable) protection: active
DMI 2.6 present.
No AGP bridge found
e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
init_memory_mapping: [mem 0x-0xbf7f]
init_memory_mapping: [mem 0x1-0x23bff]
ACPI: RSDP 000fddc0 00024 (v02 HPQOEM)
ACPI: XSDT bf7fe120 00094 (v01 HPQOEM SLIC-MPC 000F 0113)
ACPI: FACP bf7fc000 000F4 (v03 HPQOEM 1521 000F HP 0001)
ACPI: DSDT bf7da000 1C77F (v02 HPQOEM 1521 0001 INTL 20060912)
ACPI: FACS bf76 00040
ACPI: HPET bf7fb000 00038 (v01 HPQOEM 1521 0001 HP 0001)
ACPI: APIC bf7fa000 000BC (v01 HPQOEM 1521 0001 HP 0001)
ACPI: MCFG bf7f9000 0003C (v01 HPQOEM 1521 0001 HP 0001)
ACPI: TCPA bf7f7000 00032 (v02 HPQOEM 1521 HP 0001)
ACPI: SSDT bf7d7000 00135 (v01 HPQOEM SataAhci 1000 INTL 20060912)
ACPI: SSDT bf7d6000 00314 (v01 HPQOEM PtidDevc 1000 INTL 20060912)
ACPI: SLIC bf7d5000 00176 (v01 HPQOEM SLIC-MPC 0001 HP 0001)
ACPI: SSDT bf7d1000 02576 (v01 HPQOEM NVIDIAGF 0001 INTL 20060912)
ACPI: DMAR bf7d 00080 (v01 INTEL CP_DALE 0001 INTL 0001)
ACPI: SSDT bf7cf000 00A10 (v01 PmRefCpuPm 3000 INTL 20060912)
ACPI: SSDT bf7ce000 00288 (v01 PmRef Cpu0Tst 3000 INTL 20060912)
ACPI: SSDT bf7cd000 00225 (v01 PmRefApTst 3000 INTL 20060912)
ACPI: ASF! bf7f8000 000A0 (v32 HPQOEM 1521 0001 HP 0001)
Zone ranges:
DMA [mem 0x0001-0x00ff]
DMA32 [mem 0x0100-0x]
Normal [mem 0x1-0x23bff]
Movable zone start for each node
Early memory node ranges
node 0: [mem 0x0001-0x0009efff]
node 0: [mem 0x0010-0xbefc1fff]
node 0: [mem 0xbf7ff000-0xbf7f]
node 0: [mem 0x1-0x1fbff]
node 0: [mem 0x2-0x23bff]
ACPI: PM-Timer IO Port: 0x408
ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] enabled)
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x05] enabled)
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x05] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x06] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x07] lapic_id[0x00] disabled)
ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1])
ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1])
ACPI: LAPIC_NMI
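The ordering question raised in this message (should the state the handler depends on be set up before the interrupt can fire?) can be illustrated with a small user-space sketch. `irq_preinstall`, `irq_install` and `irq_handler` below are hypothetical functions that only borrow the naming idea; whether nouveau_irq_preinstall() is the right place in the real driver is not established by this thread.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical state the interrupt handler needs. */
struct sw_state {
    int vblank_events;
};

struct device {
    struct sw_state *sw;   /* NULL until the state has been created */
    bool irq_enabled;      /* true once interrupts may be delivered */
};

static struct sw_state sw_storage;

/* Hypothetical pre-install hook: prepare everything the handler touches. */
static void irq_preinstall(struct device *dev)
{
    sw_storage.vblank_events = 0;
    dev->sw = &sw_storage;
}

/* Enable delivery; meant to be called only after irq_preinstall(). */
static void irq_install(struct device *dev)
{
    dev->irq_enabled = true;
}

/* Handler: returns 0 when handled, -1 when it fired before state existed. */
static int irq_handler(struct device *dev)
{
    if (!dev->irq_enabled || !dev->sw)
        return -1;
    dev->sw->vblank_events++;
    return 0;
}
```

Calling `irq_handler()` before `irq_preinstall()` returns -1 in this sketch; in a driver without such a check, the same early interrupt would touch state that does not exist yet.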
drm/nouveau: crash regression in 3.5
> Does it work if you boot without X and modprobe nouveau manually? If it does,
> can you disable page flipping in xorg.conf (Option "PageFlip" "0" in nouveau
> device section) and recheck with X?

It happens long before X, when the nouveau module is loaded.

> Does it work if you disable acceleration (nouveau.noaccel=1 in kernel command
> line)?

nouveau.noaccel=1 is already on my cmdline as running X with accel enabled never worked anyway.

> Is there anything saved in /var/log/ from previous boot? Can you ssh
> into and check dmesg? Can you use netconsole and catch full log?

Thanks for the netconsole tip. I have attached the log.
I am not an expert but it looks like a crash in the inlined nouveau_software_vblank(). Is the vblank.list already initialized at this point?

Thanks,
Ortwin

-- next part --
Initializing cgroup subsys cpu
Linux version 3.5.0 (root at ortwin-hp) (gcc version 4.5.3 (Gentoo 4.5.3-r2 p1.1, pie-0.4.7) ) #2 SMP PREEMPT Wed Jul 25 09:39:45 CEST 2012
Command line: root=/dev/sda5 rootfstype=ext4 pciehp_force=1 nouveau.modeset=1 nouveau.noaccel=1 netconsole= at 10.11.1.234/eth0, at 10.11.1.19/00:1a:64:89:71:b8
e820: BIOS-provided physical RAM map:
BIOS-e820: [mem 0x-0x0009fbff] usable
BIOS-e820: [mem 0x0009fc00-0x0009] reserved
BIOS-e820: [mem 0x000e-0x000f] reserved
BIOS-e820: [mem 0x0010-0xbefc1fff] usable
BIOS-e820: [mem 0xbefc2000-0xbf6c1fff] reserved
BIOS-e820: [mem 0xbf6c2000-0xbf7c1fff] ACPI NVS
BIOS-e820: [mem 0xbf7c2000-0xbf7fefff] ACPI data
BIOS-e820: [mem 0xbf7ff000-0xbf7f] usable
BIOS-e820: [mem 0xbf80-0xbfff] reserved
BIOS-e820: [mem 0xe000-0xefff] reserved
BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
BIOS-e820: [mem 0xfed1-0xfed13fff] reserved
BIOS-e820: [mem 0xfed19000-0xfed19fff] reserved
BIOS-e820: [mem 0xfed1b000-0xfed1] reserved
BIOS-e820: [mem 0xfee0-0xfee00fff] reserved
BIOS-e820: [mem 0xffd0-0x] reserved
BIOS-e820: [mem 0x0001-0x0001fbff] usable
BIOS-e820: [mem 0x0001fc00-0x0001] reserved
BIOS-e820: [mem 0x0002-0x00023bff] usable
NX (Execute Disable) protection: active
DMI 2.6 present.
No AGP bridge found
e820: last_pfn = 0x23c000 max_arch_pfn = 0x4
x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
e820: last_pfn = 0xbf800 max_arch_pfn = 0x4
init_memory_mapping: [mem 0x-0xbf7f]
init_memory_mapping: [mem 0x1-0x23bff]
ACPI: RSDP 000fddc0 00024 (v02 HPQOEM)
ACPI: XSDT bf7fe120 00094 (v01 HPQOEM SLIC-MPC 000F 0113)
ACPI: FACP bf7fc000 000F4 (v03 HPQOEM 1521 000F HP 0001)
ACPI: DSDT bf7da000 1C77F (v02 HPQOEM 1521 0001 INTL 20060912)
ACPI: FACS bf76 00040
ACPI: HPET bf7fb000 00038 (v01 HPQOEM 1521 0001 HP 0001)
ACPI: APIC bf7fa000 000BC (v01 HPQOEM 1521 0001 HP 0001)
ACPI: MCFG bf7f9000 0003C (v01 HPQOEM 1521 0001 HP 0001)
ACPI: TCPA bf7f7000 00032 (v02 HPQOEM 1521 HP 0001)
ACPI: SSDT bf7d7000 00135 (v01 HPQOEM SataAhci 1000 INTL 20060912)
ACPI: SSDT bf7d6000 00314 (v01 HPQOEM PtidDevc 1000 INTL 20060912)
ACPI: SLIC bf7d5000 00176 (v01 HPQOEM SLIC-MPC 0001 HP 0001)
ACPI: SSDT bf7d1000 02576 (v01 HPQOEM NVIDIAGF 0001 INTL 20060912)
ACPI: DMAR bf7d 00080 (v01 INTEL CP_DALE 0001 INTL 0001)
ACPI: SSDT bf7cf000 00A10 (v01 PmRefCpuPm 3000 INTL 20060912)
ACPI: SSDT bf7ce000 00288 (v01 PmRef Cpu0Tst 3000 INTL 20060912)
ACPI: SSDT bf7cd000 00225 (v01 PmRefApTst 3000 INTL 20060912)
ACPI: ASF! bf7f8000 000A0 (v32 HPQOEM 1521 0001 HP 0001)
Zone ranges:
DMA [mem 0x0001-0x00ff]
DMA32[mem 0x0100-0x]
Normal [mem 0x1-0x23bff]
Movable zone start for each node
Early memory node ranges
node 0: [mem 0x0001-0x0009efff]
node 0: [mem 0x0010-0xbefc1fff]
node 0: [mem 0xbf7ff000-0xbf7f]
node 0: [mem 0x1-0x1fbff]
node 0: [mem 0x2-0x23bff]
ACPI: PM-Timer IO Port: 0x408
ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
ACPI: LAPIC (acpi_id[0x02] lapic_id[0x04] enabled)
ACPI: LAPIC (acpi_id[0x03] lapic_id[0x05] enabled)
ACPI: LAPIC (acpi_id[0x04] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x05] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x06] lapic_id[0x00] disabled)
ACPI: LAPIC (acpi_id[0x07] lapic_id[0x00] disabled)
ACPI: LAPIC_NMI (acpi_id[0x00] high
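
On the question of whether vblank.list is initialized at that point: the sketch below is not the nouveau code, only a userspace illustration of why the question matters. It assumes a stand-in for the kernel's struct list_head. A head that was zero-filled but never initialized has NULL next/prev pointers, so walking it dereferences NULL, while an initialized empty head points at itself and the walk ends immediately. init_list_head() mirrors the idea of the kernel's INIT_LIST_HEAD(); list_count_checked() is a hypothetical helper written for this example. Whether this is what actually happens in nouveau_software_vblank() is not confirmed by the log.

```c
#include <assert.h>
#include <stddef.h>

/* Userspace stand-in for the kernel's struct list_head. */
struct list_head {
    struct list_head *next, *prev;
};

/* Same idea as INIT_LIST_HEAD(): an empty list points at itself. */
static void init_list_head(struct list_head *head)
{
    head->next = head;
    head->prev = head;
}

/* Insert entry just before head, i.e. at the tail of the list. */
static void list_add_tail(struct list_head *entry, struct list_head *head)
{
    assert(head->prev != NULL); /* head must have been initialized */
    entry->next = head;
    entry->prev = head->prev;
    head->prev->next = entry;
    head->prev = entry;
}

/*
 * Hypothetical helper: count the entries on the list. Returns -1 instead
 * of crashing when the head was never initialized (next is still NULL).
 * An unchecked walk would dereference that NULL pointer on its first step.
 */
static int list_count_checked(const struct list_head *head)
{
    const struct list_head *pos;
    int n = 0;

    if (head->next == NULL)
        return -1;
    for (pos = head->next; pos != head; pos = pos->next)
        n++;
    return n;
}
```

With a head declared as `struct list_head h = { NULL, NULL };` the helper reports -1; after `init_list_head(&h)` it reports 0, and entries can then be added safely with list_add_tail().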
drm/nouveau: crash regression in 3.5
On 24.07.2012 19:00, Marcin Slusarz wrote:
> Please post the crash log.

Sorry, I was not precise: it boots until drm performs modesetting (so it seems). The screen goes black and the machine is dead. So there is nothing I could post here, unfortunately.

This is a video of 3.5 booting:
http://www.odi.ch/VIDEO0010.3gp

Don't worry about the external monitor: same behaviour with the built-in panel.

Thanks,
Ortwin
drm/nouveau: crash regression in 3.5
Hi,

My HP Elitebook 8540w now crashes on boot with 3.5. All works fine with 3.4.
Bisected to the following commit:

20abd1634a6e2eedb84ca977adea56b8aa06cc3e is the first bad commit
commit 20abd1634a6e2eedb84ca977adea56b8aa06cc3e
Author: Ben Skeggs bske...@redhat.com
Date: Mon Apr 30 11:33:43 2012 -0500

drm/nouveau: create real execution engine for software object class

Just a cleanup more or less, and to remove the need for special handling of software objects.

This removes a heap of documentation on dma/graph object formats. The info is very out of date with our current understanding, and is far better documented in rnndb in envytools git.

Signed-off-by: Ben Skeggs bske...@redhat.com

lspci:
01:00.0 VGA compatible controller: NVIDIA Corporation GT215 [Quadro FX 1800M] (rev a2)

kernel output from a working 3.4:
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Detected an NV50 generation card (0x0a3e00a2)
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Checking PRAMIN for VBIOS
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: ... appears to be valid
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Using VBIOS from PRAMIN
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: BIT BIOS found
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Bios version 70.15.43.00
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: TMDS table version 2.0
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: MXM: BIOS version 3.0
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: MXM: MXMS Version 3.0
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB version 4.0
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 00: 01000313 00010034
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 03: 080153d6 0f220020
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 04: 08015392 00020020
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 05: 080143c6 0f220010
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 06: 08014382 00020010
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 08: 040383b6 0f230014
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 10: 020273a6 0f220010
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 11: 02027362 00020010
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB outp 13: 02049300
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 00: 0040
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 01: 1161
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 02: 1231
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 03: 01000331
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 04: 01000446
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 05: 02000546
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 06: 00010631
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 07: 00010746
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 08: 00020847
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: DCB conn 09: 0900
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table 0 at offset 0x7AE4
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: 0x7E6B: Condition still not met after 20ms, skipping following opcodes
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: 0x7E6F: Condition still not met after 20ms, skipping following opcodes
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table 1 at offset 0x809A
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table 2 at offset 0x951E
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table 3 at offset 0x955C
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table 4 at offset 0x97CA
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Parsing VBIOS init table at offset 0x982F
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: 0x982F: Condition still not met after 20ms, skipping following opcodes
Jul 23 19:49:57 localhost kernel: [TTM] Zone kernel: Available graphics memory: 4008772 kiB
Jul 23 19:49:57 localhost kernel: [TTM] Zone dma32: Available graphics memory: 2097152 kiB
Jul 23 19:49:57 localhost kernel: [TTM] Initializing pool allocator
Jul 23 19:49:57 localhost kernel: [TTM] Initializing DMA pool allocator
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: Detected 1024MiB VRAM (GDDR5)
Jul 23 19:49:57 localhost kernel: [drm] nouveau :01:00.0: 512 MiB GART (aperture)
Jul
radeon: *ERROR* Buffer to small for return answer
I am seeing this on a 4670 M96XT RV730 (iMac 11,2) with 2.6.37-rc5:

[drm:radeon_process_aux_ch] *ERROR* Buffer to small for return answer 1 16

[drm] Initialized drm 1.1.0 20060810
[drm] radeon defaulting to kernel modesetting.
[drm] radeon kernel modesetting enabled.
[drm] initializing kernel modesetting (RV730 0x1002:0x9488).
[drm] register mmio base: 0xD042
[drm] register mmio size: 65536
[drm] Detected VRAM RAM=256M, BAR=256M
[drm] RAM width 128bits DDR
[drm] radeon: 256M of VRAM memory ready
[drm] radeon: 512M of GTT memory ready.
[drm] radeon: irq initialized.
[drm] GART: num cpu pages 131072, num gpu pages 131072
[drm] Loading RV730 Microcode
[drm] ring test succeeded in 1 usecs
[drm] radeon: ib pool ready.
[drm] ib test succeeded in 0 usecs
[drm] Enabling audio support
[drm] Radeon Display Connectors
[drm] Connector 0:
[drm] DisplayPort
[drm] HPD1
[drm] DDC: 0x7e40 0x7e40 0x7e44 0x7e44 0x7e48 0x7e48 0x7e4c 0x7e4c
[drm] Encoders:
[drm] DFP1: INTERNAL_UNIPHY
[drm] Connector 1:
[drm] eDP
[drm] HPD2
[drm] DDC: 0x7e50 0x7e50 0x7e54 0x7e54 0x7e58 0x7e58 0x7e5c 0x7e5c
[drm] Encoders:
[drm] LCD1: INTERNAL_UNIPHY
[drm] Connector 2:
[drm] VGA
[drm] DDC: 0x7f10 0x7f10 0x7f14 0x7f14 0x7f18 0x7f18 0x7f1c 0x7f1c
[drm] Encoders:
[drm] CRT2: INTERNAL_KLDSCP_DAC2
[drm] Special thermal controller config
[drm] radeon: power management initialized
[drm:radeon_process_aux_ch] *ERROR* Buffer to small for return answer 1 16
[drm] fb mappable at 0xC0142000
[drm] vram apper at 0xC000
[drm] size 3145728
[drm] fb depth is 24
[drm] pitch is 4096
fb0: radeondrmfb frame buffer device
drm: registered panic notifier
[drm] Initialized radeon 2.7.0 20080528 for :01:00.0 on minor 0