Launchpad has imported 69 comments from the remote bug at
https://bugs.freedesktop.org/show_bug.cgi?id=55984.

If you reply to an imported comment from within Launchpad, your comment
will be sent to the remote bug automatically. Read more about
Launchpad's inter-bugtracker facilities at
https://help.launchpad.net/InterBugTracking.

------------------------------------------------------------------------
On 2012-10-14T23:47:26+00:00 Airlied-freedesktop wrote:

Created attachment 68564
error state

Okay I've seen both my ilk machines gpu hang over the weekend, and I've
never seen them do it before.

I've got an error state from one at least, if it's not in there then I
suspect rc6.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/0

------------------------------------------------------------------------
On 2012-10-15T08:30:42+00:00 Chris Wilson wrote:

The hang looks pretty clean (no suspicious operations), more or less
upon the transition from a 3D to BLT within a UXA batch buffer; rc6
requiring w/a would not surprise me.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/1

------------------------------------------------------------------------
On 2012-10-15T13:35:46+00:00 Daniel-ffwll wrote:

Ok, I've hunted around in our docs a bit and found a few ilk w/as we
don't implement. Or at least what I think we miss, given our sorry state
of docs. Pushed out to

http://cgit.freedesktop.org/~danvet/drm/log/?h=ilk-wa-pile

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/2

------------------------------------------------------------------------
On 2012-10-17T17:07:12+00:00 Chris Wilson wrote:

Also if you want to pin the blame on rc6, i915.i915_enable_rc6=0...
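
An illustrative sketch, not part of the original comment: when comparing
results between testers it helps to confirm that the option really reached
the kernel command line. The helper name rc6_setting is hypothetical; only
the parameter name i915.i915_enable_rc6 comes from this thread. It takes the
last occurrence, on the assumption that later values override earlier ones.

```python
import re
from typing import Optional

# Matches the option only as a whole whitespace-delimited token.
_RC6_PARAM = re.compile(r"(?:^|\s)i915\.i915_enable_rc6=(-?\d+)(?=\s|$)")


def rc6_setting(cmdline: str) -> Optional[int]:
    """Return the i915.i915_enable_rc6 value found on a kernel command line.

    Returns None when the option is absent, i.e. the driver default applies.
    """
    matches = _RC6_PARAM.findall(cmdline)
    return int(matches[-1]) if matches else None


# Example with a made-up command line; on a live system the string could be
# read from /proc/cmdline instead.
sample = "root=/dev/sda1 ro quiet i915.i915_enable_rc6=0"
print(rc6_setting(sample))
```

A result of None means the option was not passed at all, which is different
from rc6 having been explicitly disabled.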

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/3

------------------------------------------------------------------------
On 2012-10-29T21:34:10+00:00 Airlied-freedesktop wrote:

okay got another death with rc6 disabled like Norbert.

took about 3-4 days this time.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/4

------------------------------------------------------------------------
On 2012-11-02T06:40:32+00:00 Airlied-freedesktop wrote:

Created attachment 69415
another error state

[133200.848120] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[133200.848128] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[133202.367409] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[133202.367692] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[133202.367695] [drm:i915_reset] *ERROR* Failed to reset chip.

bits from dmesg.

this is 3.6.0 + -next + ilks wa, I'll try and start a bisect on it now,
4-5 days a hang, back in a few years
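
An illustrative sketch, not part of the original comment: the dmesg excerpts
posted in this thread all follow the same pattern, so a few lines of Python
are enough to pull out the hangcheck timestamps and see whether the driver
gave up ("declaring wedged"). The function names are hypothetical and the
regular expression only assumes the line layout visible in the excerpt above.

```python
import re

SAMPLE = """\
[133200.848120] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[133200.848128] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[133202.367409] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[133202.367692] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[133202.367695] [drm:i915_reset] *ERROR* Failed to reset chip.
"""

# "[timestamp] [drm:function] *ERROR* message"
LINE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+\[drm:(\w+)\]\s+\*ERROR\*\s+(.*)$")


def drm_errors(log):
    """Return (timestamp, function, message) for every drm *ERROR* line."""
    events = []
    for line in log.splitlines():
        m = LINE.match(line)
        if m:
            events.append((float(m.group(1)), m.group(2), m.group(3)))
    return events


def hang_times(log):
    """Timestamps (seconds since boot) of each hangcheck report."""
    return [t for t, func, _ in drm_errors(log) if func == "i915_hangcheck_hung"]


def is_wedged(log):
    """True if the driver reported that it declared the GPU wedged."""
    return any("declaring wedged" in msg for _, _, msg in drm_errors(log))


times = hang_times(SAMPLE)
print(times, is_wedged(SAMPLE))
```

Running it over a full dmesg capture rather than the sample gives a quick
way to compare the time between the first and second hang across reports.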

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/5

------------------------------------------------------------------------
On 2012-11-02T08:57:32+00:00 Chris Wilson wrote:

That error-state is more consistent with a relocation failure than
Norbert's - it fails trying to execute a composite operation within the
middle of a batch.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/6

------------------------------------------------------------------------
On 2012-11-03T15:55:39+00:00 Lekensteyn wrote:

Created attachment 69489
i915_error_state.txt.gz

Having seen https://lkml.org/lkml/2012/10/23/155 I think I am affected by the
same bug. While I was compiling a kernel in a tmpfs, all of a sudden KWin died.
When I looked in dmesg, I saw:
[95597.708097] pci 0000:01:00.0: power state changed by ACPI to D3cold
[98683.176729] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[98683.176736] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[98683.184252] [drm:init_ring_common] *ERROR* failed to set render ring head to zero ctl 00000000 head 69c191cc tail 00000000 start 00003000
[98683.240710] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 0001f001 head 69c191cc tail 00000000 start 00003000
[98686.163041] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[98686.163202] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[98686.163205] [drm:i915_reset] *ERROR* Failed to reset chip.

Attached is an i915_error_state from today, running 3.7-rc2-492-ge657e07 (only
some ARM patches before 3.7-rc3).
I remember that I had exactly the same error message in a -testing branch on
3.6
(http://cgit.freedesktop.org/~danvet/drm-intel/tag/?h=drm-intel-testing&id=drm-intel-next-2012-09-20).
I built that kernel on Sep 21 and it locked up on Sep 27 (no rebooting, just
suspends). If you want a dmesg (nothing interesting) or logs/i915_error_state
from that 3.6 kernel, let me know.

# lspci -vv -s 00:02.0
00:02.0 VGA compatible controller: Intel Corporation Core Processor Integrated Graphics Controller (rev 02) (prog-if 00 [VGA controller])
        Subsystem: CLEVO/KAPOK Computer Device 7130
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 47
        Region 0: Memory at fd000000 (64-bit, non-prefetchable) [size=4M]
        Region 2: Memory at d0000000 (64-bit, prefetchable) [size=256M]
        Region 4: I/O ports at 1800 [size=8]
        Expansion ROM at <unassigned> [disabled]
        Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit-
                Address: fee0f00c  Data: 4142
        Capabilities: [d0] Power Management version 2
                Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [a4] PCI Advanced Features
                AFCap: TP+ FLR+
                AFCtrl: FLR-
                AFStatus: TP-
        Kernel driver in use: i915

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/7

------------------------------------------------------------------------
On 2012-11-03T16:00:01+00:00 Lekensteyn wrote:

Created attachment 69490
Xorg.0.log

In Xorg, I only changed to use SNA instead of the default (UXA?).

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/8

------------------------------------------------------------------------
On 2012-11-05T20:41:04+00:00 Lekensteyn wrote:

In case it gets lost, I bisected the hang to:

504c7267a1e84b157cbd7e9c1b805e1bc0c2c846 is the first bad commit
commit 504c7267a1e84b157cbd7e9c1b805e1bc0c2c846
Author: Chris Wilson <ch...@chris-wilson.co.uk>
Date:   Thu Aug 23 13:12:52 2012 +0100

    drm/i915: Use cpu relocations if the object is in the GTT but not mappable
    
    This prevents the case of unbinding the object in order to process the
    relocations through the GTT and then rebinding it only to then proceed
    to use cpu relocations as the object is now in the CPU write domain. By
    choosing to use cpu relocations up front, we can therefore avoid the
    rebind penalty.
    
    Signed-off-by: Chris Wilson <ch...@chris-wilson.co.uk>
    Signed-off-by: Daniel Vetter <daniel.vet...@ffwll.ch>

:040000 040000 090ed3d52b4f3210b988877f747b6ff86e123385
1d48be89ded4777a543b693db833de64877059c4 M      drivers

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/9

------------------------------------------------------------------------
On 2012-11-06T09:56:41+00:00 Daniel-ffwll wrote:

Ok, doesn't look like an rc6 thing, but very much like a regression.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/10

------------------------------------------------------------------------
On 2012-11-06T10:04:56+00:00 Lekensteyn wrote:

Reverting that commit on top of 3.7-rc4 did not fix the hang issue. If
you need any guinea pig for testing, here I am.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/11

------------------------------------------------------------------------
On 2012-11-06T10:08:24+00:00 Daniel-ffwll wrote:

Created attachment 69604
disable unmappable

Since right now we still have tons of signs pointing at unmappable gtt
handling to be broken/non-coherent somehow, let's try this sledgehammer
here and simply disable it all.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/12

------------------------------------------------------------------------
On 2012-11-06T10:40:05+00:00 Lekensteyn wrote:

I applied that "sledgehammer" patch on 3.7-rc4, but the error persists.
I saved dmesg and the i915_error_state file. If you need more
information (or those logs), please give a call.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/13

------------------------------------------------------------------------
On 2012-11-06T11:04:13+00:00 Daniel-ffwll wrote:

Can you please attach the new error_state with the sledgehammer? Maybe
things shifted around enough to see what's going on ...

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/14

------------------------------------------------------------------------
On 2012-11-06T11:09:54+00:00 Lekensteyn wrote:

Created attachment 69615
i915_error_state.gz with sledgehammer patch

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/15

------------------------------------------------------------------------
On 2012-11-06T16:52:28+00:00 Chris Wilson wrote:

(In reply to comment #15)
> Created attachment 69615 [details]
> i915_error_state.gz with sledgehammer patch

Note that this hang is slightly different again, closer to the one
reported by Norbert, in that the HEAD didn't advance into the
batchbuffer, as opposed to a hang within or after the batch.

So can you please try the hack in conjunction with the ilk-wa-pile?

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/16

------------------------------------------------------------------------
On 2012-11-06T17:19:44+00:00 Lekensteyn wrote:

Created attachment 69630
i915_error_state.txt.gz on ilk-wa-pile 6ef21d3 + sledgehammer

The issue still exists. Same errors in dmesg.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/17

------------------------------------------------------------------------
On 2012-11-06T17:44:17+00:00 Chris Wilson wrote:

Ok, next interesting observation is that your error states both have a
double emission of the request seqno, so perhaps submitting that many
PIPE_CONTROL in sequence is triggering an error? Can you please test, on
top of everything else,

diff --git a/drivers/gpu/drm/i915/intel_ringbuffer.c b/drivers/gpu/drm/i915/intel_ringbuffer.c
index 3af1f2f..994d752 100644
--- a/drivers/gpu/drm/i915/intel_ringbuffer.c
+++ b/drivers/gpu/drm/i915/intel_ringbuffer.c
@@ -662,10 +662,11 @@ pc_render_add_request(struct intel_ring_buffer *ring,
         * incoherence by flushing the 6 PIPE_NOTIFY buffers out to
         * memory before requesting an interrupt.
         */
-       ret = intel_ring_begin(ring, 32);
+       ret = intel_ring_begin(ring, 34);
        if (ret)
                return ret;
 
+       intel_ring_emit(ring, MI_FLUSH);
        intel_ring_emit(ring, GFX_OP_PIPE_CONTROL(4) | PIPE_CONTROL_QW_WRITE |
                        PIPE_CONTROL_WRITE_FLUSH |
                        PIPE_CONTROL_TEXTURE_CACHE_INVALIDATE);
@@ -691,6 +692,7 @@ pc_render_add_request(struct intel_ring_buffer *ring,
        intel_ring_emit(ring, pc->gtt_offset | PIPE_CONTROL_GLOBAL_GTT);
        intel_ring_emit(ring, seqno);
        intel_ring_emit(ring, 0);
+       intel_ring_emit(ring, MI_FLUSH);
        intel_ring_advance(ring);
 
        *result = seqno;
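
An illustrative sketch, not part of the original comment: the test patch
bumps the intel_ring_begin() reservation from 32 to 34 because it adds two
single-dword MI_FLUSH emissions around the existing 32 dwords. The toy model
below shows that accounting only. ToyRing and emit_request are hypothetical
names, the dword values are placeholder strings rather than real opcodes, and
the strict checks are an assumption of the model, not a statement about what
the driver enforces.

```python
MI_FLUSH = "MI_FLUSH"  # placeholder token, not the real opcode encoding


class ToyRing:
    """Toy command ring: reserve dwords, emit them one by one, then advance."""

    def __init__(self):
        self.reserved = 0
        self.pending = []
        self.committed = []

    def begin(self, num_dwords):
        # Reserve space for exactly num_dwords emissions.
        self.reserved = num_dwords
        self.pending = []

    def emit(self, dword):
        if len(self.pending) >= self.reserved:
            raise OverflowError("emitted more dwords than were reserved")
        self.pending.append(dword)

    def advance(self):
        if len(self.pending) != self.reserved:
            raise ValueError(
                f"reserved {self.reserved} dwords but emitted {len(self.pending)}"
            )
        self.committed.extend(self.pending)
        self.reserved = 0
        self.pending = []


def emit_request(ring, reserve, with_flushes):
    """Emit the 32 existing request dwords, optionally wrapped in MI_FLUSH."""
    ring.begin(reserve)
    if with_flushes:
        ring.emit(MI_FLUSH)
    for i in range(32):
        ring.emit(f"request_dword_{i}")
    if with_flushes:
        ring.emit(MI_FLUSH)
    ring.advance()


ring = ToyRing()
emit_request(ring, reserve=34, with_flushes=True)
print(len(ring.committed))
```

In this model, keeping the reservation at 32 while adding the two flushes
raises an error on the 33rd emission, which is why both changes go together.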

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/18

------------------------------------------------------------------------
On 2012-11-06T20:31:01+00:00 Lekensteyn wrote:

Created attachment 69639
i915_error_state.txt.gz ilk-wa-pipe 6ef21d3 + sledgehammer + ring flush

The bug is still triggered.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/19

------------------------------------------------------------------------
On 2012-11-07T08:40:14+00:00 Norbert Preining wrote:

Created attachment 69656
ilk-wa-pipe + sledgehammer + ring flush

Same here, I got a hang with all the mentioned patches while compiling a
big bunch of TeX Live. Error state is here now.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/20

------------------------------------------------------------------------
On 2012-11-07T08:52:44+00:00 Chris Wilson wrote:

ARGH!

Still it hangs in the middle of a series of requests (with no
intervening batches or other operations). That should be impossible
design wise, and improbable hardware wise.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/21

------------------------------------------------------------------------
On 2012-11-08T23:31:33+00:00 Norbert Preining wrote:

Created attachment 69781
i915 error state with ilk-wa-pipe + sledgehammer + ring flush (another hang)

Here is another hang with a different error state (at least to my eyes).
Happened when running git checkout on a big repository. No other
messages.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/22

------------------------------------------------------------------------
On 2012-11-09T15:54:41+00:00 Lekensteyn wrote:

Is there anything to test? I mentioned before that this occurs when the
memory is almost full. I have no swap, but 8GB RAM. Copied five times
1.2GiB (=6GiB total) to tmpfs (/dev/shm and /tmp).

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/23

------------------------------------------------------------------------
On 2012-11-09T16:25:19+00:00 Chris Wilson wrote:

8GB machine (i3-330m) with no swap:

$ mount -ttmpfs -osize=100% none /tmp/wtf
$ while :; do yes wtf > /tmp/wtf/wtf; done &
$ sudo X -ac -noreset & while :; do x11perf -aa10text -d :0; done

with that I am able to repeatedly drive the machine to oom without
triggering a GPU hang. Note I am using this set of patches on top of
dinq: http://cgit.freedesktop.org/~ickle/linux-2.6/commit/?h=fastboot

Peter, is that close enough to your test case to trigger the bug, or do
I need to tweak it slightly? Can you please also test with the patches
in fastboot, in case there is an accidental fix?

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/24

------------------------------------------------------------------------
On 2012-11-09T18:48:51+00:00 Lekensteyn wrote:

Created attachment 69833
i915_error_state.txt.gz ickle/linux-2.6 fastboot with sledgehammer + ring flush

The bug still triggers, w/ and w/o the sledgehammer+ring flush patches.

The dmesg is now slightly different on the ickle/linux-2.6 fastboot
branch:

[  501.214949] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[  501.214958] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[  501.219393] [drm:init_ring_common] *ERROR* failed to set render ring head to zero ctl 00000000 head 09e16d8c tail 00000000 start 00300000
[  501.262795] [drm:intel_dp_aux_wait_done] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
[  501.274784] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 0001f001 head 09e16d8c tail 00000000 start 00300000
[  501.302762] [drm:intel_dp_aux_wait_done] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
[  502.274145] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[  502.274293] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[  502.274298] [drm:i915_reset] *ERROR* Failed to reset chip.

One thing that I now notice (because I did not try it before) is that
switching to a text console (ctrl+alt+F1) gives me a black screen with
some flashing large rectangles on screen (possibly the flashing cursor
for the username).

Starting just X from a TTY and then running x11perf/glxgears/glxspheres
with cp linux tree / yes / dd if=/dev/zero of=/tmp/wtf.. did not work.
Even if OOM killed half my machine

I can only reproduce it after logging into KDE and running a GL program
(like glxgears). x11perf does not trigger the bug, even in KDE. Maybe
other (compositing) window managers work too, but I have not tested
that.

I am using the below bash script after logging into KDE. After starting
this script, I watch the kernel log (journalctl -f) and run `glxgears`.
#!/bin/bash
mkdir -p /tmp/wtf
mountpoint /tmp/wtf||sudo mount -osize=6200M -t tmpfs none /tmp/wtf
echo 15 > /proc/$$/oom_score_adj # just in case...
pids=
for i in /tmp/wtf/hang-{1..6}; do
        rm -rf "$i"
        #yes wtf > $i & # did not work
        cp -ra ~/Linux-src/linux "$i" &
        pids="$pids $!"
done
trap "kill $pids" EXIT
wait

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/25

------------------------------------------------------------------------
On 2012-11-09T19:11:32+00:00 Chris Wilson wrote:

Peter, have you tested i915.i915_enable_rc6=0 (on top of the
sledgehammer and w/a)? You have a most peculiar failure pattern where
the GPU should be idle and then dies in a flush.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/26

------------------------------------------------------------------------
On 2012-11-09T20:25:40+00:00 Daniel-ffwll wrote:

A similar bug on i965gm (bug #56916) mentions that things _only_ blow up
when a mesa program is running. So everyone who can hit this, please
reply with your exact mesa version and what (if any) GL programs you
have running when this happens (GL compositor, ...). Also, those who can
readily reproduce the hangs, please check whether stopping all GL
clients (disable the compositor or use a non-GL one) prevents the hangs.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/27

------------------------------------------------------------------------
On 2012-11-09T22:26:22+00:00 Lekensteyn wrote:

New results:
- with rc6 disabled, both 3.7-rc4 and wa+sledgehammer+ringflush do not
  expose the bug
- with rc6 not disabled (i.e. the default, -1), wa+sledgehammer+ringflush
  and GL compositing disabled in KWin, the bug is not triggered (in the
  same boot, GL compositing was enabled again and the bug showed up)

I am using the standard Mesa packages shipped with Arch Linux, that is 9.0.
The bug is triggered when KDE's KWin is active and glxgears is running. 
(instead of glxgears, I first tried glxspheres which triggers the bug too)

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/28

------------------------------------------------------------------------
On 2012-11-10T08:19:21+00:00 Chris Wilson wrote:

Definitely looks like we have a pair of independent unresolved "cpu-
relocs" and rc6 issues.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/29

------------------------------------------------------------------------
On 2012-11-10T13:50:37+00:00 Daniel-ffwll wrote:

Ok, it looks like we have different bugs here, or at least non-
overlapping sets of workarounds :(

Peter Wu, can you please check what happens when you manually enable rc6
on a 3.6 kernel?

Norbert Preining, test-results for your machine wrt rc6 vs. "mesa
client/compositor running" vs. 3.6/3.7-rc would be really interesting,
since iirc you can blow up your machine rather quickly, too.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/30

------------------------------------------------------------------------
On 2012-11-11T10:42:09+00:00 Lekensteyn wrote:

3.6.6 w/o patches, w/ i915.i915_enable_rc6=1, w/ OpenGL compositing WM
(KWin) and glxgears does *not* trigger the bug. I do get a very sluggish
desktop which ultimately leads to some OOMs, but that is normal.

If it helps, I have tested the stock arch kernel config:
https://projects.archlinux.org/svntogit/packages.git/tree/trunk/config.x86_64?h=packages/linux&id=89de8dc7df6894c219e746326ca338e9279c2e3f

and my own config:
https://github.com/Lekensteyn/aur/blob/13feda6a55fb67c912c0611dc0c019bb084e7560/linux-custom/config

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/31

------------------------------------------------------------------------
On 2012-11-11T11:50:03+00:00 Norbert Preining wrote:

(In reply to comment #30)
> Norbert Preining, test-results for your machine wrt rc6 vs. "mesa
> client/compositor running" vs. 3.6/3.7-rc would be really interesting, since
> iirc you can blow up your machine rather quickly, too.

I am running now with rc6 disabled and all the patches mentioned above.
I am trying with Gnome3 and some GLX programs to see what I can do.

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/32

------------------------------------------------------------------------
On 2012-11-11T13:41:06+00:00 Norbert Preining wrote:

Created attachment 69900
i915 error state, i915_enable_rc6=0, rc4 + ilk-wa-pipe + sledgehammer + ring flush

As requested, here is another hang with rc6 disabled and the above patches.
Happened again when doing heavy photo viewing with quick switching in shotwell.

If you need other configurations or tests, please let me know

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/33

------------------------------------------------------------------------
On 2012-11-11T13:51:30+00:00 Daniel-ffwll wrote:

(In reply to comment #33)
> Created attachment 69900 [details]
> i915 error state, i915_enable_rc6=0, rc4 + ilk-wa-pipe + sledgehammer + ring
> flush

Same hang as before on your machine between a rectlist PRIM and a BLT.

> As requested, here is another hang with rc6 disabled and the above patches.
> Happened again when doing heavy photo viewing with quick switching in
> shotwell.

To check: Is this with a GL client/compositor running?

> If you need other configurations or tests, please let me know

If the above is with a GL client, then trying to hang the box without
any GL client/compositor running would be interesting.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/34

------------------------------------------------------------------------
On 2012-11-11T13:55:05+00:00 Norbert Preining wrote:

(In reply to comment #34)
> Same hang as before on your machine between a rectlist PRIM and a BLT.

Ok, at least repeatable ;-) So in my case rc6 does not make a change,
former one was without any specific rc6 cmdline.

> To check: Is this with a GL client/compositor running?

Gnome3, so I guess there is a compositor running.

> If the above is with a GL client, then trying to hang the box without any GL
> client/compositor running would be interesting.

Hmm, what WM could I use, guess I have to try fvwm back again. Will try
in one way or the other.

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/35

------------------------------------------------------------------------
On 2012-11-13T16:48:13+00:00 Chris Wilson wrote:

Norbert, since you see a slightly different presentation of this bug, it
would be useful if you could also test
http://cgit.freedesktop.org/~ickle/linux-2.6/log/?h=fastboot which
despite its name also contains some work on the mb() around the
relocations.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/36

------------------------------------------------------------------------
On 2012-11-13T22:32:38+00:00 Norbert Preining wrote:

(In reply to comment #36)
> http://cgit.freedesktop.org/~ickle/linux-2.6/log/?h=fastboot which despite
> its name also contains some work on the mb() around the relocations.

Ok, there is one merge conflict with current kernel master, but I am
trying to build the kernel now after fixing the conflict in one way
(keeping the code).

I tried also to merge that with the ilk-pile but that was hopeless with
loads of merge conflicts.

Will give feedback as soon as I can.

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/37

------------------------------------------------------------------------
On 2012-11-14T09:30:11+00:00 Daniel-ffwll wrote:

Our QA discovered a random corruption issue (bug #56859) and bisected it
to

commit 7f1290f2f2a4d2c3f1b7ce8e87256e052ca23125
Author: Jianguo Wu <wujian...@huawei.com>
Date:   Mon Oct 8 16:33:06 2012 -0700

    mm: fix-up zone present pages

Can those who can reproduce this bug here easily please test whether
reverting that commit changes anything?

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/38

------------------------------------------------------------------------
On 2012-11-14T18:55:16+00:00 Lekensteyn wrote:

Reverting that commit on top of 3.7-rc5-git-14-g9924a19 does not help.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/39

------------------------------------------------------------------------
On 2012-11-14T19:04:43+00:00 Chris Wilson wrote:

(just restoring priority so it doesn't fall out of our p1 lists)

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/40

------------------------------------------------------------------------
On 2012-11-15T11:38:49+00:00 Chris Wilson wrote:

So one thing worth trying is:

diff --git a/drivers/char/agp/intel-gtt.c b/drivers/char/agp/intel-gtt.c
index 7ad6d13..6177daa 100644
--- a/drivers/char/agp/intel-gtt.c
+++ b/drivers/char/agp/intel-gtt.c
@@ -573,7 +573,7 @@ static int intel_gtt_init(void)
                return ret;
 
        intel_private.base.gtt_mappable_entries = intel_gtt_mappable_entries();
-       intel_private.base.gtt_total_entries = intel_gtt_total_entries();
+       intel_private.base.gtt_total_entries = intel_gtt_mappable_entries();
 
        /* save the PGETBL reg for resume */
        intel_private.PGETBL_save =

(It's a bit shotgun, but if it still continues to fail after that all
the earlier symptoms have just been canaries.)

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/41

------------------------------------------------------------------------
On 2012-11-15T12:16:27+00:00 Daniel-ffwll wrote:

(In reply to comment #41)
> So one thing worth trying is:
> 
> diff --git a/drivers/char/agp/intel-gtt.c b/drivers/char/agp/intel-gtt.c
> index 7ad6d13..6177daa 100644
> --- a/drivers/char/agp/intel-gtt.c
> +++ b/drivers/char/agp/intel-gtt.c
> @@ -573,7 +573,7 @@ static int intel_gtt_init(void)
>                 return ret;
>  
>         intel_private.base.gtt_mappable_entries =
> intel_gtt_mappable_entries();
> -       intel_private.base.gtt_total_entries = intel_gtt_total_entries();
> +       intel_private.base.gtt_total_entries = intel_gtt_mappable_entries();
>  
>         /* save the PGETBL reg for resume */
>         intel_private.PGETBL_save =
> 
> (It's a bit shotgun, but if it still continues to fail after that all the
> earlier symptoms have just been canaries.)

Looks eerily similar to attachment #69604 i.e. has been tried, doesn't
work on at least Peter's machine.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/42

------------------------------------------------------------------------
On 2012-11-15T13:15:37+00:00 Daniel-ffwll wrote:

Created attachment 70111
disable unbound tracking

Silly me just noticed that the unbound tracking has been merged into
3.7, not 3.6. This has a big enough impact to explain all kinds of
things. Please try the attached patch, thanks.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/43

------------------------------------------------------------------------
On 2012-11-15T13:30:28+00:00 Chris Wilson wrote:

(In reply to comment #43)
> Created attachment 70111 [details] [review]
> disable unbound tracking
> 
> Silly me just noticed that the unbound tracking has been merged into 3.7,
> not 3.6. This has a big enough impact to explain all kinds of things. Please
> try the attached patch, thanks.

i.e. i915_gem_object_set_to_cpu_domain(obj, true); on unbind which would
more explicitly test the failure mechanism.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/44

------------------------------------------------------------------------
On 2012-11-15T13:45:15+00:00 Chris Wilson wrote:

Created attachment 70114
always do set-gtt-domain

As a follow-on test, this covers one of the areas where we short-circuit
domain tracking, which may be fouled up by not calling set-to-cpu-domain
upon unbind.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/45

------------------------------------------------------------------------
On 2012-11-15T21:13:45+00:00 Chris Wilson wrote:

Created attachment 70142
always do set-gtt-domain

Better patch, maybe a fix for something...

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/46

------------------------------------------------------------------------
On 2012-11-15T22:29:52+00:00 Norbert Preining wrote:

(In reply to comment #46)
> Created attachment 70142 [details] [review]
> always do set-gtt-domain
> 
> Better patch, maybe a fix for something...

On top of what should we try that? rc5 plain? rc5+ilk-pile? ...?
Only this patch or some others from this thread, too?

Thanks

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/47

------------------------------------------------------------------------
On 2012-11-15T22:32:37+00:00 Daniel-ffwll wrote:

(In reply to comment #47)
> (In reply to comment #46)
> > Created attachment 70142 [details] [review] [review]
> > always do set-gtt-domain
> > 
> > Better patch, maybe a fix for something...
> 
> On top of what should we try that? rc5 plain? rc5+ilk-pile? ...?
> Only this patch or some others from this thread, too?

Plain 3.7-rc kernel, just pick one that's broken ;-) Please also test
the patch in comment #43 since that one tests a different theory.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/48

------------------------------------------------------------------------
On 2012-11-15T22:39:23+00:00 Norbert Preining wrote:

(In reply to comment #48)
> Plain 3.7-rc kernel, just pick one that's broken ;-) Please also test the
> patch in comment #43 since that one tests a different theory.

Ok, compiling now. Thanks

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/49

------------------------------------------------------------------------
On 2012-11-15T22:51:05+00:00 Chris Wilson wrote:

(In reply to comment #48)
> Plain 3.7-rc kernel, just pick one that's broken ;-) Please also test the
> patch in comment #43 since that one tests a different theory.

#44 / #46 are both elements of #43... All 3 are worth testing
independently.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/50

------------------------------------------------------------------------
On 2012-11-15T23:26:54+00:00 Lekensteyn wrote:

Still broken with the same dmesg messages:

- 3.7.0-rc5-git-68-gc5e35d6 + disable unbound tracking
- 3.7.0-rc5-git-68-gc5e35d6 + always-do-set-to-gtt

Do you want me to add the i915_error_states?

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/51

------------------------------------------------------------------------
On 2012-11-16T14:57:33+00:00 Daniel-ffwll wrote:

Peter Wu, since you seem to have dug out the only bisect result (which
didn't check out when reverting), can you please check whether the
parent of the bad commit is working out for you perfectly well? Afaict
this should be

commit 0327d6ba998ca181013a5a1709701a6532a41972
Author: Chris Wilson <ch...@chris-wilson.co.uk>
Date:   Sat Aug 11 15:41:06 2012 +0100

    drm/i915: Extract general object init routine

Pretty much all hairy changes in gem for 3.7 are before that commit, so
knowing that things are solid with this sha1 would be rather helpful. So
please beat on this extensively, thanks.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/52

------------------------------------------------------------------------
On 2012-11-16T18:24:03+00:00 Daniel-ffwll wrote:

Created attachment 70168
disable cpu relocs completely

I'm not completely sure, but I think we haven't ruled this one out yet.
Please test, thanks.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/53

------------------------------------------------------------------------
On 2012-11-16T18:58:05+00:00 Lekensteyn wrote:

Still affected:
- 3.6.0-rc2-git-87-g0327d6b no patches
- 3.7.0-rc5-git-68-gc5e35d6 + disable-cpu-relocs

Do I need to combine some patches? E.g. the disable-cpu-relocs with
sledgehammer + ring flush?
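
The version strings in this report look like `git describe` output, where
the trailing "g<hex>" is an abbreviated commit id. A minimal Python
sketch, assuming that pattern holds, for checking that a tested kernel
was built from the commit requested in comment 52 (the function names
are hypothetical, not part of any tool mentioned in this thread):

```python
import re

# Assumption: the kernel version ends in "-g<abbreviated sha1>", as produced
# by `git describe`. The names below are illustrative only.
_DESCRIBE_SHA = re.compile(r"-g([0-9a-f]{7,40})$")


def abbreviated_sha(version):
    """Return the abbreviated commit id embedded in a version string, or None."""
    match = _DESCRIBE_SHA.search(version)
    return match.group(1) if match else None


def built_from(version, full_sha):
    """True if the version string's abbreviated id is a prefix of full_sha."""
    short = abbreviated_sha(version)
    return short is not None and full_sha.startswith(short)


if __name__ == "__main__":
    # Full sha1 of "drm/i915: Extract general object init routine".
    parent = "0327d6ba998ca181013a5a1709701a6532a41972"
    for version in ("3.6.0-rc2-git-87-g0327d6b", "3.7.0-rc5-git-68-gc5e35d6"):
        print(version, built_from(version, parent))
```

Under that assumption, the unpatched 3.6.0-rc2 kernel listed above is
the parent commit that was asked about.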

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/54

------------------------------------------------------------------------
On 2012-11-17T10:55:14+00:00 Daniel-ffwll wrote:

To hunt down a few other theories, can everyone please attach the
complete dmesg (doesn't really matter whether with drm.debug or not,
kernel version also doesn't matter).

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/55

------------------------------------------------------------------------
On 2012-11-17T14:30:34+00:00 seipher wrote:

Created attachment 70188
dmesg/error_state on ilk/drm-intel-nightly/ubuntu 12.10

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/56

------------------------------------------------------------------------
On 2012-11-17T17:14:46+00:00 Lekensteyn wrote:

Created attachment 70192
dmesg from 3.6.0-rc2-git-87-g0327d6b no patches

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/57

------------------------------------------------------------------------
On 2012-11-18T12:20:58+00:00 Norbert Preining wrote:

Created attachment 70214
dmesg from 3.7.0-rc5+

Here is my dmesg from the currently running kernel.
I just returned from a trip and will try the patch from comment 53 on
top of the patches in comments 43 and 46.

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/58

------------------------------------------------------------------------
On 2012-11-19T09:52:29+00:00 Daniel-ffwll wrote:

Created attachment 70248
use dma32 for gem bo allocations

It's not very likely, but on the off chance that this helps, please
test.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/59

------------------------------------------------------------------------
On 2012-11-19T16:29:01+00:00 Daniel-ffwll wrote:

Ok, yet another new theory ... everyone please attach your kernel
.config, thanks.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/60

------------------------------------------------------------------------
On 2012-11-19T18:33:53+00:00 Lekensteyn wrote:

Created attachment 70271
kernel config used for 3.7.x

I haven't tested the patch from comment 59 yet, but here is my kernel
config.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/61

------------------------------------------------------------------------
On 2012-11-19T21:07:24+00:00 seipher wrote:

Created attachment 70279
config-3.7.0-rc4-g6283022

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/62

------------------------------------------------------------------------
On 2012-11-19T22:21:16+00:00 Norbert Preining wrote:

Created attachment 70286
Norbert's kernel config

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/63

------------------------------------------------------------------------
On 2012-11-20T07:39:30+00:00 Norbert Preining wrote:

Created attachment 70296
bug hit with 3.7-rc6 plus patches from #43 and #46, but not #59

Here is another hang while running the two patches from #43 and #46.
Immediately after that I also got a page allocation failure; here are
the syslog messages:
Nov 20 14:32:57 tofuschnitzel kernel: [55009.700562] [drm:i915_hangcheck_hung] 
*ERROR* Hangcheck timer elapsed... GPU hung
Nov 20 14:32:57 tofuschnitzel kernel: [55009.700571] [drm] capturing error 
event; look for more information in /debug/dri/0/i915_error_state
Nov 20 14:32:58 tofuschnitzel kernel: [55011.204741] [drm:i915_hangcheck_hung] 
*ERROR* Hangcheck timer elapsed... GPU hung
Nov 20 14:32:58 tofuschnitzel kernel: [55011.204853] [drm:i915_reset] *ERROR* 
GPU hanging too fast, declaring wedged!
Nov 20 14:32:58 tofuschnitzel kernel: [55011.204858] [drm:i915_reset] *ERROR* 
Failed to reset chip.
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390112] cat: page allocation 
failure: order:9, mode:0x2000d0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390158] Pid: 17244, comm: cat Not 
tainted 3.7.0-rc6+ #42
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390188] Call Trace:
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390214]  [<ffffffff81095abc>] 
warn_alloc_failed+0x10a/0x11e
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390247]  [<ffffffff81096f44>] ? 
page_alloc_cpu_notify+0x3e/0x3e
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390280]  [<ffffffff81096f55>] ? 
drain_local_pages+0x11/0x13
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390313]  [<ffffffff81097de2>] 
__alloc_pages_nodemask+0x5a0/0x5e2
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390351]  [<ffffffff810beaf0>] 
____cache_alloc+0x2b5/0x544
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390382]  [<ffffffff810beddd>] 
__kmalloc+0x5e/0x96
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390413]  [<ffffffff810ddeba>] 
seq_read+0x1c3/0x324
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390443]  [<ffffffff810c4d6a>] 
vfs_read+0x98/0xfa
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390470]  [<ffffffff810c4e19>] 
sys_read+0x4d/0x7a
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390500]  [<ffffffff814bb9d2>] 
system_call_fastpath+0x16/0x1b
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390532] Mem-Info:
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390547] DMA per-cpu:
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390565] CPU    0: hi:    0, btch:  
 1 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390591] CPU    1: hi:    0, btch:  
 1 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390617] CPU    2: hi:    0, btch:  
 1 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390643] CPU    3: hi:    0, btch:  
 1 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390669] DMA32 per-cpu:
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390686] CPU    0: hi:  186, btch:  
31 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.390712] CPU    1: hi:  186, btch:  
31 usd: 185
Nov 20 14:33:58 tofuschnitzel kernel: [55071.391920] CPU    2: hi:  186, btch:  
31 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.393113] CPU    3: hi:  186, btch:  
31 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.394320] Normal per-cpu:
Nov 20 14:33:58 tofuschnitzel kernel: [55071.395487] CPU    0: hi:  186, btch:  
31 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.396654] CPU    1: hi:  186, btch:  
31 usd: 152
Nov 20 14:33:58 tofuschnitzel kernel: [55071.397839] CPU    2: hi:  186, btch:  
31 usd:  28
Nov 20 14:33:58 tofuschnitzel kernel: [55071.398968] CPU    3: hi:  186, btch:  
31 usd:   0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072] active_anon:115042 
inactive_anon:65045 isolated_anon:0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072]  active_file:264259 
inactive_file:405769 isolated_file:0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072]  unevictable:22 dirty:7 
writeback:0 unstable:0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072]  free:58783 
slab_reclaimable:45728 slab_unreclaimable:11528
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072]  mapped:15992 shmem:10670 
pagetables:6769 bounce:0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.400072]  free_cma:0
Nov 20 14:33:58 tofuschnitzel kernel: [55071.406629] DMA free:15748kB min:540kB 
low:672kB high:808kB active_anon:0kB inactive_anon:4kB active_file:120kB 
inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB 
present:15672kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB 
slab_reclaimable:20kB slab_unreclaimable:4kB kernel_stack:0kB pagetables:0kB 
unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 
all_unreclaimable? no
Nov 20 14:33:58 tofuschnitzel kernel: [55071.410109] lowmem_reserve[]: 0 2925 
3808 3808
Nov 20 14:33:58 tofuschnitzel kernel: [55071.411302] DMA32 free:172664kB 
min:103388kB low:129232kB high:155080kB active_anon:357940kB 
inactive_anon:99880kB active_file:879092kB inactive_file:1375920kB 
unevictable:16kB isolated(anon):0kB isolated(file):0kB present:2995364kB 
mlocked:16kB dirty:12kB writeback:0kB mapped:46008kB shmem:12988kB 
slab_reclaimable:140052kB slab_unreclaimable:11464kB kernel_stack:584kB 
pagetables:5324kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB 
pages_scanned:18 all_unreclaimable? no
Nov 20 14:33:59 tofuschnitzel kernel: [55071.415174] lowmem_reserve[]: 0 0 883 
883
Nov 20 14:33:59 tofuschnitzel kernel: [55071.416495] Normal free:46720kB 
min:31236kB low:39044kB high:46852kB active_anon:102228kB 
inactive_anon:160296kB active_file:177824kB inactive_file:247156kB 
unevictable:72kB isolated(anon):0kB isolated(file):0kB present:904960kB 
mlocked:72kB dirty:16kB writeback:0kB mapped:17960kB shmem:29692kB 
slab_reclaimable:42840kB slab_unreclaimable:34644kB kernel_stack:2592kB 
pagetables:21752kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB 
pages_scanned:0 all_unreclaimable? no
Nov 20 14:33:59 tofuschnitzel kernel: [55071.420657] lowmem_reserve[]: 0 0 0 0
Nov 20 14:33:59 tofuschnitzel kernel: [55071.422091] DMA: 3*4kB 3*8kB 0*16kB 
1*32kB 3*64kB 3*128kB 3*256kB 2*512kB 3*1024kB 3*2048kB 1*4096kB = 15748kB
Nov 20 14:33:59 tofuschnitzel kernel: [55071.423560] DMA32: 2116*4kB 2105*8kB 
5306*16kB 1052*32kB 192*64kB 39*128kB 13*256kB 10*512kB 1*1024kB 1*2048kB 
0*4096kB = 172664kB
Nov 20 14:33:59 tofuschnitzel kernel: [55071.425055] Normal: 2764*4kB 1152*8kB 
951*16kB 237*32kB 27*64kB 1*128kB 1*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 
48768kB
Nov 20 14:33:59 tofuschnitzel kernel: [55071.426553] 695040 total pagecache 
pages
Nov 20 14:33:59 tofuschnitzel kernel: [55071.428009] 14351 pages in swap cache
Nov 20 14:33:59 tofuschnitzel kernel: [55071.429508] Swap cache stats: add 
236320, delete 221969, find 63452/74380
Nov 20 14:33:59 tofuschnitzel kernel: [55071.431003] Free swap  = 9658608kB
Nov 20 14:33:59 tofuschnitzel kernel: [55071.432418] Total swap = 9905148kB
Nov 20 14:33:59 tofuschnitzel kernel: [55071.446640] 1015792 pages RAM
Nov 20 14:33:59 tofuschnitzel kernel: [55071.448108] 37280 pages reserved
Nov 20 14:33:59 tofuschnitzel kernel: [55071.449587] 1201194 pages shared
Nov 20 14:33:59 tofuschnitzel kernel: [55071.451024] 804246 pages non-shared
Nov 20 14:33:59 tofuschnitzel kernel: [55071.452459] SLAB: Unable to allocate 
memory on node 0 (gfp=0xd0)
Nov 20 14:33:59 tofuschnitzel kernel: [55071.454003]   cache: size-2097152, 
object size: 2097152, order: 9
Nov 20 14:33:59 tofuschnitzel kernel: [55071.455474]   node 0: slabs: 0/0, 
objs: 0/0, free: 0

Maybe that helps. Now I am running #43, #46, #59.
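
The failing request in the log above is "order:9", i.e. 2^9 contiguous
pages. With 4 kB pages (an assumption, though the usual case on x86-64)
that is 2048 kB, which matches the "size-2097152" slab cache named at
the end of the log. A rough Python sketch for reading the per-zone
free-list lines, with hypothetical helper names and the wrapped log
line rejoined by hand:

```python
import re

PAGE_KB = 4  # assumption: 4 kB base pages


def order_size_kb(order):
    """Size in kB of one allocation of the given page order."""
    return PAGE_KB << order


def parse_free_list(line):
    """Parse the 'COUNT*SIZEkB' pairs of a zone line into {size_kb: count}."""
    pairs = re.findall(r"(\d+)\*(\d+)kB", line)
    return {int(size): int(count) for count, size in pairs}


def blocks_at_least(free_list, size_kb):
    """Number of free blocks that are at least size_kb large."""
    return sum(count for size, count in free_list.items() if size >= size_kb)


if __name__ == "__main__":
    dma32 = ("DMA32: 2116*4kB 2105*8kB 5306*16kB 1052*32kB 192*64kB "
             "39*128kB 13*256kB 10*512kB 1*1024kB 1*2048kB 0*4096kB "
             "= 172664kB")
    free_list = parse_free_list(dma32)
    needed = order_size_kb(9)
    print("request size (kB):", needed)
    print("free blocks large enough:", blocks_at_least(free_list, needed))
    print("total free (kB):", sum(s * c for s, c in free_list.items()))
```

Very few blocks of that size were free, which is consistent with, though
not proof of, memory fragmentation rather than a shortage of free memory
overall.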

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/64

------------------------------------------------------------------------
On 2012-11-20T08:44:48+00:00 Norbert Preining wrote:

Another hang with the #43, #46 and #59 patches applied. Is the i915
error state needed?

It always happens while I am doing heavy I/O. This time it was
pbuilder/cowbuilder installation tests of 1 GB of new packages on
Debian.

Norbert

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/68

------------------------------------------------------------------------
On 2012-11-20T08:48:45+00:00 Chris Wilson wrote:

One question we haven't asked is whether this is a genuine hang or an
unfortunate hangcheck. Can you please reproduce with
i915.enable_hangcheck=0 and see if your machine locks up instead of
reporting the hang?
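
To see from a saved log what the driver reported, one could count the
messages quoted in comment 64. A minimal Python sketch, assuming each
kernel message sits on a single line (the wrapped text in this thread
would need rejoining first); the function name is hypothetical:

```python
# Message fragments taken from the syslog excerpt in comment 64.
HANGCHECK = "Hangcheck timer elapsed... GPU hung"
WEDGED = "GPU hanging too fast, declaring wedged!"
RESET_FAILED = "Failed to reset chip."


def summarize_hang_log(lines):
    """Count hangcheck, wedged and failed-reset messages in log lines."""
    summary = {"hangcheck_fired": 0, "declared_wedged": 0, "reset_failed": 0}
    for line in lines:
        if HANGCHECK in line:
            summary["hangcheck_fired"] += 1
        if WEDGED in line:
            summary["declared_wedged"] += 1
        if RESET_FAILED in line:
            summary["reset_failed"] += 1
    return summary


if __name__ == "__main__":
    sample = [
        "[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung",
        "[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung",
        "[drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!",
        "[drm:i915_reset] *ERROR* Failed to reset chip.",
    ]
    print(summarize_hang_log(sample))
```

With hangcheck disabled none of these messages should appear, so a
machine that still locks up would point to a genuine hang.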

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/69

------------------------------------------------------------------------
On 2012-11-20T09:29:40+00:00 seipher wrote:

For me, bisection pointed to commit #6c085a72 - drm/i915: Track unbound
pages. A couple of additional test runs at this commit and at its
parent confirm this.

I'll try the patches from comments #43-#46. Since those didn't fix the
problem for Norbert, we might have multiple issues.

Chris: running with enable_hangcheck=0 on #6c085a72 the machine locked
up.

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/70

------------------------------------------------------------------------
On 2012-11-21T15:30:39+00:00 Chris Wilson wrote:

Imre suggests that there is a possible fix in
http://cgit.freedesktop.org/~ickle/linux-2.6/log/?h=for-imre. Can people
please try that branch and see if it improves matters for them?

Reply at: https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-
intel/+bug/1081009/comments/76


** Changed in: xserver-xorg-video-intel
       Status: Unknown => Incomplete

** Changed in: xserver-xorg-video-intel
   Importance: Unknown => Medium

-- 
You received this bug notification because you are a member of Ubuntu-X,
which is subscribed to xserver-xorg-video-intel in Ubuntu.
https://bugs.launchpad.net/bugs/1081009

Title:
  [arrandale] GPU lockup  IPEHR: 0x02000022

To manage notifications about this bug go to:
https://bugs.launchpad.net/xserver-xorg-video-intel/+bug/1081009/+subscriptions
