Re: [Mesa-dev] [PATCH] loader/dri3: Overhaul dri3_update_num_back

Axel Davy Fri, 02 Sep 2016 07:46:31 -0700

Le 02/09/2016 à 16:41, Axel Davy a écrit :

Le 02/09/2016 à 03:06, Michel Dänzer a écrit :
On 02/09/16 12:37 AM, Alex Deucher wrote:
On Thu, Sep 1, 2016 at 11:28 AM, Jason Ekstrand<ja...@jlekstrand.net> wrote:
On Aug 31, 2016 11:39 PM, "Michel Dänzer" <mic...@daenzer.net> wrote:
On 01/09/16 02:05 PM, Jason Ekstrand wrote:
On Wed, Aug 31, 2016 at 7:00 PM, Michel Dänzer <mic...@daenzer.net
<mailto:mic...@daenzer.net>> wrote:
     On 31/08/16 11:21 PM, Jason Ekstrand wrote:
     > On Aug 19, 2016 12:07 AM, "Michel Dänzer" <mic...@daenzer.net
<mailto:mic...@daenzer.net>
> <mailto:mic...@daenzer.net <mailto:mic...@daenzer.net>>>wrote:
     >> From: Michel Dänzer <michel.daen...@amd.com
<mailto:michel.daen...@amd.com>
> <mailto:michel.daen...@amd.com<mailto:michel.daen...@amd.com>>>
     >>
>> Always use 3 buffers when flipping. With only 2 buffers,we have
to wait
     >> for a flip to complete (which takes non-0 time even with
asynchronous
>> flips) before we can start working on the next frame. Wewere
previously
     >> only using 2 buffers for flipping if the X server supports
asynchronous
>> flips, even when we're not using asynchronous flips. Thiscould
result
>> in bad performance (the referenced bug report is anextreme case,
where
>> the inter-frame stalls were preventing the GPU fromreaching its
maximum
     >> clocks).
     >
     > Sorry for the post-push review but I don't usually pay much
attention to
> the window system code. In any case, I believe you'redoing your> counting wrong. When flipping with swapinterval=0, youneed 4
buffers:
     >
> 1. The buffer currently being scanned out (will bereleased at
next vblank)
> 2. The buffer X has queued for scanout but is waiting onvblank
     s/vblank/flip/g, since async flips may not wait for vblank, but
yeah.

     > 3. The buffer the application has just submitted which X will
queue next
     > of it doesn't get another before the window closes.
     > 4. The buffer the application is using for rendering.
     >
> With only 3, you get a stall during that window in which Xhas
queued
> another flip but we're waiting on vblank before the flipbegins.
An I
     > missing something?
Nothing, except maybe the paragraph below stating that Icouldn'tmeasure any benefit from using 4 buffers. :) I'm not exactlysure
why,
but I suspect it might be because even with just 3 buffers,the GPU
can
     always work on at least one frame ahead of time.
Also note that even before my change, we were only using 3buffers
when
     the X driver supports async flips (with swap interval 0; only 2
buffers
     with swap interval > 0).
Yes, because with async flips you don't have a buffer sittingqueued in
the kernel waiting to be flipped which you can't cancel.
Actually, there is. Even async flips take non-0 time to complete.
that makes perfect sense.
What exactly does? My change may not be perfect, but the logicbefore it
was mostly backwards.
I think perhaps the problem here is that I don't know what you mean by
"async flips". It's an X term that obviously does not mean what Ithought
it meant.
Async means immediate (or as close to it as possibly, maybe hsync
depending on the hw) not at vsync.
Exactly.
That said, I'd be interested in hearing about any test caseswhere 4
     buffers provide a significant boost over 3.
A little history that may be useful: Quadbuffering was originallyadded
for DRI3+present here:
https://cgit.freedesktop.org/mesa/mesa/commit/?id=f7a355556ef5fe23056299a77414f9ad8b5e5a1d
So the commit message claims. If you look at the code after thatchange
though, it's basically impossible to end up with 4 buffers (at least
permanently), since it would require all these conditions to betrue at
the same:

1. priv->flipping (the last Present request was completed as a flip)
2. !(priv->present_capabilities & XCB_PRESENT_CAPABILITY_ASYNC)(the X
    driver doesn't support async flips)
3. priv->swap_interval == 0

Given 2, 1 & 3 are mutually exclusive.
I'm not seeing how 1 & 3 are mutually exclusive. priv->swap_interval
doesn't seem to have anything to do with whether or not you'reflipping.
priv->swap_interval == 0 can only use flips if async flips aresupported.
This may not be always true. In particular with my XWayland Presentimplementation (yeah it's been years, perhaps time to work on it again),
you could have:
. 1 buffer on the screen
. 1 buffer scheduled for next flip kernel side
. 1 buffer old than the previous buffers, and that will get scheduledfor flip after if no new buffer arrives (and will be released if a newbuffer arrives)

I meant "newer" of course


All three kept by the Wayland compositor.
Thus you need a fourth buffer to keep rendering.

Axel



So WRT https://bugs.freedesktop.org/show_bug.cgi?id=97549, let's not
jump to any conclusions but look at how many buffers actually end up
being used for what reasons in each case. I suspect there might be some
surprises. :)

If you use one buffer for swap interval = 0 and no flipping, it meansyou wait for the xserver to handle the presentation request and send theanswer, before rendering again.

Before there wasn't this synchronization, because there were two buffers.

Could this be the cause of the slowdown ?

Axel


_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev

Re: [Mesa-dev] [PATCH] loader/dri3: Overhaul dri3_update_num_back

Reply via email to