Who's responsible for looking into the test/regression? Bas? Does the
person looking into it need help from the performance or desktop teams?

What bug is tracking that work?

Gavin


On Wed, May 28, 2014 at 12:12 PM, Vladimir Vukicevic
<vladi...@mozilla.com>wrote:

> (Note: I have not looked into the details of CART/TART and their
> interaction with OMTC)
>
> It's entirely possible that (b) is true *now* -- the test may have been
> good and proper for the previous environment, but now the environment
> characteristics were changed such that the test needs tweaks.  Empirically,
> I have not seen any regressions on any of my Windows machines (which is
> basically all of them); things like tab animations and the like have
> started feeling smoother even after a long-running browser session with
> many tabs.  I realize this is not the same as cold hard numbers, but it
> does suggest to me that we need to take another look at the tests now.
>
>     - Vlad
>
> ----- Original Message -----
> > From: "Gijs Kruitbosch" <gijskruitbo...@gmail.com>
> > To: "Bas Schouten" <bschou...@mozilla.com>, "Gavin Sharp" <
> ga...@gavinsharp.com>
> > Cc: dev-tech-...@lists.mozilla.org, "mozilla.dev.platform group" <
> dev-platform@lists.mozilla.org>, "release-drivers"
> > <release-driv...@mozilla.org>
> > Sent: Thursday, May 22, 2014 4:46:29 AM
> > Subject: Re: OMTC on Windows
> >
> > Looking on from m.d.tree-management, on Fx-Team, the merge from this
> > change caused a >40% CART regression, too, which wasn't listed in the
> > original email. Was this unforeseeen, and if not, why was this
> > considered acceptable?
> >
> > As gavin noted, considering how hard we fought for 2% improvements (one
> > of the Australis folks said yesterday "1% was like Christmas!") despite
> > our reasons of why things were really OK because of some of the same
> > reasons you gave (e.g. running in ASAP mode isn't realistic, "TART is
> > complicated", ...), this hurts - it makes it seem like (a) our
> > (sometimes extremely hacky) work was done for no good reason, or (b) the
> > test is fundamentally flawed and we're better off without it, or (c)
> > when the gfx team decides it's OK to regress it, it's fine, but not when
> > it happens to other people, quite irrespective of reasons given.
> >
> > All/any of those being true would give me the sad feelings. Certainly it
> > feels to me like (b) is true if this is really meant to be a net
> > perceived improvement despite causing a 40% performance regression in
> > our automated tests.
> >
> > ~ Gijs
> >
> > On 18/05/2014 19:47, Bas Schouten wrote:
> > > Hi Gavin,
> > >
> > > There have been several e-mails on different lists, and some
> communication
> > > on some bugs. Sadly the story is at this point not anywhere in a
> condensed
> > > form, but I will try to highlight a couple of core points, some of
> these
> > > will be updated further as the investigation continues. The official
> bug
> > > is bug 946567 but the numbers and the discussion there are far outdated
> > > (there's no 400% regression ;)):
> > >
> > > - What OMTC does to tart scores differs wildly per machine, on some
> > > machines we saw up to 10% improvements, on others up to 20%
> regressions.
> > > There also seems to be somewhat more of a regression on Win7 than
> there is
> > > on Win8. What the average is for our users is very hard to say,
> frankly I
> > > have no idea.
> > > - One core cause of the regression is that we're now dealing with two
> D3D
> > > devices when using Direct2D since we're doing D2D drawing on one
> thread,
> > > and D3D11 composition on the other. This means we have DXGI locking
> > > overhead to synchronize the two. This is unavoidable.
> > > - Another cause is that we're now having two surfaces in order to do
> double
> > > buffering, this means we need to initialize more resources when new
> layers
> > > come into play. This again, is unavoidable.
> > > - Yet another cause is that for some tests we composite 'ASAP' to get
> > > interesting numbers, but this causes some contention scenario's which
> are
> > > less likely to occur in real-life usage. Since the double buffer might
> > > copy the area validated in the last frame from the front buffer to the
> > > backbuffer in order to prevent having to redraw much more. If the
> > > compositor is compositing all the time this can block the main thread's
> > > rasterization. I have some ideas on how to improve this, but I don't
> know
> > > how much they'll help TART, in any case, some cost here will be
> > > unavoidable as a natural additional consequence of double buffering.
> > > - The TART number story is complicated, sometimes it's hard to know
> what
> > > exactly they do, and don't measure (which might be different with and
> > > without OMTC) and how that affects practical performance. I've been
> told
> > > this by Avi and it matches my practical experience with the numbers. I
> > > don't know the exact reasons and Avi is probably a better person to
> talk
> > > about this than I am :-).
> > >
> > > These are the core reasons that we were able to identify from
> profiling.
> > > Other than that the things I said in my previous e-mail still apply. We
> > > believe we're offering significant UX improvements with async video and
> > > are enabling more significant improvements in the future. Once we've
> fixed
> > > the obvious problems we will continue to see if there's something that
> can
> > > be done, either through tiling or through other improvements,
> particularly
> > > in the last point I mentioned there might be some, not 'too' complex
> > > things we can do to offer some small improvement.
> > >
> > > If we want to have a more detailed discussion we should probably pick a
> > > list to have this on and try not to spam people too much :-).
> > >
> > > Bas
> > >
> > > ----- Original Message -----
> > > From: "Gavin Sharp" <ga...@gavinsharp.com>
> > > To: "Bas Schouten" <bschou...@mozilla.com>
> > > Cc: "dev-tree-management" <dev-tree-managem...@lists.mozilla.org>,
> > > dev-tech-...@lists.mozilla.org, "release-drivers"
> > > <release-driv...@mozilla.org>, "mozilla.dev.platform group"
> > > <dev-platform@lists.mozilla.org>
> > > Sent: Sunday, May 18, 2014 6:23:58 PM
> > > Subject: Re: OMTC on Windows
> > >
> > >> but tart will regress by ~20%, and several other suites will regress
> as
> > >> well.
> > >> We've investigated this extensively and we believe the majority of
> these
> > >> regressions are due to the nature of OMTC and the fact that we have
> to do
> > >> more work.
> > >
> > > Where can I read more about the TART investigations? I'd like to
> > > understand why it is seen as inevitable, and get some of the details
> > > of the regression. OMTC is important, and I'm excited to see it land
> > > on Windows, but the Firefox and Performance teams have just come off a
> > > months-long effort to make significant wins in TART, and the thought
> > > of taking a 20% regression (huge compared to some of the improvements
> > > we fought for) is pretty disheartening.
> > >
> > > Gavin
> > >
> > > On Sun, May 18, 2014 at 12:16 AM, Bas Schouten <bschou...@mozilla.com>
> > > wrote:
> > >> Hey all,
> > >>
> > >> After quite a lot of waiting we've switched on OMTC on Windows by
> default
> > >> today (bug 899785). This is a great move towards moving all our
> platforms
> > >> onto OMTC (only linux is left now), and will allow us to remove a lot
> of
> > >> code that we've currently been duplicating. Furthermore it puts us on
> > >> track for enabling other features on desktop like APZ, off main thread
> > >> animations and other improvements.
> > >>
> > >> Having said that we realize that what we've currently landed and
> turned on
> > >> is not completely bug free. There's several bugs still open (some more
> > >> serious than others) which we will be addressing in the coming weeks,
> > >> hopefully before the merge to Aurora. The main reason we've switched
> it
> > >> on now is that we want to get as much data as possible from the
> nightly
> > >> channel and our nightly user base before the aurora merge, as well as
> > >> wanting to prevent any new regressions from creeping in while we fix
> the
> > >> remaining problems. This was extensively discussed both internally in
> the
> > >> graphics team and externally with other people and we believe we're
> at a
> > >> point now where things are sufficiently stabilized for our nightly
> > >> audience. OMTC is enabled and disabled with a single pref so if
> > >> unforeseen, serious consequences occur we can disable it quickly at
> any
> > >> stage. We will inevitably find new bugs in the coming weeks, please
> link
> > >> any bugs you happen to come across to bug 899785, if anything
> >   se
> > >>   ems very serious, please let us know, we'll attempt to come up with
> a
> > >>   solution on the short-term rather than disabling OMTC and reducing
> the
> > >>   amount of feedback we get.
> > >>
> > >> There's also some important notes to make on performance, which we
> expect
> > >> to be reported by our automated systems:
> > >>
> > >> - Bug 1000640 is about WebGL. Currently OMTC regresses WebGL
> performance
> > >> considerably, patches to fix this are underway and this should be
> fixed
> > >> on the very short term.
> > >>
> > >> - Several of the Talos test suite numbers will change considerably
> > >> (especially with Direct2D enabled), this means Tscroll for example
> will
> > >> improve by ~25%, but tart will regress by ~20%, and several other
> suites
> > >> will regress as well. We've investigated this extensively and we
> believe
> > >> the majority of these regressions are due to the nature of OMTC and
> the
> > >> fact that we have to do more work. We see no value in holding off OMTC
> > >> because of these regressions as we'll have to go there anyway. Once
> the
> > >> last correctness and stability problems are all solved we will go
> back to
> > >> trying to find ways to get back some of the performance regressions.
> > >> We're also planning to move to a system more like tiling in desktop,
> > >> which will change the performance characteristics significantly
> again, so
> > >> we don't want to sink too much time into optimizing the current
> > >> situation.
> > >>
> > >> - Memory numbers will increase somewhat, this is unavoidable, there's
> > >> several steps which have to be taken when doing off main thread
> > >> compositing (like double-buffering), which inherently use more memory.
> > >>
> > >> - On a brighter note: Async video is also enabled by these patches.
> This
> > >> means that when the main thread is busy churning JavaScript, instead
> of
> > >> stuttering your video should now happily continue playing!
> > >>
> > >> - Also there's some indications that there's a subjective increase in
> > >> scrolling performance as well.
> > >>
> > >>
> > >> If you have any questions please feel free to reach out to myself or
> other
> > >> members of the graphics team!
> > >>
> > >>
> > >> Bas
> > >> _______________________________________________
> > >> dev-platform mailing list
> > >> dev-platform@lists.mozilla.org
> > >> https://lists.mozilla.org/listinfo/dev-platform
> >
> > _______________________________________________
> > dev-tech-gfx mailing list
> > dev-tech-...@lists.mozilla.org
> > https://lists.mozilla.org/listinfo/dev-tech-gfx
> >
>
_______________________________________________
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform

Reply via email to