There definitely could be a regression in Gecko. As Justin pointed out, we
had a pretty serious regression for the past few weeks that was only
resolved today, but my results were gathered after the regression
resolution.

For Flame performance automation, we test every application against each
build we get from PVT. Each application is tested 30 times, and the metrics
for every run are stored in a database which is where the dashboards get
their data. That also means we can re-query to slice the data in different
ways. The data I provided today represents the 95th percentile of
visuallyLoaded and the mean of USS. We typically don't use mean for launch
times as they are not informative of the standard deviation of results like
a percentile value can. Regardless, here is the current standard deviation
of launch times for said apps:

Calendar: 49ms
Camera: 157ms
Clock: 31ms
Contacts: 224ms
Dialer: 22ms
Email: 152ms
FM: 33ms
Gallery: 52ms
Music: 47ms
SMS: 44ms
Settings: 44ms
Video: 41ms
Test Startup Limit: 116ms

Now, I'm not a statistician in the least, but standard deviation
representing the "normal distribution", most apps exhibit a stable launch
time within about 40-50ms. The 95th percentile gives us a metric that users
can care about: "95% of the time my app loads faster than X milliseconds".
Interestingly enough, the Test Startup Limit app which we use to determine
a general launch baseline, exhibits swings of 116ms in its normal
distribution which is concerning, even though it hasn't really regressed
its performance. Gareth wrote that app and maybe can give us some insight
into what it does that other apps don't that may cause instability in
launch time but over consistently achieving its goal.

I'm happy to help provide whatever insight into this that is needed. :)

Thanks,

Eli Perelman

On Fri, Oct 2, 2015 at 3:46 PM, <[email protected]> wrote:

> And another calm app - Video. I landed an L10n/Intl refactor this morning
> with good perf results [0], but the tests were likely done before.
>
> If you look at Video changelog [1] in the recent months, there's really
> nothing between November 2014 and my change today.
>
> So, where does 200ms and 900kb regression come from? (side note - 900kb
> should not be acceptable unless significant chunk of new data has been
> added to startup path).
>
> I believe that those three apps - Video, FM and Clock should see
> performance/memory between 2.2 and 2.5 improvement. Out of them, Clock is
> the hardest to "revert", but FM and Video should be fairly easy to test the
> same gaia between two geckos and two gaias against the same gecko.
>
> And I believe we should do this before we jump to conclusions because the
> perf impact on those apps seems to be bigger than for some other apps and
> if it's all noise it's a red herring. And if it's Gecko impact, we should
> fix it in Gecko.
>
> zb.
>
>
> [0] https://bugzilla.mozilla.org/show_bug.cgi?id=1197454#c6
> [1]
> https://github.com/mozilla-b2g/gaia/commits/master/apps/video/js/video.js
> _______________________________________________
> dev-fxos mailing list
> [email protected]
> https://lists.mozilla.org/listinfo/dev-fxos
>
_______________________________________________
dev-fxos mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-fxos

Reply via email to