Hands-on session with Spark and Telemetry data at MozLandia

2014-11-28 Thread rvitillo
You all know Telemetry, our Firefox data collection system that provides us 
with real-world data about performance, hardware, usage and customizations.

Spark is an open-source in-memory data analytics cluster computing framework 
originally developed in the AMPLab at UC Berkeley. In contrast to Hadoop's 
two-stage disk-based MapReduce paradigm, Spark's in-memory primitives provide 
performance up to 100 times faster which allow to interactively run distributed 
operations from a shell and analyze millions of Telemetry submissions in a 
matter of seconds; a great fit for exploratory data analyses.

I will briefly go over the data layout of Telemetry and how Spark works under 
the hood. Finally, I will jump in a hands-on interactive analysis session with 
real data. No prerequisites are required in terms of Telemetry, Spark or 
distributed computing. This is a great opportunity to learn how to quickly 
extract actionable insight from our data.

Time: Friday, December 5, 2014, 4:00:00 PM - 5:30:00 PM GMT -05:00
Location: Belmont Room, Marriott Waterfront 2nd, Mozlandia
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: power use on Yosemite

2014-11-03 Thread rvitillo
On Saturday, November 1, 2014 7:48:11 PM UTC, Andreas Gal wrote:
> I am using Nightly on Yosemite and power use is pretty atrocious. The battery 
> menu tags Firefox Nightly as a significant battery hog, and I can confirm 
> this from the user experience perspective as well. My battery time is a 
> fraction of using Chrome for the same tasks.
> 
> Not every kind of content seems to trigger this behavior, but Google Sheets 
> in Firefox seems to be a pretty reliable way to drain my battery quickly.

Andreas, I used energia (https://github.com/mozilla/energia) to quickly compare 
the power usage of Nightly vs Chrome on Google Sheets 
(https://docs.google.com/spreadsheet/ccc?key=0ArS97F99-BEZdF81LXlRNnRJTWphQ3pZcTYxT3NSRXc#gid=0
 and other similar sheets) and even though the usage of FF is greater than 
Chrome, it's not dramatically so, i.e.:

Chrome: 2.8W +- 0.03 (confidence interval of the mean)
Nightly: 3.1 +- 0.22

The numbers comprehend CPU and GPU usage. Do you have by chance a particular 
sheet that triggers the issue which you could share? I am going to dig deeper 
first thing in the morning; there is some suspicious high variance in the 
measurements for FF.

On Monday, November 3, 2014 5:33:11 PM UTC, Martin Thomson wrote:
> On 02/11/14 18:13, Mike Hoye wrote:
> > "We found that blocking on a slow connection used consistently more
> > power than idle. This could be solely due to network traffic, but
> > network traffic was very periodic (1 packet in and 1 packet out per
> > second), thus much of the work could likely be attributed to the
> > animated spinner. The difference was not very large, but it was
> > statistically significant."
> 
> That suggests that it could be a result of us not having QUIC and Google 
> Sheets not being friendly to TCP-based usages.

We tracked down many power bugs on Desktop earlier this year using our power 
profiler, see Bug 948528, Bug 962573, 
http://robertovitillo.com/2014/01/21/a-matter-of-energy/ and 
https://github.com/mozilla/energia. Many of those bugs are intimately related 
to the layout stack and still need to be addressed. David, do you have an ETA 
for some of them, e.g. Bug 962594?

In particular Facebook, which practically appears in any top 10 list, had 
(has?) a serious power bug that caused FF to render a hidden spinning wheel. 
Because of this single bug any power benchmark performed by the press, which 
was likely going to be based on the top N most visited sites on the web, was 
likely going to be skewed significantly to our loss.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-15 Thread rvitillo
On Wednesday, August 13, 2014 4:36:39 PM UTC+1, rvit...@mozilla.com wrote:
> I requested a public list available over NNTP (see Bug 1053202, 
> mozilla.qa.telemetry-alerts). That should make everybody happy.

The list has been renamed to mozilla.dev.telemetry-alerts.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-13 Thread rvitillo
On Wednesday, August 13, 2014 8:06:08 PM UTC+1, Steve Fink wrote:
> I would sort of like a pulse notification for these, but it probably
> 
> wouldn't get enough usage yet for it to be worthwhile. (I'd feed it into
> 
> an irc bot that notifies me + maybe a channel.)
> 
> 
> 
> https://wiki.mozilla.org/Auto-tools/Projects/Pulse

Very interesting; I will keep this in mind for future extensions.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-13 Thread rvitillo
I requested a public list available over NNTP (see Bug 1053202, 
mozilla.qa.telemetry-alerts). That should make everybody happy.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-12 Thread rvitillo
>From now on all alerts will be sent by default also to 
>telemetry-ale...@mozilla.com.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Expiration versions for Telemetry probes.

2014-08-12 Thread rvitillo
We are planning to forcefully set the expiration version of many outstanding 
probes, i.e. those that don't have yet an explicitly set expiration, to version 
40.

Please have a look at bug 1045108 and comment if you are against it or if you 
want us to don't set an expiration version for your probes.

Thanks.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-12 Thread rvitillo
On Wednesday, August 6, 2014 9:12:07 PM UTC+1, Jim (Ningjie) Chen wrote:
> Looks great! Is the system specific to histograms or can it be adapted to 
> other telemetry measurements?
> 
> 
> 
> Looking at the dashboard, we have a growing list of telemetry measurements 
> outside of histograms (hangs, experiments, I/O, power usage, etc.). It'd be 
> great if they can benefit from automatic monitoring as well.

The current system is specifically tailored to histograms but we might do 
something similar for our dashboards once we complete our effort to unify the 
dataset format across all of them.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-12 Thread rvitillo
On Thursday, August 7, 2014 9:19:09 AM UTC+1, Gian-Carlo Pascutto wrote:
> Does this filter on any cpp_guards?

No, it doesn't.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-06 Thread rvitillo
On Wednesday, August 6, 2014 8:16:09 AM UTC+1, Gian-Carlo Pascutto wrote:
> On 5/08/2014 22:09, Andrew McCreight wrote:
> > How many alerts is this thing going to generate?  Could you have a
> > mailing list that just gets all of the telemetry alerts email and
> > then people could subscribe to it if they wanted?  (Technically, I
> > suppose I could make a mailing list myself and land a patch to
> > subscribe the list to every one of these...) I'm not sure I really
> > want to have my email address distributed with every copy of the
> > Firefox source code. :)
> 
> Decoupling who gets the alerts from the Firefox source sounds like a
> good idea.

Agreed, I am going to create a new catch-all mailing list. Our original intent 
was actually for authors to use an e-mail alias like in Bug 1047568 or let them 
use their own mailing lists. But we can certainly send all alerts in one place 
too. Hopefully receiving all of them is not going to make people ignore the 
alerts altogether. On the other hand though the number of notifications per 
week should be mostly under 10, so maybe this isn't an issue after all.

> 1) What if lots of people want to monitor a specific value?
> 2) What if people stop working on Firefox? They have to "patch out"
> their email?

What about creating a mailing list just for the histograms you are interested 
in?

> 3) What if the owner of a feature changes? How do the Firefox versions
> with different Histogram.json interact with the telemetry alerts?

Only the latest version of Histograms.json is taken into account. 

I am aware that there is some work involved to receive the notifications, 
unless you are OK with receiving all of them. There isn't any good way to 
discern automatically who has to receive which notification. I am open to 
suggestions.

___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Telemetry alerts for histograms

2014-08-05 Thread rvitillo
The dashboard of the alerting system is available at 
http://vitillo.github.io/cerberus/dashboard/.

If you want to be notified when the distribution of your histogram changes 
significantly, please add your e-mail address to Histograms.json as documented 
in 
https://developer.mozilla.org/en-US/docs/Mozilla/Performance/Adding_a_new_Telemetry_probe.

Thanks,
Roberto
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: Deprecating localstore.rdf

2014-08-05 Thread rvitillo
On Monday, August 4, 2014 8:55:35 PM UTC+1, Benjamin Smedberg wrote:

> This does involve a one-time import of localstore data into the new 
> format, correct?

Correct.

> 
> I'm happy that we are doing this. I *believe* that this may be the last 
> client of the RDF code in Firefox, which may allow us to remove RDF from 
> Firefox in a future release. Do you already have an addon validation 
> warning about addons using localstore?

What do you mean exactly by addon validation warning? In an earlier version of 
the patch there was a warning for any user of the old localstore API but Neil 
suggested to remove it.

___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Telemetry alerts for histograms

2014-07-23 Thread rvitillo
We, as in the performance team, are currently working on an automatic alerting 
system for Telemetry's data. At the moment we can detect sensible changes in 
the distributions of the collected histograms and we would like to notify the 
authors of the histograms with an e-mail. 

The e-mail will include the name of the histogram, the date of the regression 
and a plot that shows two histograms: the reference and the regression 
distribution. The regression histogram shows the distribution of the histogram 
taken the day it changed, while the reference histogram shows how the 
distribution should look like based on the previous days. In the future the 
e-mail will include a link to Telemetry's dashboard which will likely show some 
more information.

Since using 'hg blame' or a similar approach to pinpoint the author of the 
histogram doesn't always yield the right match (see Bug 1037494), we are likely 
going to ask the interested authors to manually add their e-mail address to 
toolkit/components/telemetry/Histograms.json.

The question to the histogram authors now is if receiving alerts through 
e-mails is OK or they would prefer something else. Note that a great deal of 
care has been taken to ensure that there the number of false positives is very 
close to 0, i.e. if you receive an alert then the distribution has clearly 
changed and the effect size is big enough that it can be considered more than 
significant. So don't expect your mailbox to be flooded with useless alerts.

Roberto
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Deprecating localstore.rdf

2014-07-23 Thread rvitillo
Localstore.rdf will soon be replaced with a json store (see Bug 559505). I am 
currently planning to leave the localstore.rdf implementation as it is and 
issue a warning when a client tries to access to it. This is needed as some 
add-ons seem still to rely on it. We could use some Telemetry probes to see 
effectively how many add-ons are still using the rdf store once the patch lands.

Are there any objections or remarks to the deprecation of localstore.rdf?

Roberto
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform