Re: [Analytics] [Multimedia] Using EventLogging for funnel analysis

2014-05-21 Thread Christian Aistleitner
Hi Luis,
On Fri, May 16, 2014 at 01:44:12PM -0700, Luis Villa wrote:
> When we evaluated the last spec draft (Jan/Feb?) "do not track" in the specification quite clearly and explicitly meant "do not allow tracking by *third parties*". So the tracking we do internally is permissible, whether

Re: [Analytics] [WikimediaMobile] Notes on mobile instrumentation needs

2014-05-21 Thread Dan Garry
Thanks for the summary, Dario. I've placed a card in our backlog for us to implement the app-specific tag. We definitely want this before release!
Thanks,
Dan
On 20 May 2014 21:42, Dario Taraborelli wrote:
> This is a summary of my discussion with Maryana on mobile instrumentation needs:

Re: [Analytics] Monthly research & data showcase livestreamed today

2014-05-21 Thread Dario Taraborelli
The livestream link is http://youtu.be/AUupsnvV1oA
On Wed, May 21, 2014 at 7:50 AM, Dario Taraborelli <dtarabore...@wikimedia.org> wrote:
> The next Research & Data showcase will be live-streamed *today Wed 5/21 at 11.

Re: [Analytics] Analytics maintainers

2014-05-21 Thread Kevin Leduc
I don't get it... am I going to hell for requesting deletion of an article :-)
On Tue, May 20, 2014 at 3:38 PM, Ori Livneh wrote:
> On Tue, May 20, 2014 at 3:10 PM, Federico Leva (Nemo) wrote:
>> Obligatory Florence reading: https://meta.wikimedia.org/wiki/Keep_history
> Obligatory F

[Analytics] Monthly research & data showcase livestreamed today

2014-05-21 Thread Dario Taraborelli
The next Research & Data showcase will be live-streamed today Wed 5/21 at 11.30 PT. The streaming link will be posted on the lists a few minutes before the showcase starts and as usual you can join the conversation on IRC at #wikimedia-research. We look forward to seeing you! Dario This mon

Re: [Analytics] purging old data from eventlogging db

2014-05-21 Thread Dario Taraborelli
> The motivation behind your proposal is (I think) a desire to have a unified configuration interface for data collection jobs. This makes total sense and it's worth pursuing. I just don't think we should stuff everything into the schema. The schema is just that: a schema. It's a data mode

Re: [Analytics] [Multimedia] Media Viewer Dashboards

2014-05-21 Thread Nuria Ruiz
>> [gerco] From action events, we were getting about 15M a day, and we only use them to show total counts (daily number of clicks etc). How do we tell when the sampling ratio is right for that?
> [gilles] I think you're overthinking it, you seem to be looking for the perfect figure. Let's start

Re: [Analytics] [Multimedia] EventLogging ballooning

2014-05-21 Thread Nuria Ruiz
>> [gerco] We also want to display global average loading time, which is an average of all the logged loading times (which, per above, use different sampling).
> [gilles] Having every graph and metric possible isn't necessarily a useful goal. Specific graphs are only worth having if t
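One way to read the concern about averaging loading times logged at different sampling rates: a plain mean over all logged events over-represents whichever group was sampled most heavily. The sketch below, in Python, weights each group by its sampling factor before averaging. It is only an illustration under stated assumptions: the function name `global_average`, the `(sum, count, rate)` tuple layout, and the "1 in `rate`" convention are hypothetical and are not taken from EventLogging or the Media Viewer dashboards.

```python
def global_average(groups):
    """Estimate the overall mean from groups logged at different sampling rates.

    `groups` is an iterable of (sum_of_values, logged_count, rate) tuples,
    where `rate` means "1 in `rate` events was logged" (hypothetical layout).
    Each logged event stands in for `rate` real events, so both the sum and
    the count are scaled by `rate` before dividing.
    """
    weighted_sum = 0.0
    weighted_count = 0.0
    for value_sum, count, rate in groups:
        if rate < 1:
            raise ValueError("rate must be >= 1")
        weighted_sum += value_sum * rate
        weighted_count += count * rate
    if weighted_count == 0:
        raise ValueError("no events to average")
    return weighted_sum / weighted_count


# Example: one unsampled group (rate 1) and one logged at 1 in 100.
groups = [(100.0, 10, 1), (40.0, 2, 100)]
naive = (100.0 + 40.0) / (10 + 2)      # ignores sampling rates
weighted = global_average(groups)      # scales each group back up
```

In this example the naive mean is dominated by the unsampled group, while the weighted estimate lets the 1-in-100 group count for the 200 events it represents.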

Re: [Analytics] purging old data from eventlogging db

2014-05-21 Thread Nuria Ruiz
> Not to hijack the thread, but: to do this in the schema itself confuses the structure of the data with the mechanics of its use. I think having a couple of helpers in JavaScript and PHP for simple random sampling is sufficient.
Much agree with Ori here. We would be bloating schema with prop
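A helper for simple random sampling, of the kind the quoted message has in mind, can be very small. The sketch below is in Python for illustration rather than the JavaScript or PHP the thread refers to; the name `in_sample` and the "1 in `rate`" parameter are hypothetical and not part of EventLogging.

```python
import random


def in_sample(rate, rng=random):
    """Return True for roughly 1 in `rate` calls (simple random sampling).

    `rng` is any object with a random() method returning a float in [0, 1);
    passing a seeded random.Random makes the decision reproducible.
    """
    if rate < 1:
        raise ValueError("rate must be >= 1")
    return rng.random() < 1.0 / rate


# Example: decide whether to log an event at a 1-in-1000 rate.
if in_sample(1000):
    pass  # the caller would log the event here
```

Keeping the sampling decision in a helper like this, rather than in the schema, matches the separation argued for above: the schema describes the data, and the caller decides how often to record it.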

Re: [Analytics] [Multimedia] Media Viewer Dashboards

2014-05-21 Thread Gilles Dubuc
> There is a big spike every weekend in the unsampled logs as well, so the numbers jumping around between Friday and now is not necessarily a sampling artifact.
Look at the figures closely, they're ridiculous. French wikipedia image views that have been very stable lately supposedly double

Re: [Analytics] [Multimedia] EventLogging ballooning

2014-05-21 Thread Gilles Dubuc
> The duration log shows
I think you're focusing too much on the duration log which isn't graphed yet. Implementing graphs for that data has been constantly postponed in our cycle planning because it's been considered lower priority than the rest. We can focus on challenges specific to that da

Re: [Analytics] purging old data from eventlogging db

2014-05-21 Thread Ori Livneh
On Tue, May 20, 2014 at 10:36 PM, Dario Taraborelli <dtarabore...@wikimedia.org> wrote:
> On May 20, 2014, at 10:09 PM, Sean Pringle wrote:
>> Hi!
>> I'd like to hear from stakeholders about purging old data from the eventlogging database. Yes, no, why [not], etc.
>> I understand from Ori t