Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-16 Thread Dan Andreescu
; Behalf Of *Aaron Halfaker > *Sent:* Tuesday, February 16, 2016 18:11 > *To:* A mailing list for the Analytics Team at WMF and everybody who has > an interest in Wikipedia and analytics. > *Subject:* Re: [Analytics] [Pageviews] [Technical] Simplifying the > available static dum

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-16 Thread Erik Zachte
: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data dumps.wikimedia.org/analytics Does "analytics" mean anything in th

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-16 Thread Aaron Halfaker
> > dumps.wikimedia.org/analytics Does "analytics" mean anything in this context? Why not aim for something like dumps.wikimedia.org/views? -Aaron On Thu, Feb 11, 2016 at 9:39 AM, Oliver Keyes wrote: > It's also the International Day of Women and Girls in Science! > > Sounds like a good summ

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-11 Thread Oliver Keyes
It's also the International Day of Women and Girls in Science! Sounds like a good summary. On 11 February 2016 at 07:31, Dan Andreescu wrote: > I almost revived this thread on Mardi Gras, but I didn't want to be known as > The Holiday Crusher so I waited. Today is relatively safe [1] :) > > Ok,

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-11 Thread Dan Andreescu
I almost revived this thread on Mardi Gras, but I didn't want to be known as The Holiday Crusher so I waited. Today is relatively safe [1] :) Ok, there are three main points being made: 1. deprecating the old datasets 2. liberating ourselves from the old format 3. reorganizing the dumps page My

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-01-06 Thread Dario Taraborelli
Erik's proposal sounds very reasonable. There might be some confusion about what we mean by "keeping the old datasets for longitudinal analysis". No one is planning to remove the old static dumps, just stop generating them/maintaining them going forward. I also want to echo Nuria regarding the hu

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-01-06 Thread Nuria Ruiz
>As I just mentioned to Dan in a private email conversation, keeping datasets even with imperfect measurements is important. Particularly for longitudinal analysis. Have in mind that maintaining these old dumps is not "free", it causes a lot of confusion and maintenance costs to have several pagevi

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Oliver Keyes
r the Analytics Team at WMF and everybody who has an > interest in Wikipedia and analytics. > Subject: Re: [Analytics] [Pageviews] [Technical] Simplifying the available > static dumps of pageview data > > > > Apologies! I realized it was Christmas Eve but I by no means m

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Erik Zachte
a and analytics. Subject: Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data Apologies! I realized it was Christmas Eve but I by no means meant to rush this conversation. Take as long as you like to answer to the thread and enjoy your holidays eve

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Dan Andreescu
> > Erik > > > > > > > > *From:* Analytics [mailto:analytics-boun...@lists.wikimedia.org] *On > Behalf Of *Maurice Vergeer > *Sent:* Thursday, December 24, 2015 15:12 > *To:* A mailing list for the Analytics Team at WMF and everybody who has > an interest in Wi

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Erik Zachte
A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data Dear all, As I just mentioned to Dan in a private email conversation, keepin

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Maurice Vergeer
Dear all, As I just mentioned to Dan in a private email conversation, keeping datasets even with imperfect measurements is important. Particularly for longitudinal analysis. Also, from what I understand - me being a newby here - is that the data are stored in separate files. Dan suggested reorder

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Alex Druk
Nothing against this approach! On Thu, Dec 24, 2015 at 2:55 PM, Dan Andreescu wrote: > > > On Thu, Dec 24, 2015 at 8:48 AM, Alex Druk wrote: > >> Hi Dan, >> Happy holidays! >> Good idea to combine these datasets! However we have one more dataset by >> Erik Zachte : http://dumps.wikimedia.org/ot

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Dan Andreescu
On Thu, Dec 24, 2015 at 8:48 AM, Alex Druk wrote: > Hi Dan, > Happy holidays! > Good idea to combine these datasets! However we have one more dataset by > Erik Zachte : http://dumps.wikimedia.org/other/pagecounts-ez/ > And that's an important one! But I was thinking we could re-organize the pag

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Alex Druk
Hi Dan, Happy holidays! Good idea to combine these datasets! However we have one more dataset by Erik Zachte : http://dumps.wikimedia.org/other/pagecounts-ez/ On Thu, Dec 24, 2015 at 2:41 PM, Dan Andreescu wrote: > I should have started this discussion a while ago, but it's easier to > catch up

[Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Dan Andreescu
I should have started this discussion a while ago, but it's easier to catch up on work during vacation :) We currently have 3 available static file dumps of pageview data. I will explain them here and explain my thoughts on simplifying the situation. Feel free to turn this thread into a wiki. *