Re: [Analytics] [Discussion] User agent data releases

2015-03-05 Thread Dario Taraborelli
Heads up that after a review with Legal we decided that we should not release the sampled raw dataset. Oliver is now working on making parsed UA data available. On Mar 5, 2015, at 10:52 AM, Oliver Keyes oke...@wikimedia.org wrote: Just a clarifying note: Dario still needs to review the

Re: [Analytics] [Discussion] User agent data releases

2015-03-05 Thread Oliver Keyes
Just a clarifying note: Dario still needs to review the actual methodology. While Legal have approved it from their end, they've also made clear that this is contingent on the anonymisation methodology passing muster from an R&D point of view. On 5 March 2015 at 12:39, Oliver Keyes

Re: [Analytics] [Discussion] User agent data releases

2015-03-04 Thread Oliver Keyes
So it's distinct people, globally - and I deliberately made it woolly by operating over username, which means the threshold is fuzzy (i.e., at a minimum it's 50. At a maximum it's 50 x [number of wikis]). It's very deliberately dimension-free: user_agent,
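
For readers following the thread, the thresholding described above can be sketched roughly as follows. This is only one reading of the message, not the actual release code: the `(user_agent, username)` row layout, the function name `releasable_user_agents`, and the choice to report counts are all assumptions made for illustration. Only the threshold of 50 and the idea of counting distinct usernames globally come from the thread.

```python
from collections import defaultdict

# Threshold mentioned in the thread; everything else here is hypothetical.
MIN_DISTINCT_USERNAMES = 50


def releasable_user_agents(rows, threshold=MIN_DISTINCT_USERNAMES):
    """Return {user_agent: distinct_username_count} for agents at or above the threshold.

    `rows` is an iterable of (user_agent, username) pairs pooled across all
    wikis (an assumed layout). No wiki, country, or other dimension is kept,
    so the result is "dimension-free" in the sense used in the thread.
    """
    usernames_per_agent = defaultdict(set)
    for user_agent, username in rows:
        # A set deduplicates repeated requests from the same username.
        usernames_per_agent[user_agent].add(username)

    return {
        user_agent: len(usernames)
        for user_agent, usernames in usernames_per_agent.items()
        if len(usernames) >= threshold
    }
```

Counting usernames rather than verified individuals is what keeps the effective threshold approximate, as the message notes.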

Re: [Analytics] [Discussion] User agent data releases

2015-03-03 Thread Nuria Ruiz
Erik has asked me to write an exploratory app for user-agent data. The idea is to enable Product Managers and engineers to easily explore what users use so they know what to support. I've thrown up an example screenshot at http://ironholds.org/agents_example_screen.png I cannot speak as to the

Re: [Analytics] [Discussion] User agent data releases

2015-03-03 Thread Oliver Keyes
On 3 March 2015 at 19:35, Nuria Ruiz nu...@wikimedia.org wrote: Erik has asked me to write an exploratory app for user-agent data. The idea is to enable Product Managers and engineers to easily explore what users use so they know what to support. I've thrown up an example screenshot at