[Analytics] [Release] Hive wmf.webrequest new fields

2015-04-01 Thread Joseph Allemandou
Hi Analytics people, Three new fields are available in the hive table wmf.webrequest (as per the documentation): user_agent_map map User-agent map with browser_name, browser_major, device, os_name, os_minor, os_major keys and associated values x_analytics_map map X_anal

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-09 Thread Oliver Keyes
1%...of the browsers that made it through the minimum request count filter ;). But crawler-traffic overall is actually ~50% of US desktop traffic, for scale. We get a lot of hits (not so much from Google, who crawl in a smart way, as Bing, who crawl in a very dumb way) On 9 March 2015 at 22:54, Ti

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-09 Thread Timo Tijhof
Wow, does Googlebot really represent over 1% of our desktop/reader traffic? Rather interesting compared to that of e.g. WinXP/IE6, which is over 60x smaller at 0.016%. But never mind IE6's percentage, that of Google would seem quite high. — Timo On 6 Mar 2015, at 01:02, Oliver Keyes wrote: >

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-07 Thread Nuria
Amir, I think you need to re-read Oliver's e-mail as this data is not going to be updated regularly. He is explained why as part of this thread. Thanks, Nuria > On Mar 6, 2015, at 10:40 AM, Amir E. Aharoni > wrote: > > Very neat! This pretty much resolves https://phabricator.wikimedia.or

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-06 Thread Oliver Keyes
As said: >"Will this be run regularly?" >Not as of this moment. At least, not by me. This is an ad-hoc report >in response to an ad-hoc request. > >"Who do I go to if I want that to change?" >Analytics Engineering has this task on their backlog already. On 6 March 2015 at 13:40, Amir E. Aharoni

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-06 Thread Amir E. Aharoni
Very neat! This pretty much resolves https://phabricator.wikimedia.org/T78539 . I join Andrew's request about percents in addition to (or instead of) absolute numbers, and I hope that it will be updated, say, once a month. Thanks! -- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://ahar

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-06 Thread Andrew Gray
Awesome! Thinking about this a little more, presumably the 500 cutoff means there's a long, long tail of smaller combinations - so could those percentages be of the absolute total, and could the absolute total + some idea of how much is "other" be listed in the outline? Thanks, Andrew. On 6 Mar

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-06 Thread Oliver Keyes
Indeed and indeed! Working those into the first changeset :) On 6 March 2015 at 07:52, Andrew Gray wrote: > Hi Oliver, > > This is really neat, but the absolute user numbers are a bit > confusing. Is there any chance it could also display pageviews as an > approximate percentage? It's clear that,

Re: [Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-06 Thread Andrew Gray
Hi Oliver, This is really neat, but the absolute user numbers are a bit confusing. Is there any chance it could also display pageviews as an approximate percentage? It's clear that, say, 2.1m Chrome 40 on Windows 7 is a lot more than 517 Chrome 40 on Linux, but because there's no total given it's

[Analytics] [Release] explore the (parsed) (common) Wikimedia user agents

2015-03-05 Thread Oliver Keyes
Hey all, A perennial request from WMF engineers/product people, as well as third-party developers, is an idea of what browsers people are using so we know what we have to support on the frontend side of things. With Legal/Analytics signoff and +2ing, I've built an exploratory tool at http://datav

Re: [Analytics] [Release]

2015-02-25 Thread Pine W
Excellent! Pine On Feb 25, 2015 1:26 PM, "Oliver Keyes" wrote: > Totally! I'm also going to get together with some NEU hackers tomorrow > and work on actually visualising the data on *drumroll* maps, which'd > probably be more interesting eye candy than infinite bar plots :) > > On 25 February 2

Re: [Analytics] [Release]

2015-02-25 Thread Oliver Keyes
Totally! I'm also going to get together with some NEU hackers tomorrow and work on actually visualising the data on *drumroll* maps, which'd probably be more interesting eye candy than infinite bar plots :) On 25 February 2015 at 16:19, Pine W wrote: > Very nice. Do you think that you could pick

Re: [Analytics] [Release]

2015-02-25 Thread Pine W
Very nice. Do you think that you could pick out a few of your favorite graphs and add them to this week's Recent Research report in a gallery? Thanks! Pine Hey all! We've released a highly-aggregated dataset of readership data - specifically, data about where, geographically, traffic to each of o

[Analytics] [Release]

2015-02-25 Thread Oliver Keyes
Hey all! We've released a highly-aggregated dataset of readership data - specifically, data about where, geographically, traffic to each of our projects (and all of our projects) comes from. The data can be found at http://dx.doi.org/10.6084/m9.figshare.1317408 - additionally, I've put together an