[Analytics] noccokie tag on X-analytics

2015-10-21 Thread Nuria Ruiz
Team: As of today incoming request data includes an extra bit of information on the X-analytics header. If an incoming request to any wikipedia project had no cookies whatsoever it will be tagged with nocookie=1. A requests without any cookies could correspond to a fresh browser session, a user b

Re: [Analytics] noccokie tag on X-analytics

2015-10-21 Thread Oliver Keyes
What was the motivation for this change? Just looking for possible automata? On 21 October 2015 at 15:38, Nuria Ruiz wrote: > Team: > > As of today incoming request data includes an extra bit of information on > the X-analytics header. > > If an incoming request to any wikipedia project had no co

Re: [Analytics] noccokie tag on X-analytics

2015-10-21 Thread Nuria Ruiz
>What was the motivation for this change? Just looking for possible automata? Right.The motivation was to see if the absence of cookies works as a cheap proxy to identify robots. It is a pretty easy change to make that might help us quite a bit, we shall update the list when we have some data. On

Re: [Analytics] noccokie tag on X-analytics

2015-10-21 Thread Oliver Keyes
Awesome; cool! On 21 October 2015 at 15:49, Nuria Ruiz wrote: >>What was the motivation for this change? Just looking for possible >> automata? > Right.The motivation was to see if the absence of cookies works as a cheap > proxy to identify robots. It is a pretty easy change to make that might he

Re: [Analytics] noccokie tag on X-analytics

2015-10-21 Thread Ryan Kaldari
I was under the impression that most of the MediaWiki bot frameworks do accept cookies, but I imagine many of the home-made bots don't. Seems like using an unknown useragent string might be a better proxy. On Wed, Oct 21, 2015 at 1:49 PM, Nuria Ruiz wrote: > >What was the motivation for this cha

Re: [Analytics] noccokie tag on X-analytics

2015-10-21 Thread Nuria Ruiz
>Seems like using an unknown useragent string might be a better proxy. We already do bot filtering using user agent strings. Please see: https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-core/src/main/java/org/wikimedia/analytics/refinery/core/Webrequest.java#L56 On Wed,

[Analytics] Code of Conduct and publication of private non-harassing communication

2015-10-21 Thread Matthew Flaschen
Quim has proposed an alternative wording for the text about republication of private communication. You can comment at https://www.mediawiki.org/wiki/Talk:Code_of_Conduct/Draft#New_proposal.2C_welcomes_feedback or to conduct-discuss...@wikimedia.org . Thanks as always, Matt Flaschen __

[Analytics] Please help finish "Report a problem" section of CoC (+updates)

2015-10-21 Thread Matthew Flaschen
We are now working on the "Report a problem" section of the draft Code of conduct: * Section: https://www.mediawiki.org/wiki/Code_of_Conduct/Draft#Report_a_problem * Talk: https://www.mediawiki.org/wiki/Talk:Code_of_Conduct/Draft#Finishing_the_.22Report_a_problem.22_section * Alternatively