>Seems like using an unknown useragent string might be a better proxy.
We already do bot filtering using user agent strings. Please see:
https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-core/src/main/java/org/wikimedia/analytics/refinery/core/Webrequest.java#L56

On Wed, Oct 21, 2015 at 12:58 PM, Ryan Kaldari <rkald...@wikimedia.org>
wrote:

> I was under the impression that most of the MediaWiki bot frameworks do
> accept cookies, but I imagine many of the home-made bots don't. Seems like
> using an unknown useragent string might be a better proxy.
>
> On Wed, Oct 21, 2015 at 1:49 PM, Nuria Ruiz <nu...@wikimedia.org> wrote:
>
>> >What was the motivation for this change? Just looking for possible
>> automata?
>> Right.The motivation was to see if the absence of cookies works as a
>> cheap proxy to identify robots. It is a pretty easy change to make that
>> might help us quite a bit, we shall update the list when we have some data.
>>
>> On Wed, Oct 21, 2015 at 12:47 PM, Oliver Keyes <oke...@wikimedia.org>
>> wrote:
>>
>>> What was the motivation for this change? Just looking for possible
>>> automata?
>>>
>>> On 21 October 2015 at 15:38, Nuria Ruiz <nu...@wikimedia.org> wrote:
>>> > Team:
>>> >
>>> > As of today incoming request data includes an extra bit of information
>>> on
>>> > the X-analytics header.
>>> >
>>> > If an incoming request to any wikipedia project had no cookies
>>> whatsoever it
>>> > will be tagged with nocookie=1. A requests without any cookies could
>>> > correspond to a fresh browser session, a user browsing with cookies
>>> disabled
>>> > or, most likely, a bot request as most bots will not accept cookies. We
>>> > *might* be able to use this setting as a cheap proxy to quantify bot
>>> > traffic.
>>> >
>>> >
>>> > Documentation about this change can be found here:
>>> > https://wikitech.wikimedia.org/wiki/X-Analytics
>>> >
>>> >
>>> > Thanks,
>>> >
>>> > Nuria
>>> >
>>> >
>>> >
>>> > _______________________________________________
>>> > Analytics mailing list
>>> > Analytics@lists.wikimedia.org
>>> > https://lists.wikimedia.org/mailman/listinfo/analytics
>>> >
>>>
>>>
>>>
>>> --
>>> Oliver Keyes
>>> Count Logula
>>> Wikimedia Foundation
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> Analytics@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to