Just a quick note to clarify that this change only filters out bots whose
requests carry a user agent string that identifies them as such. You can
track our tasks related to identifying nonevident bots in Phabricator task
T138207 <https://phabricator.wikimedia.org/T138207>.

Thanks!

On Wed, May 24, 2017 at 6:35 PM, Jon Katz <jk...@wikimedia.org> wrote:

> Nice change!  Thanks.
>
> On Wed, May 24, 2017 at 8:05 AM, Tilman Bayer <tba...@wikimedia.org>
> wrote:
>
>> Thanks Francisco!
>> To express it from the perspective of users of this data: The results of
>> your EventlLogging queries may change slightly, but for the better,
>> improving accuracy. (In the past e.g. GoogleBot has shown up in schemas for
>> mobile web and the Android Wikipedia app.)
>>
>> On Wed, May 24, 2017 at 4:54 AM, Francisco Dans <fd...@wikimedia.org>
>> wrote:
>>
>>> Hi all,
>>>
>>> Today we'll be deploying a change that affects how events triggered by
>>> bots/spiders are stored. We have added a property to the user agent map in
>>> the event capsule called *is_bot, *which we use to prevent them from
>>> being persisted in MySQL, and store them only in Hadoop.
>>>
>>> For more information on this change refer to phab task T67508
>>> <https://phabricator.wikimedia.org/T67508>.
>>>
>>> Thank you!
>>>
>>> --
>>> *Francisco Dans*
>>> Software Engineer, Analytics Team
>>> Wikimedia Foundation
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> Analytics@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>>
>>
>>
>> --
>> Tilman Bayer
>> Senior Analyst
>> Wikimedia Foundation
>> IRC (Freenode): HaeB
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>


-- 
*Francisco Dans*
Software Engineer, Analytics Team
Wikimedia Foundation
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to