Hello,

> One question we have is whether the pageviews we observe are driven by
> bots and spiders. We know that the wikimedia rest api provides this
> information going back to July 1 2015.

Please keep in mind that these are only self-identified bots. Probably
about 1-5% of bot pageview traffic gets wrongly labeled as "user"; a
project is under way to better label this traffic as coming from bots.
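
For reference, here is a rough sketch of how the per-agent numbers could be
requested from the REST API. It assumes the aggregate pageviews endpoint and
its "user" and "spider" agent values; the helper name aggregate_url and the
date range are only illustrative, and the code builds the request URLs
without fetching anything.

```python
from urllib.parse import quote

# Assumed base path of the aggregate pageviews endpoint.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/aggregate"


def aggregate_url(project, agent, start, end,
                  access="all-access", granularity="daily"):
    """Build a request URL for aggregate pageviews of one agent type.

    start and end are timestamps in YYYYMMDDHH form.
    This helper is hypothetical, written for illustration only.
    """
    parts = [project, access, agent, granularity, start, end]
    return BASE + "/" + "/".join(quote(p, safe="") for p in parts)


# One URL per agent type, for the first week of data mentioned above.
urls = {
    agent: aggregate_url("zh.wikipedia.org", agent, "2015070100", "2015070700")
    for agent in ("user", "spider")
}

for agent, url in urls.items():
    print(agent, url)
```

Comparing the "user" and "spider" series from these two requests gives the
self-identified bot share; per the caveat above, the "user" series may still
contain some unlabeled bot traffic.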




On Tue, Nov 13, 2018 at 6:41 AM Jennifer Pan <[email protected]> wrote:

> Hi there,
>
>
> I'm an assistant professor in the Department of Communication at Stanford.
> My co-author, Molly Roberts (Political Science, UCSD), and I are working on
> a paper examining the effect of China's 2015 block of Chinese-language
> Wikipedia on pageviews, which builds on our previous work on censorship in
> China.
>
> We are using the block to conduct an interrupted time series design to
> measure the effect of censorship on Chinese users. Our main finding is that
> Chinese users were using Wikipedia to browse (starting at the home page),
> and the block influenced users' ability to explore and encounter unexpected
> information. One question we have is whether the pageviews we observe are
> driven by bots and spiders. We know that the wikimedia rest api provides
> this information going back to July 1 2015. Since the China block of
> Wikipedia was on May 19, 2015, we are wondering if there is pageview data
> by agent type for zh.wikipedia.org pages (all or some subset like most
> popular) going back to May 2015 (specifically May 18-21, 2015)? From
> https://meta.wikimedia.org/wiki/Research:Timeline_of_Wikimedia_analytics,
> it says that pageview data is available in bulk starting on May 1, 2015,
> so we thought maybe there was some chance this data exists.
>
> Any suggestions would be greatly appreciated, and if this is not possible,
> please let us know.
>
> Thank you!
> Jennifer Pan
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>