Re: [Analytics] [Wikimedia Research Showcase] March 17: Curiosity

2021-03-15 Thread Janna Layton
Reminder that this will be happening on Wednesday. On Thu, Mar 11, 2021 at 12:49 PM Janna Layton wrote: > In this showcase, Prof. Danielle Bassett will present recent work studying > individual and collective curiosity as network building processes using > Wikipedia. > > Date/Time: March 17, 16:

Re: [Analytics] Pageview-complete entries labeled as "-"

2021-03-15 Thread Joseph Allemandou
Hi again Ogier, > I don't exactly understand the part about the page_id being defined in the request. I thought the page_id was "resolved" based on the page_title being in the uri_query. This is not how the page_id is set in our traffic datasets :) We receive the page_id in an HTTP header, set by t

Re: [Analytics] Pageview-complete entries labeled as "-"

2021-03-15 Thread Ogier Maitre
Hello Joseph, Thank you for your detailed response. We suspected curid could be part of the equation here, but it is nice to have it confirmed here (at least for a part of the answer). > The entry appears two times because for one of them there is no page_id > defined in the request, therefore

Re: [Analytics] About readership timestamp

2021-03-15 Thread Andrew Otto
Hi, yes, we prefer to always use UTC for timestamps. On Fri, Mar 12, 2021 at 2:40 PM Ho Chung wrote: > Hello > > On this page, do you know whether, when a reader visits any Chinese web page, > > e.g. https://zh.wikipedia.org/wiki/MP3 > > > the timestamp uses UTC? > > > https://wikitech.wiki

Re: [Analytics] Pageview-complete entries labeled as "-"

2021-03-15 Thread Joseph Allemandou
Hello Ogier, Thank you very much for the wikimaps work and your thorough analysis of the pageviews :) Here is what I found on your two questions, investigating one day of recent `user` pageview data (we keep detailed data for 90 days only, and I needed that level of detail for the analysis). > Wha

[Analytics] Fwd: About: refine_webrequest.hql

2021-03-15 Thread Joseph Allemandou
Forwarding to the analytics list for reference. ---------- Forwarded message ---------- From: Ho Chung Date: Mon, Mar 15, 2021 at 11:45 AM Subject: Re: [Analytics] About: refine_webrequest.hql To: Joseph Allemandou Hello, thanks for your reply. Because I was researching your Analytics team public

Re: [Analytics] About: refine_webrequest.hql

2021-03-15 Thread Joseph Allemandou
Hi, the `dt` field is the time in UTC (no timezone specified) at which Varnish finishes processing the request. Cheers Joseph On Mon, Mar 15, 2021 at 8:36 AM Luca Toscano wrote: > +A mailing list for the Analytics Team at WMF and everybody who has an > interest in Wikipedia and analytics. >
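
Since `dt` carries no timezone marker, a consumer has to attach UTC explicitly before converting to a reader's local time. The following is a minimal sketch of that step, not code from the Analytics pipeline. It assumes `dt` arrives as an ISO 8601 string such as `2021-03-15T08:36:00`; the sample value, the helper names `parse_dt_as_utc` and `to_fixed_offset`, and the UTC+8 offset are all hypothetical and chosen only for illustration.

```python
from datetime import datetime, timedelta, timezone


def parse_dt_as_utc(dt_string: str) -> datetime:
    """Parse a `dt` value and mark it as UTC.

    Assumption: the string is ISO 8601 without a timezone suffix.
    If an offset is present anyway, it is normalized to UTC instead.
    """
    parsed = datetime.fromisoformat(dt_string)
    if parsed.tzinfo is not None:
        return parsed.astimezone(timezone.utc)
    # A naive value is interpreted as UTC, as stated in the thread.
    return parsed.replace(tzinfo=timezone.utc)


def to_fixed_offset(utc_dt: datetime, hours: int) -> datetime:
    """Convert an aware UTC datetime to a fixed offset (hypothetical helper)."""
    return utc_dt.astimezone(timezone(timedelta(hours=hours)))


if __name__ == "__main__":
    sample = "2021-03-15T08:36:00"  # hypothetical `dt` value
    utc_dt = parse_dt_as_utc(sample)
    local_dt = to_fixed_offset(utc_dt, 8)  # UTC+8, illustration only
    print(utc_dt.isoformat())    # 2021-03-15T08:36:00+00:00
    print(local_dt.isoformat())  # 2021-03-15T16:36:00+08:00
```

A fixed offset keeps the sketch dependency-free; a real analysis would more likely use a named zone so that daylight-saving rules are handled.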

Re: [Analytics] About: refine_webrequest.hql

2021-03-15 Thread Luca Toscano
+A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Hi! I added the Analytics mailing list in Cc so other people can chime in. This is the canonical way to follow up with us and the community, so please avoid direct email if possible :) Thank