Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-12-06 Thread Nuria Ruiz
>Also, just out of curiosity and to better understand the issue, what would be an example of a real life request URL that results in such a "no page title found" error when extracting the title? Special page requests, for example. Normally pages like "Special:Blah" are "actions", not pages themse

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-12-05 Thread Tilman Bayer
On Mon, Nov 14, 2016 at 12:25 PM, Nuria Ruiz wrote: > This is documented now here: > > https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas Thanks for the documentation. Does this only affect data provided by the API, or also the page_title field in the pageview_hourly table, i.e. the

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Correction: The number for 404.php shot up on September 13: https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia/desktop/user/404.php/daily/20160901/20160930?purge756777637 On Thu, Nov 17, 2016 at 4:51 PM, Vipul Naik wrote: > Thanks for opening the ticket and for clarifyi
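For readers who want to reproduce the query above, the following is a minimal sketch of how a per-article request URL of that shape can be assembled. The path segment order (project, access, agent, article, granularity, start, end) is read off the URL quoted in the message; the function name `per_article_url` and its parameter names are assumptions made for illustration, and the sketch only builds the URL without sending a request.

```python
from urllib.parse import quote

# Base path taken from the URL quoted in the message above.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def per_article_url(project, access, agent, article, granularity, start, end):
    """Build a per-article pageviews URL (hypothetical helper, no network call).

    The article title is percent-encoded so that titles containing '/' or
    spaces still occupy exactly one path segment.
    """
    encoded_article = quote(article, safe="")
    parts = [project, access, agent, encoded_article, granularity, start, end]
    return BASE + "/" + "/".join(parts)


# The 404.php query from the message, and the same query for the "-" title.
url_404 = per_article_url(
    "en.wikipedia", "desktop", "user", "404.php", "daily", "20160901", "20160930"
)
url_dash = per_article_url(
    "en.wikipedia", "desktop", "user", "-", "daily", "20160901", "20160930"
)
print(url_404)
print(url_dash)
```

Swapping `"404.php"` for `"-"` gives the corresponding query for the single-hyphen title discussed in this thread.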

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Thanks for opening the ticket and for clarifying the issue more. On a related note, I wonder if you could add the documentation for the unusual amount of pageviews to 404.php as returned by the API. That number also shot up in October 2016; see http://wikipediaviews.org/displayviewsformultiplemont

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Nuria Ruiz
>Just to verify what you are saying, would it be right to say that the bug fix caused a lot of pageviews to be moved from the respective (nonexistent) pages to "-" pageviews? No, the bugfix means those faulty requests are no longer stored as pageviews, so it cannot make that number increase.

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-17 Thread Vipul Naik
Thank you for linking to that bug, Marcel. Just to verify what you are saying, would it be right to say that the bug fix caused a lot of pageviews to be moved from the respective (nonexistent) pages to "-" pageviews? And, does that mean that the current estimate of "-" pageviews is more accurate

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-16 Thread Marcel Ruiz Forns
Maybe the high value in October (45M) has something to do with the last changes in https://phabricator.wikimedia.org/T145922 ? On Mon, Nov 14, 2016 at 9:25 PM, Nuria Ruiz wrote: > This is documented now here: > > https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas > > On Tue, Nov 8

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-14 Thread Nuria Ruiz
This is documented now here: https://wikitech.wikimedia.org/wiki/Analytics/PageviewAPI#Gotchas On Tue, Nov 8, 2016 at 7:25 AM, Vipul Naik wrote: > Hi Joseph, > > Thanks for the clarification. > > Any ideas why this number is much higher for some months? In particular, > on desktop, it's high in

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-08 Thread Vipul Naik
Hi Joseph, Thanks for the clarification. Any ideas why this number is much higher for some months? In particular, on desktop, it's high in the months of July to September 2015 (around 10 million, compared to the usual 5 million) and then high again in October 2016 (45 million, about 10x the usual

Re: [Analytics] High number of pageviews on page with single hyphen as title

2016-11-08 Thread Joseph Allemandou
Hello Issa, Thank you for your question. The very high number of views of the "-" page is explained by this dash value being used as a special value for "no page title found" when extracting titles from urls. We definitely should document this in the API, creating this task: https://phabricator.wi
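To illustrate the behaviour Joseph describes, here is a minimal sketch of a title extractor that falls back to the "-" sentinel. It is not the actual Wikimedia extraction code: the function name `extract_title`, the constant `NO_TITLE`, and the two URL patterns handled (`/wiki/<title>` and a `title=` query parameter) are assumptions chosen for the example.

```python
from urllib.parse import urlparse, parse_qs, unquote

# Sentinel meaning "no page title found", as described in the message above.
NO_TITLE = "-"


def extract_title(url):
    """Return the page title found in `url`, or NO_TITLE if none is found.

    Hypothetical sketch: only /wiki/<title> paths and ?title=<title>
    query strings are recognised.
    """
    parsed = urlparse(url)

    # Case 1: title carried in the path, e.g. /wiki/Main_Page
    prefix = "/wiki/"
    if parsed.path.startswith(prefix):
        title = unquote(parsed.path[len(prefix):])
        if title:
            return title

    # Case 2: title carried in the query string, e.g. /w/index.php?title=Foo
    titles = parse_qs(parsed.query).get("title", [])
    if titles and titles[0]:
        return titles[0]

    # Nothing recognisable: fall back to the sentinel.
    return NO_TITLE


print(extract_title("https://en.wikipedia.org/wiki/Main_Page"))
print(extract_title("https://en.wikipedia.org/w/index.php?search=foo"))
```

Under this sketch, a request with no recognisable title and a request for a page literally titled "-" both yield "-", so the two cases cannot be told apart in the resulting counts. That matches the inflated "-" total discussed in this thread.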

[Analytics] High number of pageviews on page with single hyphen as title

2016-11-07 Thread Issa Rice
Dear Analytics Mailing List, Recently while querying pageviews of various pages, I discovered that the page whose title is a single hyphen character (i.e. with the title "-", with URL , which redirects to ) receives an un