[Wiki-research-l] New viz.: Wikipedias, participation per language

2018-09-10 Thread Erik Zachte
continent https://stats.wikimedia.org/wikimedia/participation/d3_participation_continent.html You can also zoom in on one continent, by clicking on it Any feedback is welcome. Erik Zachte ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org

[Analytics] New viz.: Wikipedias, participation per language

2018-09-10 Thread Erik Zachte
continent https://stats.wikimedia.org/wikimedia/participation/d3_participation_continent.html You can also zoom in on one continent, by clicking on it Any feedback is welcome. Erik Zachte ___ Analytics mailing list Analytics@lists.wikimedia.org https

Re: [Wiki-research-l] Wikimedia Commons data structure - public?

2018-07-19 Thread Erik Zachte
Hi Trilce, There is new set of dumps for every Wikimedia wiki at least once a month. Among those files are several database dumps in xml format. One with the most recent version of every article, one with meta data but no article texts ('stub dumps'). One with full texts for every revision of ever

[Analytics] New files for geo coded Wikimedia stats

2018-07-11 Thread Erik Zachte
Today I released two new json files [2][4]. Both complement visualization 'Wikipedia Views Visualized' [1] (aka WiViVi), but both can be useful in other contexts as well. 1) File 'demographics_from_world_bank_for_wikimedia.json' [2] resulted from harvesting World Bank API files. It contains yearly

[Wiki-research-l] New files for geo coded Wikimedia stats

2018-07-11 Thread Erik Zachte
Today I released two new json files [2][4]. Both complement visualization 'Wikipedia Views Visualized' [1] (aka WiViVi), but both can be useful in other contexts as well. 1) File 'demographics_from_world_bank_for_wikimedia.json' [2] resulted from harvesting World Bank API files. It contains yearly

Re: [Analytics] Tool to visualize which wiki pages link to which wiki pages?

2017-11-27 Thread Erik Zachte
A problem with the category hierarchy is that any rather out of place subcategory brings in a full branch of anomalous subjects below it. Thus making a report like https://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-en/2015-06/pageviews_wp-en_cat_WikiProject_Medicine_2015-06.html

Re: [Analytics] Undocumented project code in pagecounts-ez

2017-11-14 Thread Erik Zachte
Sorry for this code scheme being not so intuitive, which two meanings for 'm' depending on where it appears. The coding system was extended several times, and Christian Aistleitner and I prioritized downward compatibility over intuitiveness, reluctantly. Erik From: Analytics [mailto:

Re: [Wikitech-l] Can we drop revision hashes (rev_sha1)?

2017-09-15 Thread Erik Zachte
process flow. That I can't tell. Erik Zachte -Original Message- From: Wikitech-l [mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Daniel Kinzler Sent: Friday, September 15, 2017 12:52 To: Wikimedia developers Subject: [Wikitech-l] Can we drop revision hashes (rev_sha1)? H

Re: [Analytics] Resources stat1005

2017-08-12 Thread Erik Zachte
I will soon start the two Wikistats jobs which run for about several weeks each month, They might use two cores each, one for unzip, one for perl. How many cores are there anyway? Cheers, Erik From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Adrian Bielefel

Re: [Analytics] Daily page count dumps not working for last few days?

2017-08-07 Thread Erik Zachte
Progress is kept at https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(technical)#Pagecounts-ez_dataset_hasn.27t_generated_since_JUL-23 Cheers, Erik From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Chris Zaharia Sent: Thursday, July 27, 2017 14:52 To: analyt

[Wiki-research-l] new viz. WiViVi = Wikipedia Views Visualized

2017-08-02 Thread Erik Zachte
. Thanks, Erik Zachte ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Re: [Analytics] Top editors in a certain namespace across sites?

2017-06-02 Thread Erik Zachte
ace across sites? Hi Erik, My query only looks at the last 30 days. See the "(in the last 30 days)" suffix on the title :) This explains the discrepancy in our counts. -Aaron On Thu, Jun 1, 2017 at 3:54 PM, Erik Zachte wrote: Here is an experiment I did about two m

Re: [Wikimedia-l] [Wikimedia Announcements] Wikimedia Deutschland: Annual Report 2016

2017-06-02 Thread Erik Zachte
Wow, very impressive report overall! I particularly love the videos. They are quite informative, the ones with real people are a great introduction to what editing entails, the animated ones are entertaining and inspiring. These videos imo deserve to be used on many of our projects, localized

Re: [Analytics] Top editors in a certain namespace across sites?

2017-06-01 Thread Erik Zachte
Here is an experiment I did about two months ago with First Normal Form (yikes!) and GNU. I just never posted this yet. I collected alle edits from all wikis using recent full history stub dumps, with a perl script, which took some 30 hours. The total file for all Wikimedia wikis is 240 GB uncom

Re: [Wikimedia-l] Wikitribune!

2017-04-26 Thread Erik Zachte
Wed, Apr 26, 2017 at 9:01 AM, Erik Zachte wrote: > Here is a high level Wikistats page on how our projects fared in terms > of active wikis. > https://stats.wikimedia.org/EN/ProjectTrendsActiveWikis.html > > Erik Zachte > > -Original Message- > From: Wikimedia-l [mail

Re: [Wikimedia-l] Wikitribune!

2017-04-26 Thread Erik Zachte
Here is a high level Wikistats page on how our projects fared in terms of active wikis. https://stats.wikimedia.org/EN/ProjectTrendsActiveWikis.html Erik Zachte -Original Message- From: Wikimedia-l [mailto:wikimedia-l-boun...@lists.wikimedia.org] On Behalf Of Alessandro Marchetti Sent

Re: [Analytics] Fwd: follow-up on editors

2017-04-25 Thread Erik Zachte
27;m fine with using 5+ edits per month and 100+ edits per month as measures of productivity, but I would prefer to drop the terms "active editor" and "very active editor". I'd also like to see more prominence given to other metrics such as bytes changed and logged no

Re: [Analytics] Fwd: follow-up on editors

2017-04-11 Thread Erik Zachte
saying that anyone who contributes to publicly recorded astronomy observations is an astronomer -- even if they have only done so once. In my estimation, that doesn't sound crazy. Your comparison to "looking at the night sky" is a lot more like reading Wikipedia. -Aaron

Re: [Analytics] Fwd: follow-up on editors

2017-04-11 Thread Erik Zachte
About 'Number of editors who contribute 1 edit per month?' I'm hoping we're not going that use that number for our next fundraiser ;-) The more inclusive our numbers are, the less meaningful, bordering on alternative facts. A person with one edit in any given month is as much an editor

Re: [Analytics] Os stats

2017-03-14 Thread Erik Zachte
Hi Christian, I'm forwarding your question to the WMF Analytics Team who authored this report. Cheers, Erik -Original Message- From: Christian Schaller [mailto:cscha...@redhat.com] Sent: Monday, March 13, 2017 16:07 To: Erik Zachte Cc: Tomas Popela Subject: Re: Os stats Hi

Re: [Wikimedia-l] Very good news!

2017-02-19 Thread Erik Zachte
ount for active editors trends. For similar charts for other Wikipedias see https://stats.wikimedia.org/EN/PlotsPngEditHistoryTop.htm Erik Zachte P.S. unrelated but good to know if you dive into Wikistats: February 2017 reports are incomplete (under investigation) -Original Message- Fr

[MediaWiki-commits] [Gerrit] analytics/wikistats[master]: Update banner with design consultation

2017-02-14 Thread Erik Zachte (Code Review)
Erik Zachte has submitted this change and it was merged. ( https://gerrit.wikimedia.org/r/337452 ) Change subject: Update banner with design consultation .. Update banner with design consultation Change-Id

Re: [Analytics] Research on wikipedia traffic and educational quality

2017-02-08 Thread Erik Zachte
Dear Fabian, Let me relay your question to the WMF Analytics Team. Best regards, Erik Zachte From: Fabian Stephany [mailto:fn...@cam.ac.uk] Sent: Sunday, February 05, 2017 20:48 To: erikzac...@infodisiac.com; erikzac...@wikimedia.org Cc: Fabian Braesemann Subject: Research on

Re: [Analytics] Figures regarding African topics coverage

2017-02-08 Thread Erik Zachte
(Cc'ing Analytics mailing list) Hi Samuel, I can only partially answer your questions, but maybe colleagues can add to this. ==Page views== Not exactly what you ask, but we do publish monthly update stats on pageviews per country and per language This report shows how many pa

Re: [Wikitech-l] [Analytics] Monthly page view stats that can now be queried via Pageview API.

2017-01-25 Thread Erik Zachte
Thanks Analytics Team for this really good news! A lot of reporting tools will benefit. Erik Zachte -Original Message- From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Wes Moran Sent: Wednesday, January 25, 2017 18:41 To: A mailing list for the Analytics Team

Re: [Analytics] [Wikitech-l] Monthly page view stats that can now be queried via Pageview API.

2017-01-25 Thread Erik Zachte
Thanks Analytics Team for this really good news! A lot of reporting tools will benefit. Erik Zachte -Original Message- From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Wes Moran Sent: Wednesday, January 25, 2017 18:41 To: A mailing list for the Analytics Team

[Wiki-research-l] Wiki Loves Monuments 2016 stats

2017-01-10 Thread Erik Zachte
New stats are available for Wiki Loves Monuments 2016 contest http://infodisiac.com/blog/2017/01/wiki-loves-monuments-2016/ Charts also on https://commons.wikimedia.org/wiki/Category:Wiki_Loves_Monuments_2016_stats Erik Zachte ___ Wiki

[Analytics] unique devices per device type

2016-11-10 Thread Erik Zachte
Hi, I am looking for monthly unique devices per device type (desktop, smartphone, tablet, maybe 'other'). I found this https://dumps.wikimedia.org/other/unique_devices/2016/2016-09/ file unique_devices_monthly-2016-09.gz But this totals unique devices per platform, e.g. en.m.wikipedia

Re: [Wiki-research-l] Wikipedia video stats ?

2016-11-04 Thread Erik Zachte
There is work being done towards front-end for media count files. Step one completed: at least the counts are in a database now, albeit only some columns. https://phabricator.wikimedia.org/T116363 Erik -Original Message- From: Wiki-research-l [mailto:wiki-research-l-boun...@lists.wikim

Re: [Analytics] [Wiki-research-l] Wikipedia video stats ?

2016-11-04 Thread Erik Zachte
There is work being done towards front-end for media count files. Step one completed: at least the counts are in a database now, albeit only some columns. https://phabricator.wikimedia.org/T116363 Erik -Original Message- From: Wiki-research-l [mailto:wiki-research-l-boun...@lists.wikim

Re: [Analytics] https://stats.wikimedia.org/wikimedia/squids/SquidReportOperatingSystems.htm

2016-10-12 Thread Erik Zachte
https://analytics.wikimedia.org/dashboards/browsers/#all-sites-by-os I forward you message to the WMF Analytics Team who maintain these stats. Best regards, Erik Zachte From: Haar, Dirk [mailto:dirk.h...@partner.commerzbank.com] Sent: Wednesday, October 12, 2016 11:22 To: '

Re: [Analytics] Seeking feedback (+ answer to 1 question) on a timeline of Wikipedia analytics

2016-09-30 Thread Erik Zachte
Vipul, I found this: for June 18, 2014 [WikimediaMobile] New tablet-optimized mobile site now live for all tablet users https://lists.wikimedia.org/pipermail/mobile-l/2014-June/007394.html The title of the meta page is up to you, but the current title seems fine to me, except that this

Re: [Analytics] Seeking feedback (+ answer to 1 question) on a timeline of Wikipedia analytics

2016-09-30 Thread Erik Zachte
Hi Vipul, Thanks for doing this. I made a few changes to the timeline. Wouldn't meta be an appropriate place for this? Notability is not an issue there. And it formalizes co-authoring. Cheers, Erik From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of

Re: [Wikimedia-l] Ray Saintonge has died

2016-09-13 Thread Erik Zachte
ay as I remember him, accompanied by Phoebe https://vimeo.com/134889976 Erik Zachte -Original Message- From: Wikimedia-l [mailto:wikimedia-l-boun...@lists.wikimedia.org] On Behalf Of Itzik - Wikimedia Israel Sent: Tuesday, September 13, 2016 18:49 To: Wikimedia Mailing List Subjec

Re: [Analytics] Urgent Data Issue

2016-08-17 Thread Erik Zachte
ut) and for each wiki requests from mobile devices are now included (desktop and mobile counts on separate lines). I hope this helps, Cheers, Erik Zachte From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Jaime Crespo Sent: Wednesday, August 17, 2016 16

Re: [Analytics] Analytics dashboards by country

2016-08-13 Thread Erik Zachte
Hi Stephen, Traffic breakdown by country reports are still operational. Updates were overdue. Sorry. I generated reports for June/July today. https://stats.wikimedia.org/wikimedia/squids/SquidReportPageViewsPerCountryBreakdown.htm Cheers, Erik From: Analytics [mailto:analytics-

Re: [Analytics] [Wikistats 2.0] [Regular Update] First update on Wikistats 2.0

2016-07-31 Thread Erik Zachte
Thanks Dan, very helpful update. Erik From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Amir E. Aharoni Sent: Sunday, July 31, 2016 8:00 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [A

Re: [Wiki-research-l] Multi year page views statistics

2016-07-11 Thread Erik Zachte
New phab request: https://phabricator.wikimedia.org/T139934 Erik -Original Message- From: Wiki-research-l [mailto:wiki-research-l-boun...@lists.wikimedia.org] On Behalf Of Federico Leva (Nemo) Sent: Monday, July 11, 2016 15:29 To: avnerkan...@gmail.com; Research into Wikimedia content an

Re: [Analytics] Wiki Page Views Project

2016-07-09 Thread Erik Zachte
Manny, If you're willing to download large files, monthly totals are here https://dumps.wikimedia.org/other/pagecounts-ez/merged/ pagecounts-2016-06-views-ge-5.bz2 has all titles with 5 or more r

Re: [Wikimedia-l] Rosie Stephenson-Goodknight

2016-06-30 Thread Erik Zachte
Gerard, feel free to follow-up on your call to action with more action. Erik Zachte -Original Message- From: Wikimedia-l [mailto:wikimedia-l-boun...@lists.wikimedia.org] On Behalf Of Gerard Meijssen Sent: Thursday, June 30, 2016 11:38 To: Wikimedia Mailing List Subject: [Wikimedia-l

Re: [Analytics] Trends in main page statistics

2016-06-23 Thread Erik Zachte
or up to date visuals, this time in D3) also SquidReportPageViewsPerLanguageBreakdown.htm still says in small type it uses 1:1000 sampled log data (no longer, I'll remove that) it does also says in huge type it now uses hadoop Erik Zachte From: Erik Zachte [mailto:ezac...@wik

Re: [Analytics] Trends in main page statistics

2016-06-23 Thread Erik Zachte
/uploads/2012/11/Share-of-Wikipedia-page-views-from-South-America.png note this is *share of pageviews* so the summer dip is concealed on wp:es, as it coincides with other languages Erik Zachte From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Pine W Sent: Thursday

Re: [Analytics] Pageview analysis graphs not loading

2016-06-22 Thread Erik Zachte
url? From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Pine W Sent: Wednesday, June 22, 2016 23:53 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: [Analytics] Pageview analysis graphs not loading

Re: [Wikimedia-l] Welcome Delphine Ménard as WMF's Annual Plan Grants Program Officer

2016-06-21 Thread Erik Zachte
Great news! Congrats, Delphine. Erik -Original Message- From: Wikimedia-l [mailto:wikimedia-l-boun...@lists.wikimedia.org] On Behalf Of Pierre-Selim Sent: Tuesday, June 21, 2016 12:04 To: Wikimedia Mailing List Subject: Re: [Wikimedia-l] Welcome Delphine Ménard as WMF's Annual Plan Grant

Re: [Analytics] Pagecount Datasets to be Deprecated at the end of May

2016-05-26 Thread Erik Zachte
It's as much as changing the download url. The new version is downward compatible. https://dumps.wikimedia.org/other/pageviews/ Erik Zachte From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Ryan Kaldari Sent: Thursday, May 26, 2016 23:27 To: A mailing lis

Re: [Analytics] API usage advice

2016-04-28 Thread Erik Zachte
Hi Sander, Not an API but probably relevant, this data stream on media (binary file) downloads: https://wikitech.wikimedia.org/wiki/Analytics/Data/Mediacounts Cheers, Erik Zachte From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Sander Ubink Sent

Re: [Wiki-research-l] Finding the most viewed Wikipedia articles on education

2016-04-21 Thread Erik Zachte
manageable, by blacklisting weird subbranches. https://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-en/2016-02/categories_wp-en_cat_Education_2016-02.html Erik Zachte From: Wiki-research-l [mailto:wiki-research-l-boun...@lists.wikimedia.org] On Behalf Of Leila Zia Sent

Re: [Analytics] Problem in Pageviews Files

2016-04-20 Thread Erik Zachte
this data stream it's still there just to not break client scripts which may expect it. Cheers, Erik Zachte -Original Message- From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of hafez Sent: Monday, April 18, 2016 15:49 To: analytics@lists.wikimedia.org Subj

Re: [Wikitech-l] [Analytics] Unique Devices data available on API

2016-04-19 Thread Erik Zachte
27;t fixed over time. Erik Zachte -Original Message- From: Wikitech-l [mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Gergo Tisza Sent: Tuesday, April 19, 2016 22:54 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and ana

Re: [Analytics] [Wikitech-l] Unique Devices data available on API

2016-04-19 Thread Erik Zachte
27;t fixed over time. Erik Zachte -Original Message- From: Wikitech-l [mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Gergo Tisza Sent: Tuesday, April 19, 2016 22:54 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and ana

[MediaWiki-commits] [Gerrit] [WIP] Link to the new browser-reports - change (analytics/wikistats)

2016-03-26 Thread Erik Zachte (Code Review)
Erik Zachte has submitted this change and it was merged. Change subject: [WIP] Link to the new browser-reports .. [WIP] Link to the new browser-reports This patch includes: - Replace the links to SquidReportOperatingSystems

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2016-02-16 Thread Erik Zachte
reate more extensive datasets with >>>> more different measurements in a single datafile. On the other hand, the >>>> files would become even bigger in size. Not an issue for mee, but for users >>>> in the field accesibility (dowlnload bandwidth) could become an iss

Re: [Wiki-Medicine] [Analytics] Zika

2016-02-15 Thread Erik Zachte
Some observations (maybe stating the obvious): https://tools.wmflabs.org/pageviews/#start=2016-01-16&end=2016-02-14&project=en.wikipedia.org&platform=all-access&agent=user&pages=Zika_virus the double peak seems to confirm PV count on wp:en is not correlated much with spread of the disease, but

Re: [Analytics] [Wiki-Medicine] Zika

2016-02-15 Thread Erik Zachte
Some observations (maybe stating the obvious): https://tools.wmflabs.org/pageviews/#start=2016-01-16&end=2016-02-14&project=en.wikipedia.org&platform=all-access&agent=user&pages=Zika_virus the double peak seems to confirm PV count on wp:en is not correlated much with spread of the disease, but

Re: [Wikidata] [Analytics] [Wiki-Medicine] Zika

2016-02-15 Thread Erik Zachte
Some observations (maybe stating the obvious): https://tools.wmflabs.org/pageviews/#start=2016-01-16&end=2016-02-14&project=en.wikipedia.org&platform=all-access&agent=user&pages=Zika_virus the double peak seems to confirm PV count on wp:en is not correlated much with spread of the disease, but

Re: [Analytics] Zika

2016-02-14 Thread Erik Zachte
FWIW here are PV reports for topically related articles, e.g. Dengue can be translated by the same musquito: http://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-es/2016-01/pageviews_wp-es_cat_Flaviviridae_2016-01.html http://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-en

Re: [Wiki-Medicine] [Analytics] Zika

2016-02-14 Thread Erik Zachte
FWIW here are PV reports for topically related articles, e.g. Dengue can be translated by the same musquito: http://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-es/2016-01/pageviews_wp-es_cat_Flaviviridae_2016-01.html http://stats.wikimedia.org/wikimedia/pageviews/categorized/wp-en

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Erik Zachte
ryone :) I'll poke the thread again after the New Year. Happy Holidays! On Thu, Dec 24, 2015 at 9:21 AM, Erik Zachte wrote: Dan, thanks for raising the issue (a bit less for raising it on X-mas eve ;-) (just kidding, mostly) Frankly I don't see much use for the earlier relea

Re: [Analytics] [Pageviews] [Technical] Simplifying the available static dumps of pageview data

2015-12-24 Thread Erik Zachte
8 PM, Alex Druk wrote: Nothing against this approach! On Thu, Dec 24, 2015 at 2:55 PM, Dan Andreescu wrote: On Thu, Dec 24, 2015 at 8:48 AM, Alex Druk wrote: Hi Dan, Happy holidays! Good idea to combine these datasets! However we have one more dataset by Erik Zachte : http://dumps.wik

Re: [Analytics] Data collection

2015-12-14 Thread Erik Zachte
Hi Caitlin, Here is a breakdown of categories within Phytopathology on English wikipedia: http://ow.ly/VQNVL and the articles within those categories ranked by page view for Oct 2015 : http://ow.ly/VQNCv I can run similar reports for earlier months. Cheers, Erik From: Analyti

[Analytics] Wikistats upgraded to new page view definition

2015-12-03 Thread Erik Zachte
Hi all, I just released a major upgrade for Wikistats traffic reports: see blog post http://infodisiac.com/blog/2015/12/wikistats-upgraded-to-new-page-view-definition/ Erik Zachte ___ Analytics mailing list Analytics@lists.wikimedia.org

Re: [Wikimedia-l] Fundraising banner (again)

2015-12-01 Thread Erik Zachte
The ad would be slightly more palatable if it used coffee-darkbrown instead of epitaph-black for the plea you can't ignore. Erik Zachte -Original Message- From: Wikimedia-l [mailto:wikimedia-l-boun...@lists.wikimedia.org] On Behalf Of wctaiwan Sent: Wednesday, December 02, 2015 0:

Re: [Analytics] stats.wikimedia.org typo

2015-10-28 Thread Erik Zachte
Fixed Erik From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Pine W Sent: Wednesday, October 28, 2015 23:27 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: [Analytics] stats.wikimedia.org typ

Re: [Analytics] [Spam] Re: User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Erik Zachte
y's giving a higher number? I agree entirely that we should be very careful with quoting these figures. I think you'd probably be safe to say that more than a million people have edited... but even then I'd be cautious. Andrew. On 27 October 2015 at 11:11, Erik Zachte wrote:

Re: [Analytics] [Spam] Re: User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Erik Zachte
eople will still not get the full picture, but many more will know a story that is closer to the reality of Wikipedia. Leila -Aaron On Tue, Oct 27, 2015 at 11:03 AM, Erik Zachte wrote: I do agree that we reject good contributions. I also agree this is a messy filter.

Re: [Analytics] [Spam] Re: User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Erik Zachte
not get the full picture, but many more will know a story that is closer to the reality of Wikipedia. Leila -Aaron On Tue, Oct 27, 2015 at 11:03 AM, Erik Zachte wrote: I do agree that we reject good contributions. I also agree this is a messy filter. The main point however is do

Re: [Analytics] [Spam] Re: User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Erik Zachte
e are also many edits that get reverted. Arguably, those edits aren't productive either, but they don't disappear from the dumps like article drafts do. This is a messy filter at best. On Tue, Oct 27, 2015 at 10:28 AM, Erik Zachte wrote: As Aaron says. I'd like to add that

Re: [Analytics] User statistics for video marking ENWP 5m article milestone

2015-10-27 Thread Erik Zachte
s supposed not to do that. - How many people will have registered in good faith just out of habit, or to tweak presentation preferences, and then played with the edit button just to see what happens? Note that roughly 2 out of 3 accounts doesn't even reach 3 edits. Cheers, Erik Zachte [1]

Re: [WikimediaMobile] [Wmfall] Tilman is joining the Reading Team!

2015-10-16 Thread Erik Zachte
Congrats, Tilman. And can you sign me up for that report please? Thanks. Erik From: Wmfall [mailto:wmfall-boun...@lists.wikimedia.org] On Behalf Of Jessica Robell Sent: Friday, October 16, 2015 12:05 To: Katy Love Cc: Staff (All); mobile-l Subject: Re: [Wmfall] [WikimediaMobile] Tilman is

Re: [Wikimedia-l] Q1 Fundraising Update

2015-10-15 Thread Erik Zachte
27;re told, the best they can. The Wikimedia Foundation gets a lot of flak in these discussions. But isn't WMF operating within limits set by the Board of Trustees? Lila can propose a budget, but the Board is ultimately responsible, needs to approve that budget, and can amend it. Erik Zacht

Re: [Wikimedia-l] I'll be moving on

2015-10-08 Thread Erik Zachte
Dear Jan, Thank you very much for your immense contributions to the movement! Not the least of those is your ability to be (and I quote Sandra Rientjes) 'a source of sanity and calm'. I can't vow for chapter meetings as Sandra does, but I have seen a lot of that on public mailing lists. Happy e

Re: [Analytics] Redis on stat1001

2015-10-07 Thread Erik Zachte
015 3:14 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [Analytics] Redis on stat1001 Does stats.wikimedia.org use redis? On Oct 7, 2015 8:02 PM, "Erik Zachte" wrote: stat1001 hosts a.o. stats.wikimedi

Re: [Analytics] Redis on stat1001

2015-10-07 Thread Erik Zachte
stat1001 hosts a.o. stats.wikimedia.org From: Analytics [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Aaron Halfaker Sent: Thursday, October 08, 2015 2:46 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re:

Re: [Analytics] Intervention analysis (Re: Wikimedia traffic forecast application)

2015-09-18 Thread Erik Zachte
It would be interesting to test for random dates in the past how often the prediction went outside predicted range without any significant intervention happening at that time. It's so easy to accept favorable deviations as a prove the intervention worked, while deviations probably happen all the

Re: [Analytics] corrupted and missing log files

2015-09-14 Thread Erik Zachte
Hi George, Server mishaps often had to do with congestion, traffic overload being worsened by non-essential routines running in parallel on the same server in early years. I can't comment on the precise reasons per occasion why page view count files got missing/corrupt. We haven't kept a j

Re: [Analytics] User statistics for video marking ENWP 5m article milestone

2015-09-12 Thread Erik Zachte
Hi Pine, > I think that the definition on Special:Statistics makes more sense for > "active editors" than the >=5 definition than is commonly used in discussions > on mailing lists. tl;dr 'active editor' is a term with a long history. If we recoin that term and keep informing the publi

Re: [Wikimedia-l] Increase in size of the core editing community

2015-09-10 Thread Erik Zachte
James, A) Should we value editors with many edits more than editors with just a few? Your counter-example (editors who write a long article in one go offline) is canonical, and probably uncontested, so you're stating the obvious, no need to use a loaded term like 'offensive', and to spell it ou

Re: [Analytics] Editor population stats for August

2015-09-08 Thread Erik Zachte
(note here first column is not deduplicated like above, pls ignore) for charts see http://stats.wikimedia.org/EN/ReportCardTopWikis.htm Cheers, Erik From: Erik Zachte [mailto:ezac...@wikimedia.org] Sent: Tuesday, September 01, 2015 13:59 To: 'A mailing list for the Analytics

Re: [Analytics] Editor population stats for August

2015-09-01 Thread Erik Zachte
Hi Pine, Expect Wikistats reports mid September. Since a few months the stub dumps are produced separately, which speeded up the process considerably. I expect all stub dumps are done around 8th/9th of the month. It takes up to week after that to produce the counts and reports. BTW

Re: [Analytics] Webrequest loss on 08-03 and 08-10

2015-08-28 Thread Erik Zachte
So worst case (no data at all) our monthly PV totals will be down with 1.6% (12/744). I marked these periods as invalid in my webstatscollector 2.0 client so that totals will be extrapolated from remaining 732 hours. From: analytics-boun...@lists.wikimedia.org [mailto:analytics-boun...@list

Re: [Analytics] Pageviews definition + measurement for apps adding link previews + using RESTBase

2015-08-19 Thread Erik Zachte
+1 on baseline inflation We will have a hard time to connect historic and future pageview counts anyway, now that we migrate to new infrastructure (mostly because historic counts didn't exluded crawlers). But at least the concepts of 'page' and 'view' haven't changed much in all these years

Re: [Wikimedia-l] GA Stats using Wikimedia Stats

2015-08-19 Thread Erik Zachte
Hi Tito, Wikistats can collect pageviews for a certain category and its subcategories. In English Wikipedia I just ran the script for categories WikiProject_Featured_articles and WikiProject_Good_articles Featured articles, 1 pageviews 2 categories included 1 http://stats.wikimedia.org/wikimed

Re: [Wikitech-l] Geohack tools

2015-08-18 Thread Erik Zachte
As for realtime, I recommend caution with burdening Wikipedia with even more highly transient information, at least within our current database scheme. For years Serbian Wikinews has been inundated with weather info, hourly (!), per city (!), and thus managed 3 million revisions with a handful o

Re: [Analytics] New to list; please direct me to the tool(s) that I can use to determine per-page views per category/WikiProject

2015-07-29 Thread Erik Zachte
Hi, I have a batch tool which collects article titles for any category and its subcategories (up to an arbitrary depth), then collects the page views for those articles for any given month and prints a sorted list. For optimal results the parsed category subtree often needs manual pruning (so we

Re: [Analytics] proposal to axe current traffic reports

2015-07-26 Thread Erik Zachte
ing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [Analytics] proposal to axe current traffic reports On Fri, Jul 24, 2015 at 1:25 PM, Erik Zachte wrote: Wikistats broadly comes in two parts - A Content and activity reports per

Re: [Analytics] proposal to axe current traffic reports

2015-07-24 Thread Erik Zachte
27; Subject: Re: [Analytics] proposal to axe current traffic reports Erik Zachte, 24/07/2015 18:59: > I think the time has come to disable the traffic reports based on > webstatscollector (2.0) data. > > See > http://stats.wikimedia.org/cgi-bin/search_portal.pl?search=breakdown+o &g

Re: [Analytics] proposal to axe current traffic reports

2015-07-24 Thread Erik Zachte
* The data to power any of these reports is in great shape and is on the hadoop cluster in a neatly pre-aggregated hourly table. We could use that to start replicating these more detailed analyses from Wikistats. On Fri, Jul 24, 2015 at 12:59 PM, Erik Zachte wrote: Hi all, I think th

[Analytics] proposal to axe current traffic reports

2015-07-24 Thread Erik Zachte
also https://phabricator.wikimedia.org/T44259 Whether WMF will also assume responsibility for building new reports on top of that API (and if so in what form) is another matter, but first things first. Current focus is on providing that API, as it should be IMO. Any thoughts? Erik Z

Re: [Analytics] the data for pageviews across all languages for wikisource seems to have a bug in it

2015-07-22 Thread Erik Zachte
We have seen underreporting in recent months far beyond Wikisource. See also https://phabricator.wikimedia.org/T106034 'Too few page views for June/July 2015' Erik Zachte From: analytics-boun...@lists.wikimedia.org [mailto:analytics-boun...@lists.wikimedia.org] On Behalf

Re: [Wikimedia-l] Wikimedia Board of Trustees Chair and Vice Chair positions

2015-07-17 Thread Erik Zachte
all board members, new and old, for their willingess to bear this enormous responsibility. Special mention for Jan-Bart. May you and your family enjoy the step-wise growing control over your personal timetable :-) Erik Zachte -Original Message- From: wikimedia-l-boun

Re: [Analytics] The awful truth about Wikimedia's article counts

2015-05-22 Thread Erik Zachte
ay we thought it had been 2.1%" Erik -Original Message----- From: Erik Zachte [mailto:ezac...@wikimedia.org] Sent: Friday, May 22, 2015 23:15 To: 'A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.' Subject: RE: [Analyt

Re: [Analytics] The awful truth about Wikimedia's article counts

2015-05-22 Thread Erik Zachte
Historically consistent? Hmm, the article's main story is about how historical in-wiki data are unreliable and a periodic recount is needed. Just saying. And the main theme in comments is "do we care about article count?" Erik -Original Message- From: analytics-boun...@lists.wikimedia.o

Re: [Analytics] need correction range for converting stats.grok.se to total views including mobile...

2015-05-13 Thread Erik Zachte
15/2015-05/ (mobile and non-mobile) Erik Zachte -Original Message- From: analytics-boun...@lists.wikimedia.org [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Oliver Keyes Sent: Wednesday, May 13, 2015 19:30 To: A mailing list for the Analytics Team at WMF and everybody who has an

[MediaWiki-commits] [Gerrit] Link tables from edit/reverts reports - change (analytics/wikistats)

2015-05-07 Thread Erik Zachte (Code Review)
Erik Zachte has submitted this change and it was merged. Change subject: Link tables from edit/reverts reports .. Link tables from edit/reverts reports Link structure as in dumps/perl/WikiReportsOutputMisc.pm. Bug: 65992

[MediaWiki-commits] [Gerrit] Add perl dependencies to a CollectMailArchives.pl comment - change (analytics/wikistats)

2015-05-07 Thread Erik Zachte (Code Review)
Erik Zachte has submitted this change and it was merged. Change subject: Add perl dependencies to a CollectMailArchives.pl comment .. Add perl dependencies to a CollectMailArchives.pl comment Change-Id

[MediaWiki-commits] [Gerrit] Add some more author aliases - change (analytics/wikistats)

2015-05-07 Thread Erik Zachte (Code Review)
Erik Zachte has submitted this change and it was merged. Change subject: Add some more author aliases .. Add some more author aliases Erik applied the patch to his server and it seems to be working, e.g. http://web.archive.org

Re: [Analytics] article creation stuck in February

2015-04-30 Thread Erik Zachte
Once all reports have been regenerated the official stats are updated, after some deliberately manual steps (vetting + manual run for deduped total editors) Newest fa dump is corrupt, but also commons dump parsing is in process and wikidata will follow as soon as new stub dump is available https

Re: [Wikimedia-l] A transition and a new chapter.

2015-04-13 Thread Erik Zachte
as a volunteer. Much too early to adopt the title of 'Wikimedia innovator emeritus'. All best with you next adventure. Erik Zachte ___ Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guideli

Re: [Analytics] Country data from Squid

2015-04-01 Thread Erik Zachte
bugs. Best regards, Erik Zachte From: analytics-boun...@lists.wikimedia.org [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Jakub Havlík Sent: Monday, March 30, 2015 9:44 To: analytics@lists.wikimedia.org Subject: [Analytics] Country data from Squid Hi, I'm a student of com

Re: [Analytics] Draft blog post on decline in Wikipedia pageviews: looking for analytics explanations

2015-03-26 Thread Erik Zachte
. (first bar chart in that row, column Sigma) Best regards, Erik Zachte From: analytics-boun...@lists.wikimedia.org [mailto:analytics-boun...@lists.wikimedia.org] On Behalf Of Vipul Naik Sent: Thursday, March 26, 2015 15:49 To: A mailing list for the Analytics Team at WMF and

  1   2   3   4   >