Re: [Wiki-research-l] Fwd: [Wikitech-l] statistics about frequent section titles

2016-03-02 Thread Tilman Bayer
Bumping this thread - has anyone made progress on this, for example to determine the percentage of enwiki articles that contain one of these standard sections? (I'm also curious how Danny B - BCCed - generates the lists at https://cs.wiktionary.org/wiki/User:Danny_B./Datamining/Nadpisy that Petr m

[Wiki-research-l] Ethical implication of humanitarian data use

2016-03-02 Thread Toby Negrin
It's in my to-be-read list but I thought that this would be interesting to this group: http://cis-india.org/papers/ebola-a-big-data-disaster -Toby ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/l

Re: [Wiki-research-l] category system is hopelessly muddled

2016-03-02 Thread Gerard Meijssen
Hoi, I love categories, I use them all the time ... to export data from a Wikipedia to Wikidata. The big thing is that many categories are linked through interwiki links and consequently the same routines can be used for all of them. There are many categories where the overlap is very small. This d