Re: [Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins
Hi all, FYI: the solution proposed below worked just fine. Thanks Dario! :) Cheers, Jérôme. 2013/10/10 Dario Taraborelli > Hi Jerôme, > > most of the actions you refer to are not stored as edits by mediawiki. > They can be accessed via the logging table [1] (with log_type 'delete' or > 'block'), which is replicated on tool labs (you can apply for a tool labs > account if you don't have one [2]). > > HTH > > Dario > > [1] https://www.mediawiki.org/wiki/Manual:Logging_table > [2] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help > > On Oct 10, 2013, at 10:02 AM, "Klein,Max" wrote: > > Hello Jerome, > > I'm not sure this is the best way, but pywikipediabot [1] has a library > called pagegenerators.py and there is a function *def > UserContributionsGenerator(username)* (around line 706). That would allow > you to iterate through theses user names, and I bet there will be a special > marking for deletions/undeletions. If not, worst comes to worse you can use > a regular expression for those words. > > [1] https://meta.wikimedia.org/wiki/pywikipediabot > > When you use have a pywikibot-hammer everything looks like a > pywikibot-nail! > > Maximilian Klein > Wikipedian in Residence, OCLC > +17074787023 > > ------ > *From:* wiki-research-l-boun...@lists.wikimedia.org < > wiki-research-l-boun...@lists.wikimedia.org> on behalf of Jérôme Hergueux > > *Sent:* Thursday, October 10, 2013 3:11 AM > *To:* wiki-research-l@lists.wikimedia.org > *Subject:* [Wiki-research-l] How to collect all the admin-specific edits > for a subset of Wp admins > > Dear all, > > I am starting this thread in the hope that some of the great Wiki > researchers on this list could advise me on a data collection problem. > > Here is the question: for a each of 120 Wikipedia admins (for whom I have > the usernames and unique numeric ids), I would like to reliably count the > number of times they (i) deleted a page (ii) undeleted (i.e. restored) a > page (iii) protected a page (iv) blocked a user and (v) unblocked a user. > Those types of edits all correspond to a specific "action" in the > Wikipedia API documentation page (http://en.wikipedia.org/w/api.php): > action=delete,action=undelete, action=protect, action=block and > action=unblock. > > I don't know, however, what would be the best strategy to go about > collecting those edits. Does anyone have an idea about which data > collection strategy I should adopt in this case? Is there a way to query > the Wikipedia API directly, or should I look for some specific markers in > the edit summaries? > > I would be very grateful for any advice of feedback! > Thanks much for your attention and time. :) > > Best, > > Jérôme. > ___ > Wiki-research-l mailing list > Wiki-research-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > > > ___ > Wiki-research-l mailing list > Wiki-research-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Re: [Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins
Thank you all for your kind help and suggestions!! :) Let me dig in a little bit and see what works best. I'll let you know how it goes! Best, Jérôme. 2013/10/10 Dario Taraborelli > Hi Jerôme, > > most of the actions you refer to are not stored as edits by mediawiki. > They can be accessed via the logging table [1] (with log_type 'delete' or > 'block'), which is replicated on tool labs (you can apply for a tool labs > account if you don't have one [2]). > > HTH > > Dario > > [1] https://www.mediawiki.org/wiki/Manual:Logging_table > [2] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help > > On Oct 10, 2013, at 10:02 AM, "Klein,Max" wrote: > > Hello Jerome, > > I'm not sure this is the best way, but pywikipediabot [1] has a library > called pagegenerators.py and there is a function *def > UserContributionsGenerator(username)* (around line 706). That would allow > you to iterate through theses user names, and I bet there will be a special > marking for deletions/undeletions. If not, worst comes to worse you can use > a regular expression for those words. > > [1] https://meta.wikimedia.org/wiki/pywikipediabot > > When you use have a pywikibot-hammer everything looks like a > pywikibot-nail! > > Maximilian Klein > Wikipedian in Residence, OCLC > +17074787023 > > ------ > *From:* wiki-research-l-boun...@lists.wikimedia.org < > wiki-research-l-boun...@lists.wikimedia.org> on behalf of Jérôme Hergueux > > *Sent:* Thursday, October 10, 2013 3:11 AM > *To:* wiki-research-l@lists.wikimedia.org > *Subject:* [Wiki-research-l] How to collect all the admin-specific edits > for a subset of Wp admins > > Dear all, > > I am starting this thread in the hope that some of the great Wiki > researchers on this list could advise me on a data collection problem. > > Here is the question: for a each of 120 Wikipedia admins (for whom I have > the usernames and unique numeric ids), I would like to reliably count the > number of times they (i) deleted a page (ii) undeleted (i.e. restored) a > page (iii) protected a page (iv) blocked a user and (v) unblocked a user. > Those types of edits all correspond to a specific "action" in the > Wikipedia API documentation page (http://en.wikipedia.org/w/api.php): > action=delete,action=undelete, action=protect, action=block and > action=unblock. > > I don't know, however, what would be the best strategy to go about > collecting those edits. Does anyone have an idea about which data > collection strategy I should adopt in this case? Is there a way to query > the Wikipedia API directly, or should I look for some specific markers in > the edit summaries? > > I would be very grateful for any advice of feedback! > Thanks much for your attention and time. :) > > Best, > > Jérôme. > ___ > Wiki-research-l mailing list > Wiki-research-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > > > ___ > Wiki-research-l mailing list > Wiki-research-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
[Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins
Dear all, I am starting this thread in the hope that some of the great Wiki researchers on this list could advise me on a data collection problem. Here is the question: for a each of 120 Wikipedia admins (for whom I have the usernames and unique numeric ids), I would like to reliably count the number of times they (i) deleted a page (ii) undeleted (i.e. restored) a page (iii) protected a page (iv) blocked a user and (v) unblocked a user. Those types of edits all correspond to a specific "action" in the Wikipedia API documentation page (http://en.wikipedia.org/w/api.php): action=delete, action=undelete, action=protect, action=block and action=unblock. I don't know, however, what would be the best strategy to go about collecting those edits. Does anyone have an idea about which data collection strategy I should adopt in this case? Is there a way to query the Wikipedia API directly, or should I look for some specific markers in the edit summaries? I would be very grateful for any advice of feedback! Thanks much for your attention and time. :) Best, Jérôme. ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l