Re: [Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins

2013-11-15 Thread Jérôme Hergueux
Hi all,

FYI: the solution proposed below worked just fine. Thanks Dario! :)

Cheers,

Jérôme. 


2013/10/10 Dario Taraborelli 

> Hi Jerôme,
>
> most of the actions you refer to are not stored as edits by mediawiki.
> They can be accessed via the logging table [1] (with log_type 'delete' or
> 'block'), which is replicated on tool labs (you can apply for a tool labs
> account if you don't have one [2]).
>
> HTH
>
> Dario
>
> [1] https://www.mediawiki.org/wiki/Manual:Logging_table
> [2] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help
>
> On Oct 10, 2013, at 10:02 AM, "Klein,Max"  wrote:
>
> Hello Jerome,
>
> I'm not sure this is the best way, but pywikipediabot [1] has a library
> called pagegenerators.py and there is a function *def
> UserContributionsGenerator(username)* (around line 706). That would allow
> you to iterate through theses user names, and I bet there will be a special
> marking for deletions/undeletions. If not, worst comes to worse you can use
> a regular expression for those words.
>
> [1] https://meta.wikimedia.org/wiki/pywikipediabot
>
> When you use have a pywikibot-hammer everything looks like a
> pywikibot-nail!
>
> Maximilian Klein
> Wikipedian in Residence, OCLC
> +17074787023
>
> ------
> *From:* wiki-research-l-boun...@lists.wikimedia.org <
> wiki-research-l-boun...@lists.wikimedia.org> on behalf of Jérôme Hergueux
> 
> *Sent:* Thursday, October 10, 2013 3:11 AM
> *To:* wiki-research-l@lists.wikimedia.org
> *Subject:* [Wiki-research-l] How to collect all the admin-specific edits
> for a subset of Wp admins
>
> Dear all,
>
> I am starting this thread in the hope that some of the great Wiki
> researchers on this list could advise me on a data collection problem.
>
> Here is the question: for a each of 120 Wikipedia admins (for whom I have
> the usernames and unique numeric ids), I would like to reliably count the
> number of times they (i) deleted a page (ii) undeleted (i.e. restored) a
> page (iii) protected a page (iv) blocked a user and (v) unblocked a user.
> Those types of edits all correspond to a specific "action" in the
> Wikipedia API documentation page (http://en.wikipedia.org/w/api.php):
> action=delete,action=undelete, action=protect, action=block and
> action=unblock.
>
> I don't know, however, what would be the best strategy to go about
> collecting those edits. Does anyone have an idea about which data
> collection strategy I should adopt in this case? Is there a way to query
> the Wikipedia API directly, or should I look for some specific markers in
> the edit summaries?
>
> I would be very grateful for any advice of feedback!
> Thanks much for your attention and time. :)
>
> Best,
>
> Jérôme.
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins

2013-10-11 Thread Jérôme Hergueux
Thank you all for your kind help and suggestions!! :)

Let me dig in a little bit and see what works best. I'll let you know how
it goes!

Best,

Jérôme.


2013/10/10 Dario Taraborelli 

> Hi Jerôme,
>
> most of the actions you refer to are not stored as edits by mediawiki.
> They can be accessed via the logging table [1] (with log_type 'delete' or
> 'block'), which is replicated on tool labs (you can apply for a tool labs
> account if you don't have one [2]).
>
> HTH
>
> Dario
>
> [1] https://www.mediawiki.org/wiki/Manual:Logging_table
> [2] https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/Help
>
> On Oct 10, 2013, at 10:02 AM, "Klein,Max"  wrote:
>
> Hello Jerome,
>
> I'm not sure this is the best way, but pywikipediabot [1] has a library
> called pagegenerators.py and there is a function *def
> UserContributionsGenerator(username)* (around line 706). That would allow
> you to iterate through theses user names, and I bet there will be a special
> marking for deletions/undeletions. If not, worst comes to worse you can use
> a regular expression for those words.
>
> [1] https://meta.wikimedia.org/wiki/pywikipediabot
>
> When you use have a pywikibot-hammer everything looks like a
> pywikibot-nail!
>
> Maximilian Klein
> Wikipedian in Residence, OCLC
> +17074787023
>
> ------
> *From:* wiki-research-l-boun...@lists.wikimedia.org <
> wiki-research-l-boun...@lists.wikimedia.org> on behalf of Jérôme Hergueux
> 
> *Sent:* Thursday, October 10, 2013 3:11 AM
> *To:* wiki-research-l@lists.wikimedia.org
> *Subject:* [Wiki-research-l] How to collect all the admin-specific edits
> for a subset of Wp admins
>
> Dear all,
>
> I am starting this thread in the hope that some of the great Wiki
> researchers on this list could advise me on a data collection problem.
>
> Here is the question: for a each of 120 Wikipedia admins (for whom I have
> the usernames and unique numeric ids), I would like to reliably count the
> number of times they (i) deleted a page (ii) undeleted (i.e. restored) a
> page (iii) protected a page (iv) blocked a user and (v) unblocked a user.
> Those types of edits all correspond to a specific "action" in the
> Wikipedia API documentation page (http://en.wikipedia.org/w/api.php):
> action=delete,action=undelete, action=protect, action=block and
> action=unblock.
>
> I don't know, however, what would be the best strategy to go about
> collecting those edits. Does anyone have an idea about which data
> collection strategy I should adopt in this case? Is there a way to query
> the Wikipedia API directly, or should I look for some specific markers in
> the edit summaries?
>
> I would be very grateful for any advice of feedback!
> Thanks much for your attention and time. :)
>
> Best,
>
> Jérôme.
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


[Wiki-research-l] How to collect all the admin-specific edits for a subset of Wp admins

2013-10-10 Thread Jérôme Hergueux
Dear all,

I am starting this thread in the hope that some of the great Wiki
researchers on this list could advise me on a data collection problem.

Here is the question: for a each of 120 Wikipedia admins (for whom I have
the usernames and unique numeric ids), I would like to reliably count the
number of times they (i) deleted a page (ii) undeleted (i.e. restored) a
page (iii) protected a page (iv) blocked a user and (v) unblocked a user.
Those types of edits all correspond to a specific "action" in the Wikipedia
API documentation page (http://en.wikipedia.org/w/api.php): action=delete,
action=undelete, action=protect, action=block and action=unblock.
I don't know, however, what would be the best strategy to go about
collecting those edits. Does anyone have an idea about which data
collection strategy I should adopt in this case? Is there a way to query
the Wikipedia API directly, or should I look for some specific markers in
the edit summaries?

I would be very grateful for any advice of feedback!
Thanks much for your attention and time. :)

Best,

Jérôme.
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l