Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Wikipedia Signpost
Hi,

On Tue, Jan 25, 2011 at 9:30 PM, Felipe Ortega  wrote:
> You can access, from the corresponding "View history" page:
>
...
>
> I don't know when (exactly) these services were activated.

Most of them were added to the "View history" page in 2008 and 2009:

http://en.wikipedia.org/w/index.php?title=MediaWiki:Histlegend&action=history

We featured an overview of such page history related tools in the
Signpost a while ago, also mentioning a few others:

http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2010-09-20/Dispatches

Regards, HaeB

-- 
Wikipedia Signpost Staff
http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Andrew G. West
Dario,

Yes, it is certainly the same data source.

First, I wasn't aware there was a JSON API for [http://stats.grok.se] -- 
can you provide everyone a link to it?

Second, at least in visual form, that site presents only daily totals. 
The actual data uses hourly dumps -- and I was thinking my contribution 
could be finer granularity for those who need it (assuming I am not 
mistaken).

Thanks, -AW


On 01/25/2011 06:06 PM, Dario Taraborelli wrote:
> apologies – that's obviously just an interface to Domas Mituzas' raw data!
>
> Dario
>
> On 25 Jan 2011, at 23:02, Dario Taraborelli wrote:
>
>> Andrew,
>>
>>> So, while I'm yet to develop this into a formal public-facing API -- I'd
>>> be willing to run queries for interested researchers -- and they should
>>> feel free to contact me.
>>
>> are you aware of this tool based on your data: http://stats.grok.se ?
>>
>> It also has a JSON interface, which is really handy (I used it with a simple 
>> python script to download view stats for a sample of pages in a given 
>> timeframe)
>>
>> Dario
>
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

-- 
Andrew G. West, Doctoral Student
Dept. of Computer and Information Science
University of Pennsylvania, Philadelphia PA
Phone:   (304)-415-5824
Email:   west...@cis.upenn.edu
Website: http://www.cis.upenn.edu/~westand

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Dario Taraborelli
apologies – that's obviously just an interface to Domas Mituzas' raw data!

Dario

On 25 Jan 2011, at 23:02, Dario Taraborelli wrote:

> Andrew,
> 
>> So, while I'm yet to develop this into a formal public-facing API -- I'd 
>> be willing to run queries for interested researchers -- and they should 
>> feel free to contact me.
> 
> are you aware of this tool based on your data: http://stats.grok.se ?
> 
> It also has a JSON interface, which is really handy (I used it with a simple 
> python script to download view stats for a sample of pages in a given 
> timeframe)
> 
> Dario


___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Dario Taraborelli
Andrew,

> So, while I'm yet to develop this into a formal public-facing API -- I'd 
> be willing to run queries for interested researchers -- and they should 
> feel free to contact me.

are you aware of this tool based on your data: http://stats.grok.se ?

It also has a JSON interface, which is really handy (I used it with a simple 
python script to download view stats for a sample of pages in a given timeframe)

Dario
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Andrew G. West
I'll add another note to this "article view" discussion:

I have parsed the hourly, per-page statistics at 
[http://dammit.lt/wikistats/]. If one assumes uniform intra-hour 
distributions, this makes it possible to arrive at highly accurate view 
estimates for arbitrary pages, for arbitrary time intervals.

I have found this useful to measure how many people saw a particular 
revision and used this heavily in my anti-vandalism research.

I believe this is the same data source all these other services are 
using -- but I don't do any aggregation. I've got data for all of 2010 
for en.wiki (some 400+GB). I'd imagine this volume of parsing and 
storage isn't something all Wiki researchers are capable of.

So, while I'm yet to develop this into a formal public-facing API -- I'd 
be willing to run queries for interested researchers -- and they should 
feel free to contact me.

Thanks, -Andrew G. West


On 01/25/2011 04:22 PM, Carlos d'Andréa wrote:
> Hi, Felipe,
>
> these tools are really useful!
>
> I like much the "Wikipedia Page History Statistics" too:
> http://vs.aka-online.de/cgi-bin/wppagehiststat.pl
>
> Here in Brazil I've developed (with a computer science student) a tool
> that extracs other interesting data from pages history, like number of
> protections and duration of time of each, number of revertions and
> editions undone, number anda percentage of editions made by
> administrators, bots and IP etc.
>
> Unfortunately it works only in portuguese Wikipedia, but we are very
> interessed in open the code e make it better.
>
> BTW, as it's my first mensage here, let me present myself: I'm
> journalist, teacher in Federal University of Viçosa and PHD student in
> Applied Linguistics in Minas Gerais Federal University. In summary, I'm
> studing the editorial process of "Biographies of living persons" in
> portuguese Wikipedia.
>
> Best,
>
> --
> Carlos d'Andréa
> carlosdand.com 
> novasm.blogspot.com 
>
>
>
> On Tue, Jan 25, 2011 at 6:30 PM, Felipe Ortega  > wrote:
>
> Hi all.
>
> I just discovered this, it may be potentially interesting for the
> Wikipedia
> research community.
>
> In short, now for any Wikipedia page, not only articles, e.g.
>
> http://en.wikipedia.org/wiki/History_of_free_and_open_source_software
>
> You can access, from the corresponding "View history" page:
>
> * Nice stats (via soxred93 tool in Toolserver) :
> 
> http://toolserver.org/~soxred93/articleinfo/index.php?article=History_of_Free_Software
> 
> 〈=en&wiki=wikipedia
>
>
> * Ranked contributors (Daniel's tool in Toolserver):
> 
> http://toolserver.org/~daniel/WikiSense/Contributors.php?wikilang=en&wikifam=.wikipedia.org&grouped=on&page=History_of_Free_Software
> 
> 
>
>
> * Revision history search (WikiBlame):
> 
> http://wikipedia.ramselehof.de/wikiblame.php?lang=en&article=History_of_Free_Software
> 
> 
>
>
> * Page view statistics:
> http://stats.grok.se/en/201101/History_of_Free_Software
>
> And... incredible:
>
> * Number of watchers (!!!) (mzmcbride tool in Toolserver):
> 
> http://toolserver.org/~mzmcbride/cgi-bin/watcher.py?db=enwiki_p&titles=History_of_Free_Software
> 
> 
>
>
> I don't know when (exactly) these services were activated.
>
> I've also found some (still inactive) "API" links. Anybody has any
> further info
> about this?
>
> Cheers,
> Felipe.
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> 
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

-- 
Andrew G. West, Doctoral Student
Dept. of Computer and Information Science
University of Pennsylvania, Philadelphia PA
Phone:   (304)-415-5824
Email:   west...@cis.upenn.edu
Website: http://www.cis.upenn.edu/~westand

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Felipe Ortega
De: Carlos d'Andréa 

Para: Research into Wikimedia content and communities 

Enviado: mar,25 enero, 2011 22:22
Asunto: Re: [Wiki-research-l] New toolbox Wikipedia pages

Hi, Felipe, 

these tools are really useful! 

I like much the "Wikipedia Page History Statistics" too: 
http://vs.aka-online.de/cgi-bin/wppagehiststat.pl

Here in Brazil I've developed (with a computer science student) a tool that 
extracs other interesting data from pages history, like number of protections 
and duration of time of each, number of revertions and editions undone, number 
anda percentage of editions made by administrators, bots and IP etc.

Unfortunately it works only in portuguese Wikipedia, but we are very interessed 
in open the code e make it better.

Nice to meet you, Carlos.

You might also like:
http://meta.wikimedia.org/wiki/Statistics

There are some tools producing stats for any language, including:

http://meta.wikimedia.org/wiki/StatMediaWiki
http://meta.wikimedia.org/wiki/WikiXRay

Best,
Felipe

BTW, as it's my first mensage here, let me present myself: I'm journalist, 
teacher in Federal University of Viçosa and PHD student in Applied Linguistics 
in Minas Gerais Federal University. In summary, I'm studing the editorial 
process of "Biographies of living persons" in portuguese Wikipedia.

Best,

-- 
Carlos d'Andréa
carlosdand.com
novasm.blogspot.com


On Tue, Jan 25, 2011 at 6:30 PM, Felipe Ortega  wrote:

Hi all.
>
>I just discovered this, it may be potentially interesting for the Wikipedia
>research community.
>
>In short, now for any Wikipedia page, not only articles, e.g.
>
>http://en.wikipedia.org/wiki/History_of_free_and_open_source_software
>
>You can access, from the corresponding "View history" page:
>
>* Nice stats (via soxred93 tool in Toolserver) :
>http://toolserver.org/~soxred93/articleinfo/index.php?article=History_of_Free_Software〈=en&wiki=wikipedia
>
>
>
>* Ranked contributors (Daniel's tool in Toolserver):
>http://toolserver.org/~daniel/WikiSense/Contributors.php?wikilang=en&wikifam=.wikipedia.org&grouped=on&page=History_of_Free_Software
>
>
>
>* Revision history search (WikiBlame):
>http://wikipedia.ramselehof.de/wikiblame.php?lang=en&article=History_of_Free_Software
>
>
>
>* Page view statistics: http://stats.grok.se/en/201101/History_of_Free_Software
>
>And... incredible:
>
>* Number of watchers (!!!) (mzmcbride tool in Toolserver):
>http://toolserver.org/~mzmcbride/cgi-bin/watcher.py?db=enwiki_p&titles=History_of_Free_Software
>
>
>
>I don't know when (exactly) these services were activated.
>
>I've also found some (still inactive) "API" links. Anybody has any further info
>about this?
>
>Cheers,
>Felipe.
>
>
>
>
>
>___
>Wiki-research-l mailing list
>Wiki-research-l@lists.wikimedia.org
>https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>


  ___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Carlos d'Andréa
Hi, Felipe,

these tools are really useful!

I like much the "Wikipedia Page History Statistics" too:
http://vs.aka-online.de/cgi-bin/wppagehiststat.pl

Here in Brazil I've developed (with a computer science student) a tool that
extracs other interesting data from pages history, like number of
protections and duration of time of each, number of revertions and editions
undone, number anda percentage of editions made by administrators, bots and
IP etc.

Unfortunately it works only in portuguese Wikipedia, but we are very
interessed in open the code e make it better.

BTW, as it's my first mensage here, let me present myself: I'm journalist,
teacher in Federal University of Viçosa and PHD student in Applied
Linguistics in Minas Gerais Federal University. In summary, I'm studing the
editorial process of "Biographies of living persons" in portuguese
Wikipedia.

Best,

-- 
Carlos d'Andréa
carlosdand.com
novasm.blogspot.com

On Tue, Jan 25, 2011 at 6:30 PM, Felipe Ortega wrote:

> Hi all.
>
> I just discovered this, it may be potentially interesting for the Wikipedia
> research community.
>
> In short, now for any Wikipedia page, not only articles, e.g.
>
> http://en.wikipedia.org/wiki/History_of_free_and_open_source_software
>
> You can access, from the corresponding "View history" page:
>
> * Nice stats (via soxred93 tool in Toolserver) :
>
> http://toolserver.org/~soxred93/articleinfo/index.php?article=History_of_Free_Software
> 〈=en&wiki=wikipedia
>
>
> * Ranked contributors (Daniel's tool in Toolserver):
>
> http://toolserver.org/~daniel/WikiSense/Contributors.php?wikilang=en&wikifam=.wikipedia.org&grouped=on&page=History_of_Free_Software
>
>
> * Revision history search (WikiBlame):
>
> http://wikipedia.ramselehof.de/wikiblame.php?lang=en&article=History_of_Free_Software
>
>
> * Page view statistics:
> http://stats.grok.se/en/201101/History_of_Free_Software
>
> And... incredible:
>
> * Number of watchers (!!!) (mzmcbride tool in Toolserver):
>
> http://toolserver.org/~mzmcbride/cgi-bin/watcher.py?db=enwiki_p&titles=History_of_Free_Software
>
>
> I don't know when (exactly) these services were activated.
>
> I've also found some (still inactive) "API" links. Anybody has any further
> info
> about this?
>
> Cheers,
> Felipe.
>
>
>
>
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


[Wiki-research-l] New toolbox Wikipedia pages

2011-01-25 Thread Felipe Ortega
Hi all.

I just discovered this, it may be potentially interesting for the Wikipedia 
research community.

In short, now for any Wikipedia page, not only articles, e.g.

http://en.wikipedia.org/wiki/History_of_free_and_open_source_software

You can access, from the corresponding "View history" page:

* Nice stats (via soxred93 tool in Toolserver) : 
http://toolserver.org/~soxred93/articleinfo/index.php?article=History_of_Free_Software〈=en&wiki=wikipedia


* Ranked contributors (Daniel's tool in Toolserver): 
http://toolserver.org/~daniel/WikiSense/Contributors.php?wikilang=en&wikifam=.wikipedia.org&grouped=on&page=History_of_Free_Software


* Revision history search (WikiBlame): 
http://wikipedia.ramselehof.de/wikiblame.php?lang=en&article=History_of_Free_Software


* Page view statistics: http://stats.grok.se/en/201101/History_of_Free_Software

And... incredible:

* Number of watchers (!!!) (mzmcbride tool in Toolserver): 
http://toolserver.org/~mzmcbride/cgi-bin/watcher.py?db=enwiki_p&titles=History_of_Free_Software


I don't know when (exactly) these services were activated.

I've also found some (still inactive) "API" links. Anybody has any further info 
about this?

Cheers,
Felipe.



  

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l