Re: [Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread Dario Taraborelli
Thanks Jérémie, we are definitely aiming for a more official announcement. The reason for the soft launch is that, after experimenting for a few months with the DataHub, we are still reporting issues to the developers that need to be addressed before a broader announcement. The CKAN data browse

Re: [Wiki-research-l] A wiki search engine

2012-10-22 Thread ENWP Pine
I agree that this sounds like an interesting experiment. I hope that you get good faith editors. I worry that you’ll get COI editors playing with the search rankings. Do you have a way in mind to deal with that issue? Pine From: emijrp Sent: Monday, 22 October, 2012 08:29 To: Research into Wi

Re: [Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread Jérémie Roquet
cc-ed xmldatadumps-l Hi, 2012/10/23 Dario Taraborelli : > 2012/10/23 James Forrester : >> On 22 October 2012 16:03, Hydriz Wikipedia wrote: >>> I have long been wanting to say this, but is it possible for the team behind >>> compiling such datasets to put future (and if possible, current) datase

Re: [Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread Dario Taraborelli
The dumps server was never meant to become a permanent open data repository, but it started being used as an ad-hoc solution to host all sorts of datasets published by WMF on top of the actual XML dumps: that's the problem we're trying to fix. Regardless of where the data is physically hosted

Re: [Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread James Forrester
On 22 October 2012 16:03, Hydriz Wikipedia wrote: > Hi all, > > I have long been wanting to say this, but is it possible for the team behind > compiling such datasets to put future (and if possible, current) datasets > into dumps.wikimedia.org so that it is easier for everyone to find stuff and >

Re: [Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread Hydriz Wikipedia
Hi all, I have long been wanting to say this, but is it possible for the team behind compiling such datasets to put future (and if possible, current) datasets into dumps.wikimedia.org so that it is easier for everyone to find stuff and not be all over the place? Thanks for that! On Tue, Oct 23, 2

[Wiki-research-l] 1-year dump of English Wikipedia article ratings

2012-10-22 Thread Dario Taraborelli
We've released a full, anonymized dump of article ratings (aka AFTv4) collected over 1 year since the deployment of the tool on the entire English Wikipedia (July 22, 2011 - July 22, 2012). http://thedatahub.org/en/dataset/wikipedia-article-ratings The dataset (which includes 11m unique article

[Wiki-research-l] A wiki search engine

2012-10-22 Thread emijrp
Hi all; I'm starting a new project, a wiki search engine. It uses MediaWiki, Semantic MediaWiki and other minor extensions, and some tricky templates and bots. I remember Wikia Search and how it failed. It had the mini-article thingy for the introduction, and then a lot of links compiled by a cra