Re: [Wiki-research-l] [Analytics] Bots vs. Wikipedians – Who edits more?

2013-10-14 Thread Diederik van Liere
Very cool. If you include wikidata then more than 50% of the edits on the Wikimedia projects are made by bots. One of the dead horses I like to beat is that bot editors should be treated as first class citizens of Wikipedia and this data nicely illustrates that. I think this is a bigger watershed

Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-11 Thread Diederik van Liere
*From: *Susan Biancani inacn...@gmail.com *Subject: **[Wiki-research-l] diffdb formatted Wikipedia dump* *Date: *October 3, 2013 10:06:44 PM PDT *To: *wiki-research-l@lists.wikimedia.org *Reply-To: *Research into Wikimedia content and communities wiki-research-l@lists.wikimedia.org I'm

[Wiki-research-l] Announcing availability new dataset diffdb

2011-11-04 Thread Diederik van Liere
application [1] that will allow us to quickly search for specific strings being added or removed. If you have any questions, then please let me know! [0] https://github.com/whym/wikihadoop [1] https://github.com/whym/diffindexer Best regards, Diederik van Liere

Re: [Wiki-research-l] Announcing availability new dataset diffdb

2011-11-04 Thread Diederik van Liere
bandwidth if I know that I can not deal with it ;). By the way, what you did is exactly what I just started working on to implement for my project, so thanks a lot :) Regards. On Fri, Nov 4, 2011 at 13:19, Diederik van Liere dvanli...@gmail.comwrote: Dear Wiki Researchers, During the summer

[Wiki-research-l] Announcing Wikihadoop: using Hadoop to analyze Wikipedia dump files

2011-08-17 Thread Diederik van Liere
Hello! Over the last few weeks, Yusuke Matsubara, Shawn Walker, Aaron Halfaker and Fabian Kaelin (who are all Summer of Research fellows)[0] have worked hard on a customized stream-based InputFormatReader that allows parsing of both bz2 compressed and uncompressed files of the full Wikipedia dump

Re: [Wiki-research-l] Announcing Wikihadoop: using Hadoop to analyze Wikipedia dump files

2011-08-17 Thread Diederik van Liere
? -- Best, Dmitry On Wed, Aug 17, 2011 at 9:58 AM, Diederik van Liere dvanli...@gmail.comwrote: Hello! Over the last few weeks, Yusuke Matsubara, Shawn Walker, Aaron Halfaker and Fabian Kaelin (who are all Summer of Research fellows)[0] have worked hard on a customized stream-based

Re: [Wiki-research-l] Fraction of reverts

2011-08-15 Thread Diederik van Liere
Some more pointers: http://meta.wikimedia.org/wiki/Research:Newbie_reverts_and_article_length http://meta.wikimedia.org/wiki/Research:Newbie_reverts_and_subsequent_editing_behavior Best, Diederik On 2011-08-15, at 9:00 PM, Denny Vrandecic wrote: Thank you, Daniel! On Aug 15, 2011, at

Re: [Wiki-research-l] Editor Trends Study - Improving the tool

2010-11-11 Thread Diederik van Liere
into Wikimedia content and communities        wiki-research-l@lists.wikimedia.org Message-ID: 4cd9d9e3.4040...@post.pl Content-Type: text/plain; charset=ISO-8859-1; format=flowed Diederik van Liere wrote: We are looking for some volunteers that would enjoy testing the tool. You don't need

[Wiki-research-l] Editor Trends Study - Improving the tool

2010-11-09 Thread Diederik van Liere
Dear researchers, Recently, we started the Editor Trends Study ( http://strategy.wikimedia.org/wiki/Editor_Trends_Study). The goal of this study is to get a better understanding of the community dynamics within the different Wikipedia projects. Part of this project consists of developing a tool

[Wiki-research-l] Editor Trends Study - Requesting your Input

2010-10-18 Thread Diederik van Liere
Dear Wikipedia Researchers, We have posted a wiki about the Editor Trends Study on the strategy wiki, you can find it here: http://strategy.wikimedia.org/wiki/Editor_Trends_Study We would like to have your input on our suggested approach and in particular we are curious about your thoughts