Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-11 Thread Diederik van Liere
> From: Susan Biancani
> Subject: [Wiki-research-l] diffdb formatted Wikipedia dump
> Date: October 3, 2013 10:06:44 PM PDT
> To: wiki-research-l@lists.wikimedia.org
> Reply-To: Research into Wikimedia content and communities <wiki-research-l@lists.wikimedia

Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-08 Thread Klein,Max
wiki-research-l-boun...@lists.wikimedia.org on behalf of Susan Biancani
Sent: Tuesday, October 08, 2013 3:28 PM
To: Research into Wikimedia content and communities
Subject: Re: [Wiki-research-l] diffdb formatted Wikipedia dump

Right now, I want all the edits to user pages and user talk pages, 2010-2013. But

Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-08 Thread Susan Biancani
nce, OCLC
> +17074787023
>
> --
> From: wiki-research-l-boun...@lists.wikimedia.org on behalf of Susan Biancani
> Sent: Thursday, October 03, 2013 10:06 PM
> To: wiki-research-l@lists.wikimedia.org
> Subject: [Wiki-research-l] diffdb for

Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-07 Thread Pierre-Carl Langlais
behalf of Susan Biancani
Sent: Thursday, October 03, 2013 10:06 PM
To: wiki-research-l@lists.wikimedia.org
Subject: [Wiki-research-l] diffdb formatted Wikipedia dump

I'm looking for a dump from English Wikipedia in diff format (i.e. each entry is the text that was added/deleted

Re: [Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-07 Thread Klein,Max
PM
To: wiki-research-l@lists.wikimedia.org
Subject: [Wiki-research-l] diffdb formatted Wikipedia dump

I'm looking for a dump from English Wikipedia in diff format (i.e. each entry is the text that was added/deleted since the last edit, rather than the current state of the page).

[Wiki-research-l] diffdb formatted Wikipedia dump

2013-10-03 Thread Susan Biancani
I'm looking for a dump from English Wikipedia in diff format (i.e. each entry is the text that was added/deleted since the last edit, rather than the current state of the page). The Summer of Research folks provided a handy guide on how to create such a dataset from the standard comp
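
To make the "diff format" idea in this thread concrete, here is a minimal sketch of what such a dataset contains: for each revision, the lines added and removed relative to the previous revision rather than the full page text. It uses Python's standard `difflib` and is not the diffdb tool or the Summer of Research procedure; the function name `revision_diffs` and the sample revisions are hypothetical, chosen only for illustration.

```python
import difflib


def revision_diffs(revisions):
    """Yield (added, removed) line lists for each revision.

    `revisions` is an iterable of full page texts in chronological order.
    The first revision is diffed against an empty page.
    """
    previous = []
    for text in revisions:
        current = text.splitlines()
        added, removed = [], []
        # ndiff prefixes each output line with "+ ", "- ", "  " or "? "
        for line in difflib.ndiff(previous, current):
            if line.startswith("+ "):
                added.append(line[2:])
            elif line.startswith("- "):
                removed.append(line[2:])
        yield added, removed
        previous = current


if __name__ == "__main__":
    # Hypothetical revision history of a single page
    history = [
        "Hello world.",
        "Hello world.\nA second line.",
        "A second line.",
    ]
    for number, (added, removed) in enumerate(revision_diffs(history), start=1):
        print(number, "added:", added, "removed:", removed)
```

A line-level diff is only one possible granularity; a word- or token-level diff may suit analyses of talk-page edits better, at the cost of more processing per revision.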