Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-16 Thread Joseph Reagle
On Thursday, December 16, 2010, lior gimel wrote: > This is amazing! And buggy! :-) > Thanks for the work and effort, this reconstruction is a priceless resource > for researchers. Thanks to Tim for providing the data, and for working on a much better version that I look forward to! __

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-16 Thread lior gimel
This is amazing! Thanks for the work and effort, this reconstruction is a priceless resource for researchers. Lior On Thu, Dec 16, 2010 at 8:53 PM, Joseph Reagle wrote: > > I have the first 10K edits up reconstructed in their various pages at: > > http://cyber.law.harvard.edu/~reagle/wp-redux/<

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-16 Thread Joseph Reagle
I have the first 10K edits up reconstructed in their various pages at: http://cyber.law.harvard.edu/~reagle/wp-redux/ ___ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-16 Thread Tim Starling
On 16/12/10 23:10, Joseph Reagle wrote: > On Wednesday, December 15, 2010, Tim Starling wrote: >> There were some changes made to the page text that weren't represented >> in diff_log, specifically changing certain camel-case links to free >> links. > It appears my problems were related to some CR/

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-16 Thread Joseph Reagle
On Wednesday, December 15, 2010, Tim Starling wrote: > There were some changes made to the page text that weren't represented > in diff_log, specifically changing certain camel-case links to free > links. It appears my problems were related to some CR/LF issues not round-tripping between diff and

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-15 Thread Tim Starling
On 16/12/10 08:04, Joseph Reagle wrote: > Unfortunately, it doesn't look like versions of the articles beyond > the first ~10 are automatically recoverable. There were some changes made to the page text that weren't represented in diff_log, specifically changing certain camel-case links to free li

Re: [Wiki-research-l] [WikiEN-l] Old Wikipedia backups discovered

2010-12-15 Thread Joseph Reagle
On Tuesday, December 14, 2010, Tim Starling wrote: > I didn't want to believe that those revisions had been lost forever, > and I even opened the UseMod source code and stared forlornly at the > unlink() call. What I (and Brion before) missed is that UseMod appends > a record of every change made t