Hi all,
some of you could find my recent experiments interesting. I posted them
here:
http://morfologik.blogspot.com/2007/01/wikipedia-history-diff-as-revision.html
In short, it seems that Lars' idea was brilliant, and it is possible to
filter out the edit wars using simple metrics. Prepare to buy larger
disks, revision histories are big files :)
best,
Marcin
[email protected] napisał(a):
Lars wrote:
One idea for finding stats on errors is to compare changes made to
Wikipedia articles. The complete text revision history is
That might make a good corpus.
Would it be possible to write a script that picks up just the
spelling/grammar changes? If not, you'll be counting the effects of
numerous edit wars.
xan
jonathon
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]