2011/1/19 Aryeh Gregor <simetrical+wikil...@gmail.com>:
> We used to do this, but the problem was that many articles are much
> larger than the compression window of typical compression algorithms,
> so the redundancy between adjacent revisions wasn't helping
> compression except for short articles.  Tim wrote a diff-based history
> storage method (see DiffHistoryBlob in includes/HistoryBlob.php) and
> deployed it on Wikimedia, for 93% space savings:
>
> http://lists.wikimedia.org/pipermail/wikitech-l/2010-March/047231.html
>
That's right, I forgot about that.

> I don't know if this was ever deployed to all of external storage,
> though.  In that thread Tim mentioned only recompressing about 40% of
> revisions, and said that the recompression script required care and
> human attention to work correctly, so maybe he never got around to
> recompressing all the rest -- I don't think he ever said, that I saw.
>
I think he finished recompressing a couple of months ago.

Roan Kattouw (Catrope)

_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to