On Wed, Jul 22, 2009 at 8:15 AM, Dmitriy Sintsov <ques...@rambler.ru> wrote:

> * Tei <oscar.vi...@gmail.com> [Tue, 21 Jul 2009 19:42:45 +0200]:
> > On Tue, Jul 21, 2009 at 7:17 PM, Chengbin
> Zheng<chengbinzh...@gmail.com>
> > wrote:
> > ...
> > >
> > > No, I know what parsing means. Even if it takes 2 days to parse
> them,
> > > wouldn't it be faster than to actually create a static HTML dump the
> > > traditional way?
> > >
> > > If it is not, then what is the difficulty of making static HTML
> dumps?
> > It
> > > can't be bandwidth, storage, or speed.
> > >
> >
> > WikiMedia work with limited resources on manpower, hardware,
> etc..etc...
> >
> > Things are done. When? when theres available resources, humans and of
> > the other types.
> > Is not only you, there are lots of people that want to download the
> > wikipedia (sometimes in a periodic fashion)
> >
> > There are a log somewhere with the daily work of some wikipedia admin.
> (
> > - :
> > http://wikitech.wikimedia.org/view/Server_admin_log
> >
> > Some of these are even very fun, like in:
> > 02:11 b****: CPAN sux
> > 01:47 d******: I FOUND HOW TO REVIVE APACHES
> > ( names obscured to protect the inocents ).
> >
> Speaking of compact off-line English Wikipedia I liked the TomeRaider
> version:
> http://en.wikipedia.org/wiki/TomeRaider
> I wish there were newer TR builds, because English Wikipedia grows
> really fast.
> Dmitriy
>
> _______________________________________________
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>

Yes, the "TombRaider" version is exactly the version I want for static
HTML.

Just curious, is
pages-articles.xml.bz2<http://download.wikimedia.org/enwiki/20090713/enwiki-20090713-pages-articles.xml.bz2>
like
a "TombRaider" version? If not, what's the difference?

And another curiosity, at
http://en.wikipedia.org/wiki/Wikipedia:TomeRaider_database, it says the
English Wikipedia database is only 3.3GB. Did they use compression? That
seems awfully small. Even if they did, that's an incredible compression
ratio, similar to 7-zip, I don't know how you can do that on a eBook format.
NTFS compression only brings size down 50%.
_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to