___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
Thank you v much John!
Yes that was the case! The number does match now.
Thanks!
Yuki
On 20 Aug 2020, at 16:42, John wrote:
Are you limiting your count to namespace 0?
On Thu, Aug 20, 2020 at 10:45 AM Yuki Kumagai
wrote:
> Hiya
>
> I have a question about wikipedia xml database dump.
Are you limiting your count to namespace 0?
On Thu, Aug 20, 2020 at 10:45 AM Yuki Kumagai
wrote:
> Hiya
>
> I have a question about wikipedia xml database dump. Apologies if this
> wasn't an appropriate place for asking a question.
> On a wikipedia page, it's mentioned that the current number
Hiya
I have a question about wikipedia xml database dump. Apologies if this
wasn't an appropriate place for asking a question.
On a wikipedia page, it's mentioned that the current number of articles in
english is: 6,144,248
https://en.wikipedia.org/wiki/Wikipedia:Size_of_Wikipedia
However when I
@@@$
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
n
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
Hu
badassbarn
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
For external uses like XML dumps integrating the compression
strategy into LZMA would however be very attractive. This would also
benefit other users of LZMA compression like HBase.
For dumps or other uses, 7za -mx=3 / xz -3 is your best bet.
That has a 4 MB buffer, compression ratios within
--
יוסי גלנטי
0502441015
galan...@gmail.com
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
12 matches
Mail list logo