if you use one of the utilities listed here:
https://phabricator.wikimedia.org/T239866
I'd like you to download one of the 'multistream' dumps and see if your
utility decompresses it fully or not (you can compare the md5sum of the
decompressed content to the regular file's decompressed content and see if
they are the same). Then note the results and the version of the utility on
this task.

Alternatively, if you use some other utility to work with the bz2 files,
please test using that, and add that on the task too.

Here are two files for download and comparison of decompressed content:

https://dumps.wikimedia.org/cewiki/20191201/cewiki-20191201-pages-articles.xml.bz2
and
https://dumps.wikimedia.org/cewiki/20191201/cewiki-20191201-pages-articles-multistream.xml.bz2

Both are around 50 megabytes.

Thank you in advance to whomever participates!

Ariel
_______________________________________________
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l

Reply via email to