Hi all,

I am trying to read the dump from
https://dumps.wikimedia.your.org/enwiki/20200701/enwiki-20200701-pages-articles-multistream.xml.bz2
using a Java XMLStreamReader, but it complains about the format. It looks like the file does not contain an XML declaration (<?xml version="1.0"?> or similar). After unpacking the file and prepending the declaration, everything seems fine. Is there a good reason why the declaration is missing?

Cheers,
Alex
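
For anyone who wants to reproduce the check, here is a minimal sketch of a StAX read over a document that has no XML declaration. The class name DumpHeaderCheck, the method countPages, and the tiny inline sample are made up for illustration and are not taken from the dump. XML 1.0 treats the declaration as optional, so a small declaration-less input like this is expected to parse; if it does, the complaint on the real file may have a different cause than the missing declaration.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not part of any Wikimedia tooling.
class DumpHeaderCheck {

    // Counts <page> start tags in the stream using the JDK's StAX reader.
    // Parse errors are rethrown unchecked to keep the call site short.
    static int countPages(InputStream in) {
        try {
            XMLInputFactory factory = XMLInputFactory.newInstance();
            XMLStreamReader reader = factory.createXMLStreamReader(in);
            int pages = 0;
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT
                            && "page".equals(reader.getLocalName())) {
                        pages++;
                    }
                }
            } finally {
                reader.close();
            }
            return pages;
        } catch (XMLStreamException e) {
            throw new IllegalStateException("XML parse failed", e);
        }
    }

    public static void main(String[] args) {
        // Invented sample: note there is no <?xml ...?> line at the top.
        String xml = "<mediawiki>"
                + "<page><title>A</title></page>"
                + "<page><title>B</title></page>"
                + "</mediawiki>";
        InputStream in = new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8));
        System.out.println(countPages(in));
    }
}
```

To try this against the real dump, the ByteArrayInputStream would be replaced with a stream over the unpacked file; that part is left out here because it depends on how the .bz2 file is decompressed.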
_______________________________________________ Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l