To wget the bz2 file, we had to use a different URL pattern for April 2024:
https://dumps.wikimedia.org/enwiki/20240101/enwiki-20240101-pages-articles-multistream.xml.bz2
We used to use a pattern without the extra 01 suffix…

Best Regards,
Nat
Senior Technical Staff Member, T.J. Watson Research, IBM
+1 860 812 5089
https://research.ibm.com/people/nathaniel-mills
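
For anyone scripting the download, here is a minimal Python sketch of the URL pattern quoted in this message. It assumes the dump directory and the file name both carry the same eight-digit YYYYMMDD date, as in that URL. The function name dump_url and its wiki parameter are hypothetical, introduced only for illustration, and are not part of any Wikimedia tooling.

```python
from datetime import date

# Host taken from the URL quoted in the message.
BASE = "https://dumps.wikimedia.org"


def dump_url(dump_date: date, wiki: str = "enwiki") -> str:
    """Build a pages-articles-multistream URL (hypothetical helper).

    Assumption: the directory name and the file name use the same
    YYYYMMDD stamp, matching the single URL shown in the message.
    """
    stamp = dump_date.strftime("%Y%m%d")  # e.g. 20240101
    return (
        f"{BASE}/{wiki}/{stamp}/"
        f"{wiki}-{stamp}-pages-articles-multistream.xml.bz2"
    )


if __name__ == "__main__":
    # Prints the URL for the 2024-01-01 dump; pass the result to wget.
    print(dump_url(date(2024, 1, 1)))
```

Whether a dump exists for a given date is not checked here, so it may be worth confirming the directory listing on the dumps site before downloading.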
_______________________________________________
Xmldatadumps-l mailing list -- [email protected]
To unsubscribe send an email to [email protected]
