Looks like someone hasn't learned the lesson: https://www.mail-archive.com/wikidata-l@lists.wikimedia.org/msg02588.html
On Thu, Feb 26, 2015 at 9:27 PM, Lukas Benedix <lukas.bene...@fu-berlin.de> wrote: > I second this! > > > btw: what is the status of the problem with the missing dumps with > history? (latest available from November 2014) > > Lukas > > Am Do 26.02.2015 um 14:52 schrieb Markus Kroetzsch: >> Hi, >> >> It's that time of the year again when I am sending a reminder that we >> still have broken JSON in the dump files ;-). As usual, the problem is >> that empty maps {} are serialized wrongly as empty lists []. I am not >> sure if there is any open bug that tracks this, so I am sending an >> email. There was one, but it was closed [1]. >> >> As you know (I had sent an email a while ago), there are some remaining >> problems of this kind in the JSON dump, and also in the live exported >> JSON, e.g., >> >> https://www.wikidata.org/wiki/Special:EntityData/Q4383128.json >> (uses [] as a value for snaks: this item has a reference with an empty >> list of snaks, which is an error by itself) >> >> However, the situation is considerably worse in the XML dumps, which >> have seen less usage since we have JSON, but as it turns out are still >> preferred by some users. Surprisingly (to me), the JSON content in the >> XML dumps is still not the same as in the JSON dumps. A large part of >> the records in the XML dump is broken because of the map-vs-list issue. >> >> For example, the latest dump of current revisions [2] has countless >> instances of the problem. The first is in the item Q3261 (empty list for >> claims), but you can easily find more by grepping for things like >> >> "claims":[] >> >> It seems that all empty maps are serialized wrongly in this dump >> (aliases, descriptions, claims, ...). In contrast, the site's export >> simply omits the key of empty maps entirely, see >> >> https://www.wikidata.org/wiki/Special:EntityData/Q3261.json >> >> The JSON in the JSON dumps is the same. >> >> Cheers, >> >> Markus >> >> >> [1] https://github.com/wmde/WikibaseDataModelSerialization/issues/77 >> [2] >> http://dumps.wikimedia.org/wikidatawiki/20150207/wikidatawiki-20150207-pages-meta-current.xml.bz2 >> >> > > > > _______________________________________________ > Wikidata-l mailing list > Wikidata-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata-l > _______________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l