https://bugzilla.wikimedia.org/show_bug.cgi?id=72678
            Bug ID: 72678
           Summary: json dumps have duplicate items (one for the redirect,
                    one for the target)
           Product: MediaWiki extensions
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: WikidataRepo
          Assignee: wikidata-b...@lists.wikimedia.org
          Reporter: aude.w...@gmail.com
                CC: wikidata-b...@lists.wikimedia.org
       Web browser: ---
   Mobile Platform: ---

from project chat:
https://www.wikidata.org/wiki/Wikidata:Project_chat#JSON_dump_has_duplicates

I've been working with the JSON dumps and notice that it has identical
duplicate entries. For example, in the latest dump [3], line numbers 921522 and
16155575 are identical dumps of item Turi railway station (Q17100180). There
are dozens of these duplicates. Should these be treated in a special way when
processing the data dump? Jefft0 (talk) 01:17, 29 October 2014 (UTC)

:It looks like another item page [4] redirects to Turi railway station
(Q17100180). I don't think the redirect should be in the dump as a duplicate,
so seems like a bug. But the redirect probably should be represented somewhere
and in some form. Aude (talk) 07:18, 29 October 2014 (UTC)
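
Until this is fixed, one possible way for a dump consumer to cope is to skip any entity whose ID has already been seen. The following is only a minimal sketch, not part of Wikibase or any Wikidata tooling. It assumes the dump is laid out as a JSON array with one entity per line (each line ending in a comma) and that every entity carries an `id` key; `iter_unique_entities` is a hypothetical helper name.

```python
import json
from typing import Iterable, Iterator


def iter_unique_entities(lines: Iterable[str]) -> Iterator[dict]:
    """Yield each entity once, dropping later lines that repeat an ID.

    Assumption: `lines` are the text lines of a JSON dump whose first and
    last lines are "[" and "]" and whose other lines hold one entity each.
    """
    seen_ids = set()
    for line in lines:
        # Remove surrounding whitespace and the trailing array comma.
        text = line.strip().rstrip(",")
        if text in ("", "[", "]"):
            continue  # array brackets and blank lines carry no entity
        entity = json.loads(text)
        entity_id = entity["id"]
        if entity_id in seen_ids:
            continue  # duplicate, e.g. emitted once more for a redirect
        seen_ids.add(entity_id)
        yield entity
```

It could be fed an open file object directly, for example `for entity in iter_unique_entities(open(path, encoding="utf-8")): ...`, where `path` is wherever the dump was saved. Note that the set of seen IDs grows with the number of entities, so memory use is proportional to the size of the dump.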