https://bugzilla.wikimedia.org/show_bug.cgi?id=72678

            Bug ID: 72678
           Summary: json dumps have duplicate items (one for the redirect,
                    one for the target)
           Product: MediaWiki extensions
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: Unprioritized
         Component: WikidataRepo
          Assignee: wikidata-b...@lists.wikimedia.org
          Reporter: aude.w...@gmail.com
                CC: wikidata-b...@lists.wikimedia.org
       Web browser: ---
   Mobile Platform: ---

from project chat:

https://www.wikidata.org/wiki/Wikidata:Project_chat#JSON_dump_has_duplicates

I've been working with the JSON dumps and notice that it has identical
duplicate entries. For example, in the latest dump [3], line numbers 921522 and
16155575 are identical dumps of item Turi railway station (Q17100180). There
are dozens of these duplicates. Should these be treated in a special way when
processing the data dump? Jefft0 (talk) 01:17, 29 October 2014 (UTC)

:It looks like another item page [4] redirects to Turi railway station
(Q17100180). I don't think the redirect should be in the dump as a duplicate,
so seems like a bug. But the redirect probably should be represented somewhere
and in some form. Aude (talk) 07:18, 29 October 2014 (UTC)

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to