Christopher added a subscriber: Smalyshev. Christopher added a comment. After researching this, I have discovered that the Munger that processes the RDF dump removes several ontology types (wikibase:Item, wikibase:Statement, wikibase:Reference, and wikibase:Value) that are needed for object counting and comparison.
See here https://github.com/wikimedia/wikidata-query-rdf/blob/master/tools/src/main/java/org/wikidata/query/rdf/tool/rdf/Munger.java, lines 405, 466, 514, 556. @Smalyshev Is it possible to add an option to keep them? And approximately how much additional space/memory would these use? TASK DETAIL https://phabricator.wikimedia.org/T115120 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Christopher Cc: Smalyshev, Christopher, Andrew, yuvipanda, coren, scfc, Matthewrbowker, TempleM, Aklapper, RP88, Revi, Luke081515, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Gryllida, JanZerebecki _______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs