Christopher added a subscriber: Smalyshev.
Christopher added a comment.

After researching this, I have discovered that the Munger that processes the 
RDF dump removes several ontology types (wikibase:Item, wikibase:Statement, 
wikibase:Reference, and wikibase:Value) that are needed for object counting and 
comparison.

See here 
https://github.com/wikimedia/wikidata-query-rdf/blob/master/tools/src/main/java/org/wikidata/query/rdf/tool/rdf/Munger.java,
 lines 405, 466, 514, 556.

@Smalyshev Is it possible to add an option to keep them?  And approximately how 
much additional space/memory would these use?


TASK DETAIL
  https://phabricator.wikimedia.org/T115120

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Christopher
Cc: Smalyshev, Christopher, Andrew, yuvipanda, coren, scfc, Matthewrbowker, 
TempleM, Aklapper, RP88, Revi, Luke081515, jkroll, Wikidata-bugs, Jdouglas, 
aude, Deskana, Manybubbles, Gryllida, JanZerebecki



_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to