Smalyshev created this task. Smalyshev added projects: Wikidata, Wikidata-Query-Service, Discovery-Wikidata-Query-Service-Sprint.
TASK DESCRIPTION Since RDF dumps are sharded, resulting dump contains multiple wikibase:Dump schema:dateModified statements. When starting Updater anew from fresh dump load, it bases its starting point on `schema:dateModified` for dump, however since there are multiple ones, it can choose wrong one (too late) and miss some updates. The fix for it can be twofold: 1. Make Updater use only the earliest dateModified statement 2. Make Munger filter out the extra ones (maybe remember the earliest one and drop ones that are higher). TASK DETAIL https://phabricator.wikimedia.org/T229617 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev Cc: matej_suchanek, Cyberpower678, Vojtech.dostal, Aklapper, Mormegil, AMDmi3, LiberatorG, Smalyshev, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Cirdan, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs