daniel added a comment.

For the record: while I was proposing to have the dump "flavor" at the bottom 
of the hierarchy and putting the timestamp only into the filename, I'm coming 
around to the opposite view again: have the date at the base of the hierarchy.

Having the date as the base makes sense if we can make sure that all the dumps 
in that directory consistently reflect the state of the data at the given point 
in time. This is infamously untrue for the "standard" MediaWiki dumps. We could 
however make it true for our dumps by generating everything off a single JSON 
dump, as @mkroetzsch suggested.

If we want to split our RDF output into several files (terms, sitelinks, 
statements, etc), this consistency is essential. I think we should go that 
route, so I filed a ticket for implementing a script for generating RDF from 
JSON: https://phabricator.wikimedia.org/T94019.


TASK DETAIL
  https://phabricator.wikimedia.org/T72385

REPLY HANDLER ACTIONS
  Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign 
<username>.

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: ArielGlenn, daniel
Cc: Manybubbles, JanZerebecki, Smalyshev, aude, daniel, Wikidata-bugs, 
Nemo_bis, mkroetzsch, Svick, ArielGlenn, Lydia_Pintscher, hoo, jeremyb



_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to