nichtich created this task.
nichtich added projects: Wikidata, Wikidata-Query-Service.
Herald added a subscriber: Aklapper.
Herald added projects: Discovery, Internet-Archive.

TASK DESCRIPTION

The machine readable description of Wikidata dumps in DCAT-AP is only provided as RDF/XML and poorly documented, limiting its usefulness. Please automatically import the RDF file into Wikidata Query Service so we can get a list of current dumps for instance with This SPARQL query:

~~~
PREFIX dcat: http://www.w3.org/ns/dcat#
PREFIX dct: http://purl.org/dc/terms/

SELECT ?url ?date ?size WHERE {

<https://www.wikidata.org/about#catalog> dcat:dataset ?dump .
?dump dcat:distribution [
  dct:format "application/json" ;
  dcat:downloadURL ?url ;
  dcat:issued ?date ;
  dcat:byteSize ?bytes 
] .

}
~~~

The only open question is whether to keep information about dumps removed from https://dumps.wikimedia.org/wikidatawiki/entities/. I don't this so but DCAT information from other dump hosters such as Internet Archive (see their list of Wikdata dumps should be included as well.


TASK DETAIL
https://phabricator.wikimedia.org/T178978

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: nichtich
Cc: Aklapper, nichtich, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, Avner, debt, Gehel, Jonas, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331, Jay8g
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to