nichtich created this task. nichtich added projects: Wikidata, Wikidata-Query-Service. Herald added a subscriber: Aklapper. Herald added projects: Discovery, Internet-Archive.
TASK DESCRIPTION
The machine-readable description of Wikidata dumps in DCAT-AP is provided only as RDF/XML and is poorly documented, which limits its usefulness. Please automatically import the RDF file into the Wikidata Query Service so that we can get a list of current dumps, for instance with this SPARQL query:
~~~
PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX dct: <http://purl.org/dc/terms/>
SELECT ?url ?date ?size WHERE {
  <https://www.wikidata.org/about#catalog> dcat:dataset ?dump .
  ?dump dcat:distribution [
    dct:format "application/json" ;
    dcat:downloadURL ?url ;
    dct:issued ?date ;
    dcat:byteSize ?size
  ] .
}
~~~
The only open question is whether to keep information about dumps that have been removed from https://dumps.wikimedia.org/wikidatawiki/entities/. I don't think so, but DCAT information from other dump hosters such as the Internet Archive (see their list of Wikidata dumps) should be included as well.
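Until such an import exists, a similar list could be pulled straight out of the RDF/XML. The following is only a rough sketch in Python using the standard library: the inline `SAMPLE` document, its example.org URLs, and the helper name `list_dumps` are made up for illustration, and it assumes that each distribution is serialized as a typed `dcat:Distribution` element carrying the same properties as the query. The real file may be laid out differently.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dcat": "http://www.w3.org/ns/dcat#",
    "dct": "http://purl.org/dc/terms/",
}
RDF_RESOURCE = "{%s}resource" % NS["rdf"]

# Hypothetical sample data; the structure of the real DCAT-AP file is an assumption.
SAMPLE = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:dcat="http://www.w3.org/ns/dcat#"
    xmlns:dct="http://purl.org/dc/terms/">
  <dcat:Dataset rdf:about="https://example.org/dataset/20170918">
    <dcat:distribution>
      <dcat:Distribution>
        <dct:format>application/json</dct:format>
        <dcat:downloadURL rdf:resource="https://example.org/dumps/20170918.json.gz"/>
        <dct:issued>2017-09-18</dct:issued>
        <dcat:byteSize>1234</dcat:byteSize>
      </dcat:Distribution>
    </dcat:distribution>
    <dcat:distribution>
      <dcat:Distribution>
        <dct:format>application/n-triples</dct:format>
        <dcat:downloadURL rdf:resource="https://example.org/dumps/20170918.nt.gz"/>
        <dct:issued>2017-09-18</dct:issued>
        <dcat:byteSize>5678</dcat:byteSize>
      </dcat:Distribution>
    </dcat:distribution>
  </dcat:Dataset>
</rdf:RDF>
"""


def list_dumps(rdf_xml, media_type="application/json"):
    """Return url/date/size for every distribution with the given dct:format."""
    root = ET.fromstring(rdf_xml)
    rows = []
    # Walk all typed dcat:Distribution nodes, wherever they are nested.
    for dist in root.iter("{%s}Distribution" % NS["dcat"]):
        fmt = dist.findtext("dct:format", default="", namespaces=NS).strip()
        if fmt != media_type:
            continue
        url_el = dist.find("dcat:downloadURL", NS)
        size_text = dist.findtext("dcat:byteSize", default="", namespaces=NS).strip()
        rows.append({
            # The download URL is expected as an rdf:resource attribute.
            "url": url_el.get(RDF_RESOURCE) if url_el is not None else None,
            "date": dist.findtext("dct:issued", default=None, namespaces=NS),
            "size": int(size_text) if size_text else None,
        })
    return rows


if __name__ == "__main__":
    for row in list_dumps(SAMPLE):
        print(row["url"], row["date"], row["size"])
```

This is plain XML matching rather than real RDF processing, so it breaks as soon as the serialization changes (for example, distributions referenced by `rdf:nodeID` instead of nested), which is part of why a SPARQL endpoint would be more useful.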
To: nichtich
Cc: Aklapper, nichtich, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, Avner, debt, Gehel, Jonas, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331, Jay8g
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs