[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Pintoch
Pintoch added a comment.
@Lydia_Pintscher that makes sense. Okay, thank you to you both, we are on the same page! Given all these tickets on the topic I was worried that I had missed something obvious about this issue…TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: PintochCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Lydia_Pintscher
Lydia_Pintscher added a comment.
Right and I don't think there is desire to change that. But for tools like the primary sources tool that could ingest rdf and then feed data into Wikidata through the usual API I think that'd be ok.TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Lydia_PintscherCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Smalyshev
Smalyshev added a comment.
Right now RDF is only a secondary database format, so all imports are supposed to go through Wikibase instance, using one of Wikibase import formats. Using RDF for import AFAIK is not a use case we currently implement.TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Lydia_Pintscher
Lydia_Pintscher added a comment.
I am not aware of any "standard" but this is the time to make one then I guess ;-)TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Lydia_PintscherCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Pintoch
Pintoch added a comment.
@Smalyshev thanks for your quick reply! Just for clarity, I am not personally working on the PST, I was just trying to find out if there was any established way to use RDF to represent a data import. If that is the case, then other tools could use that format too (for instance, OpenRefine could export datasets to this format). I'd be happy to work on that but I can only do it if a RDF model is agreed on.

It looks like the Munger from wikidata-query-rdf can be used to fill a Wikibase instance, but that is a different use case.TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: PintochCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T173749: QuickStatements to RDF converter

2017-09-27 Thread Pintoch
Pintoch added a comment.
@Lydia_Pintscher , @Smalyshev and @Tpt : is there any info about how RDF is expected to behave as an import format for Wikidata? As far as I can tell, the RDF that gets fed into the Query Service is not designed for import at all:


first, there is a lot of redundancy: values are represented by simple values and value nodes, truthy statements are redundant with statement nodes, and other things like that. (this is absolutely not a criticism of the RDF serialization strategy: it totally makes sense as an export format!) So is there any designated subset of the exported triples that data producers would need to emit? I assume that subset would need to be as expressive as possible (so, for instance, the truthy triples would be dropped in favor of the full statement nodes). That is going to be very verbose, right?
second, the identifiers on the nodes are generated by Wikibase: so, how does a data producer picks identifiers? Is it just going to impose its own hashes that Wikibase will have to respect?


It would be great to have something else than QuickStatements to represent a data import, but I still have doubts about why RDF is suitable for that in the first place. The good thing about RDF is that it is a standard, so many tools can deal with it. But given the issues mentioned above, I expect it is going to be quite painful to reuse these tools to produce data in the right schema, as everything is deeply reified. Anyway, if that is the path you have chosen, we need specs please!

Also, it seems that this project uses Java, so may I suggest that the reusable parts go to the Wikidata-Toolkit rather than the Primary Sources Tool? Wikidata-Toolkit has already got RDF export, so it would make sense to have RDF import (from RDF statements to the datamodel representation, say).TASK DETAILhttps://phabricator.wikimedia.org/T173749EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: PintochCc: Pintoch, Aklapper, Hjfocs, Tpt, Lydia_Pintscher, Smalyshev, GoranSMilovanovic, Kiailandi, QZanden, dachary, Izno, Wikidata-bugs, aude, Ricordisamoa, Sjoerddebruin, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs