Re: How do I do a join between multiple model.listStatments calls?

Andy Seaborne Mon, 14 Nov 2016 11:45:51 -0800

Jena has APIs for local and remote access for SPARQL.

Many large installations are a SPARQL triple store with business logiclayer.


On 14/11/16 19:10, Niels Andersen wrote:

Andy is answering my original question about joins, he stated that
Jena ARQ is using the Jena API, Graph.find and listStatement (you
included this in your response).


I said it uses Graph.find or is faster.

TDB cuts through Graph.find and listStatements to work on the indexesthemselves.

Again, if I understand this
correctly, then Jena ARQ does not implement a join algorithm based on
two sorted lists, so the join must be performed using lookups for
each element returned from the first list (like I showed in my
example). While this is OK for small datasets, it becomes problematic
for large datasets. Do I understand this correctly?

It's called an index join and in TDB does work with RDF terms but withinternal ids (which are fixed 8 bytes long). The representation of tehRDF terms are left on disk unless needed later ("if you do not needdata, do not touch it.").

If the first set is small, an index join is faster than a merge join. Amerge join still need to traverse the whole of both sides if it does notuse sideways passing ... in which case it becomes a form of index join.Due to caching, index lookup is not necessarily expensive.

I would still like to hear what you are intending to use RDF for. Whatfeatures of semntic web, or RDF are you exploting? You email addresssuggests an IoT application.


        Andy

Re: How do I do a join between multiple model.listStatments calls?

Reply via email to