Re: SPARQL vs Jena rules

Dave Reynolds Thu, 31 Aug 2017 00:49:25 -0700

On 30/08/17 15:10, baran...@gmail.com wrote:

PS: I wonder why Dave doesn't comment in this thread. Perhaps because hethinks, Lorenz is ok, i myself cannot stand the low-level-knowledge ofthe users in this thread or no matter what you do, by some heavydata-input an app with InfModel would hang anyway? Lorenz is ofcourseok, but i 'guess' Jena users are also very curious about Dave's comments...

I didn't comment on this thread because, as Andy has already pointedout, this seems to be a repeat of a recent similar thread (on which Idid comment). That in turn was a near repeat of another similar thread.All from the same group.


Also I think Lorenz has covered it all, with admirable patience.

However, in an attempt to clarify the trade-offs in more depth maybe thefollowing would be helpful:

When comparing a rule system against a set of SPARQL Update queriesthere are several factors that affect the trade-offs including (1) thespecific nature of the rules/queries, (2) the data flow and, (3)preferences on syntax and machinery.

1a. For a single (forward) Jena rule then you can always achieve thesame with a SPARQL Update. For a single set of input data then SPARQLhas the benefit of being a standard [1] and offering better performanceover a store like TDB. Conversely SPARQL is much richer than Jena rulesso there are things that you could achieve with a single SPARQL Updatequery using, say, property paths that would require multiple Jena rules.

1b. If you have a set of rules, but they don't create loops/recursion,then you can "stratify" them into groups of rules than can be run oneafter the other. In that case, for a single set of input data, thenagain you can implement it as a sequence of SPARQL Updates with similarbenefits.

1c. If your rules can't be stratified, i.e. one rule can indirectlytrigger itself, then it's more complex. In that case you would have toe.g. run the set of SPARQL Updates repeatedly until nothing new isdeduced. Depending on the specifics of the rules and the data that maybe quite expensive and you would be better off with something Jenarules. However, in some cases you may be able to use things like SPARQLproperty paths to achieve the desired effect without have to recurse.

2. If you have a single data set and just want to run your rules on itthen the above applies. If you are repeatedly adding new data and wantto keep your deductions up to date then the Jena forward rules enginehas the advantage that it keeps all the partial matches around. Soaddition of one more triple may cause a rule to fire without it havingto search for all the other triples in the body. This is also why"recursive" rules work relatively efficiently.

This doesn't apply if you delete data. In that case Jena rules have tostart over and can't reuse state across data deletions.

If you keeping changing your data but very rarely ask questions of it,and then only limited questions, then Jena back rules have advantages.The backward engine will only run the rules needed for the specificquery. If that's a lot fewer than the overall rules then that should becheaper than running a full forward deduction using SPARQL Updates. Inthis situation it may be possible to achieve the same effects throughSPARQL query (not update) by query rewriting but that's a wholedifferent ball game.

3. With Jena rules you have some prebuilt machinery for running therules (InfGraphs and all that) and some support for externalizing therules in separate files. With SPARQL you have to create all that (thoughit's easy) and you have a nicer syntax.

So fundamentally, like all "X vs Y" questions it depends on thespecifics of what you are trying to do.


Dave

[1] There is a standard for rules, RIF, but it is not aimed atparticularly RDF processing and post-dates Jena rules.

Re: SPARQL vs Jena rules

Reply via email to