Re: reasoner performance

Dave Reynolds Tue, 07 Jan 2020 01:00:23 -0800

On 07/01/2020 08:31, Luis Enrique Ramos García wrote:

Dear friends,


I am currently working on an application where I have to implement a
reasoner, something I have some experience with. The difference is that this
time I have to implement it in a big data environment, where I have to deal
with a data set of some giga bytes.

About that, my questions are the following:

1. Is there a benchmark or evaluation of the performance of Jena with some
reasoners, which considers memory or quantity of triples, and
execution time?


Depends what sort of inference you are talking about.

Apart from the OWL benchmarks you mention, some of the SPARQL benchmarks do require small amounts of reasoning loosely around RDFS++. For example, I seem to remember LUBM requires this but I've never worked with it.
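To make "small amounts of reasoning" concrete, here is a minimal sketch of two standard RDFS entailment rules (rdfs9: type propagation along rdfs:subClassOf, rdfs11: subClassOf transitivity) applied by naive forward chaining over an in-memory set. It is plain Python, not Jena's rule engine, and the ex: class and instance names are hypothetical, not taken from LUBM.

```python
TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"


def rdfs_closure(triples):
    """Naive forward chaining of rdfs9 and rdfs11 until nothing new is derived."""
    closure = set(triples)
    while True:
        subclass_pairs = [(s, o) for s, p, o in closure if p == SUBCLASS]
        derived = set()
        for c, d in subclass_pairs:
            for s, p, o in closure:
                if p == SUBCLASS and s == d:
                    # rdfs11: c subClassOf d, d subClassOf o => c subClassOf o
                    derived.add((c, SUBCLASS, o))
                if p == TYPE and o == c:
                    # rdfs9: s type c, c subClassOf d => s type d
                    derived.add((s, TYPE, d))
        if derived <= closure:
            return closure
        closure |= derived


# Hypothetical example data (names are illustrative assumptions).
data = {
    ("ex:GraduateStudent", SUBCLASS, "ex:Student"),
    ("ex:Student", SUBCLASS, "ex:Person"),
    ("ex:alice", TYPE, "ex:GraduateStudent"),
}
closure = rdfs_closure(data)
print(("ex:alice", TYPE, "ex:Person") in closure)
```

A query for all ex:Person instances only finds ex:alice if this kind of closure is available, which is the sense in which a query benchmark can need light inference.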

Jena's inference is not designed to scale to billions of triples; it's a memory-only solution (though "giga bytes" might mean just millions of triples and might fit in memory). So reasoning-at-scale benchmarks on Jena are not going to be much use to you. Look at the results for commercial stores that do claim inference at scale.
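As a rough way to check whether "some giga bytes" lands in the millions-of-triples range, the arithmetic can be sketched as below. Both per-triple figures are assumptions chosen purely for illustration (about 150 bytes per triple in a line-based serialization such as N-Triples, about 500 bytes per triple once loaded); they are not measured Jena numbers, so substitute values measured on your own data.

```python
GIB = 1024 ** 3


def estimate_triples(dataset_bytes, avg_bytes_per_triple=150):
    """Rough triple count for a line-based serialization.

    avg_bytes_per_triple is an assumed average, not a measured value.
    """
    return dataset_bytes // avg_bytes_per_triple


def fits_in_heap(triples, heap_bytes, in_memory_bytes_per_triple=500):
    """Crude check of whether the asserted triples alone fit in the heap.

    in_memory_bytes_per_triple is an assumed cost, not a Jena figure.
    """
    return triples * in_memory_bytes_per_triple <= heap_bytes


triples = estimate_triples(3 * GIB)
print(f"{triples:,} triples (approx.)")
print("fits in 16 GiB heap:", fits_in_heap(triples, 16 * GIB))
print("fits in 4 GiB heap:", fits_in_heap(triples, 4 * GIB))
```

This counts only the asserted triples. Inference materializes additional derived triples, so the real memory requirement will be higher by an amount that depends on the rule set and the data.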

2. Is Elephas, with a MapReduce approach, a good alternative for dealing
with a big data environment?

Depends what sort of inference you are talking about and whether you care about latency or just overall throughput at scale. MapReduce is not good for low-latency interactive queries.

3. Is a triple store necessary to use with a reasoner and rule engine? In
that case, what do you recommend?

Don't understand the question. Triple stores and reasoners are different things. You can have reasoners that have nothing to do with RDF/triple stores and you can have triple stores with no reasoner. There are a fair number of commercial and open source tools in both categories and in the overlap.


Dave
