"Scale" in terms of what? People have loaded the full Wikidata dump [1], so that certainly works on a single (though powerful) machine, although loading takes a long time. Query response time depends on the size and complexity of the query as well as on the size of the data, so it is impossible to give general numbers here. You could look at published benchmarks for rough estimates or, even better, just load your dataset and try it.
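
If you want to "just try it", a rough way is to time a few representative queries against your own endpoint once the data is loaded. The following is only a minimal sketch in Python, not a proper benchmark. It assumes the dataset is exposed over the standard SPARQL 1.1 Protocol (for example via Fuseki). The endpoint URL and dataset name in the comments are hypothetical, and the helper names run_select and median_seconds are made up for illustration.

```python
import json
import statistics
import time
import urllib.parse
import urllib.request


def run_select(endpoint, query, timeout=300):
    """POST a SPARQL SELECT query (SPARQL 1.1 Protocol) and return the result rows."""
    data = urllib.parse.urlencode({"query": query}).encode("utf-8")
    request = urllib.request.Request(
        endpoint,
        data=data,  # form-encoded body, so urllib sends this as a POST
        headers={"Accept": "application/sparql-results+json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)["results"]["bindings"]


def median_seconds(run, repeats=5):
    """Call run() `repeats` times and return the median wall-clock time in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        run()
        timings.append(time.perf_counter() - start)
    return statistics.median(timings)


# Example usage (hypothetical local endpoint and dataset name "ds"):
#
#   endpoint = "http://localhost:3030/ds/sparql"
#   query = "SELECT (COUNT(*) AS ?n) WHERE { ?s ?p ?o }"
#   print(median_seconds(lambda: run_select(endpoint, query)))
```

The median is used rather than the mean because the first run is usually slower while caches are still cold. Use queries that resemble your real workload; a plain triple count says little about how more complex queries will behave.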

There is no dedicated cluster version of Jena, but some people have managed to build an extension on top of it (dubbed Mosaic).


[1] http://wiki.bitplan.com/index.php/Get_your_own_copy_of_WikiData

On 23.07.21 19:32, Matt Whitby wrote:
A little bit of a vague question, and perhaps a silly one.

How well does Jena scale?  Would it tap out after a given number of triples?

Do people sometimes split very large datasets over different instances and
just query across the different servers?

Thanks all.

