"Scale" in terms of what? I mean, people loaded full Wikidata dump [1],
so that works for sure on a single (though powerful) machine and indeed
takes some (lots of) time. Query response time does indeed depend on the
query size and complexity as well as the data size. But it's impossible
to give any numbers here. You could have a look at benchmarks for some
estimates or even better, just try it and load your dataset.
There is no dedicated cluster version of Jena, but some people managed
to somehow make some extension (dubbed Mosaic) on top
[1] http://wiki.bitplan.com/index.php/Get_your_own_copy_of_WikiData
On 23.07.21 19:32, Matt Whitby wrote:
A little bit of a vague question, and perhaps a silly one.
How well does Jena scale? Would it tap out after a given number of triples?
Do people sometimes split very large datasets over different instances and
just query across the different servers?
Thanks all.