I did say it was somewhat of a 'length of a piece of string' type question. Having just dealt with relational databases for 25 years I know pretty much nothing about triplestores. Realistically I don't see us storing over 250m triples and if you say the whole Wikidata dump works ok (and that's sitting at 12,878,628,927) then I'm pretty sure we'll be okay. ;)
On Sat, 24 Jul 2021 at 09:33, Lorenz Buehmann < [email protected]> wrote: > "Scale" in terms of what? I mean, people loaded full Wikidata dump [1], > so that works for sure on a single (though powerful) machine and indeed > takes some (lots of) time. Query response time does indeed depend on the > query size and complexity as well as the data size. But it's impossible > to give any numbers here. You could have a look at benchmarks for some > estimates or even better, just try it and load your dataset. > > There is no dedicated cluster version of Jena, but some people managed > to somehow make some extension (dubbed Mosaic) on top > > > [1] http://wiki.bitplan.com/index.php/Get_your_own_copy_of_WikiData > > On 23.07.21 19:32, Matt Whitby wrote: > > A little bit of a vague question, and perhaps a silly one. > > > > How well does Jena scale? Would it tap out after a given number of > triples? > > > > Do people sometimes split very large datasets over different instances > and > > just query across the different servers? > > > > Thanks all. > > > > > -- Matt Southend. Essex, England Guff follows.... Me: http://www.about.me/matt.whitby Photography: http://www.whitbyphoto.com Travels: http://www.whitbyadventures.com Music: http://www.last.fm/user/MattWhitby <http://www.last.fm/user/MattWhitby/%3C/a%3E> Reading: https://www.goodreads.com/user_challenges/19398505 Development: https://www.hackerrank.com/matt_whitby
