Re: Scalability

Matt Whitby Sat, 24 Jul 2021 02:36:09 -0700

I did say it was somewhat of a 'length of a piece of string' type
question.  Having just dealt with relational databases for 25 years I know
pretty much nothing about triplestores. Realistically I don't see us
storing over 250m triples and if you say the whole Wikidata dump works ok
(and that's sitting at 12,878,628,927) then I'm pretty sure we'll be okay.
;)


On Sat, 24 Jul 2021 at 09:33, Lorenz Buehmann <
[email protected]> wrote:

> "Scale" in terms of what? I mean, people loaded full Wikidata dump [1],
> so that works for sure on a single (though powerful) machine and indeed
> takes some (lots of) time. Query response time does indeed depend on the
> query size and complexity as well as the data size. But it's impossible
> to give any numbers here. You could have a look at benchmarks for some
> estimates or even better, just try it and load your dataset.
>
> There is no dedicated cluster version of Jena, but some people managed
> to somehow make some extension (dubbed Mosaic) on top
>
>
> [1] http://wiki.bitplan.com/index.php/Get_your_own_copy_of_WikiData
>
> On 23.07.21 19:32, Matt Whitby wrote:
> > A little bit of a vague question, and perhaps a silly one.
> >
> > How well does Jena scale?  Would it tap out after a given number of
> triples?
> >
> > Do people sometimes split very large datasets over different instances
> and
> > just query across the different servers?
> >
> > Thanks all.
> >
> >
>


-- 
Matt
Southend. Essex, England

Guff follows....

Me: http://www.about.me/matt.whitby


Photography: http://www.whitbyphoto.com


Travels: http://www.whitbyadventures.com


Music: http://www.last.fm/user/MattWhitby
<http://www.last.fm/user/MattWhitby/%3C/a%3E>


Reading: https://www.goodreads.com/user_challenges/19398505


Development: https://www.hackerrank.com/matt_whitby

Re: Scalability

Reply via email to