Re: How to implement a custom JENA Backend

Andy Seaborne Tue, 24 Apr 2012 11:56:32 -0700

On 24/04/12 14:57, Paolo Castagna wrote:

Tobias Neef wrote:

Hi Paolo,


thanks for the quick response! The reason for doing this is, because I
think it would be useful to have a RDF-Database with SPARQL-Interface
which can be used as a PAAS offering like Amazon RDS or Amazon Dynamo
DB: For the developer this would mean no hassle about replication, or
scaling etc. To some extend you can achieve that when using Jena SDB
on top of something like Amazon RDS or MS SQL Azure. I want to try how
far I can get when I use Jena as API and map it to something like
Dynamo DB or MS Azure Tables which have quite unique
Scalability/Availability characteristics. There is for example
http://datomic.com/ which also goes along those lines. They
implemented it on top of Dynamo DB but with a custom query language.

Does that make sense from your perspective?


Hi Tobias,

Interesting space and it would be great to have such a service.

There are quite a few design choices to make and they can greatlyinfluence the desing. For example: a service that offered replicationetc and had many datasets can be built using one dataset per machine asthe unit. It scales in total data but not in data-per-dataset or graph.

A service that specialised in massive data (more about data managementthan raw query performance; maybe like a column store if aggregationqueries matter) if different from one giving as-near-real-time responsefor UIs (basically, in-memory or the working set is in-memory).


In terms of where to start,

SDB if you are building on top of an SQL service

TDB, or the shell of TDB, if you building on what amounts to a indexservice. TDB is built on top of indexes - you can plug in your own.

I have built a TDB that used Project Voldemort as a block store for theTDB B+Trees. It worked quite well but as highly scalable base, it'slimited as too much work ends up on the query engine and not enough ofthe indexing work access work is doe in the cluster.


As for examples: see http://www.dydra.com/ which is SPARQL.

        Andy

Re: How to implement a custom JENA Backend

Reply via email to