Re: About JENA-2339 - security related

Andy Seaborne Mon, 15 Aug 2022 06:28:01 -0700

There is one Jena user - it's Vilnis (for Iotics).

Your use cases - whatever they are - are for the current product andwill evolve. Whether the way you propose will support the evolution ofthe use cases in the future, say the next 5 years, is unclear (and Ithink quite unlikely both on security features because product featureevolve, and on wanting to working with spatial or text datasets). Jenatries to give stability.


The essence of the PR is ~30 lines in SecurityContextDynamic.
The rest is rearranging the plumbing to have a magic user.
This does not need DatasetGraphAccessControl.

This could be in a custom query processor extending SPARQL_QueryDatasetoverriding decideDataset delivered as Fuseki Module. (You can overridethe standard query processor (1 line of code) if you want all queryservices to have this, or all for a particular service (2 lines ofcode), or be a new endpoint that offers only SPARQL query over a view ofthe dataset. The latter is better for you because you can put APIsecurity on the endpoint. It's a opt-in, drop-in extension, to astandard distribution Fuseki/Main.

The amount of code reuse from SecurityContextView is 20 lines maybe viaSecurityContextView.filterTDB and the functionality could made into afunction.

Now your usage is not a security issue for the Fuseki server as the HTTPrequest interface is not changed. No interaction with GSP.

So Iotics add their own query processor to a standard Fuseki server andcan evolve the extension. Configuring the network for the extension isthe responsibility of the Iotics deployment.

The extension might even be interesting to other Jena users not assecurity feature but as for the dynamic view capability.


>>> using a "SELECT {} 1" query, and
>>> adding a certain set of graphs makes the queries on my laptop take:
>>> ~600 graphs ~115ms
>>> ~1500 graphs ~162ms
>>> ~3k graphs ~240ms
>>> ~6k graphs ~400ms
>>

>> That's an illustration of the current system but we don't know what>> is the cause of the cost.

>>
>> What piece of the code is taking the time?
>> Maybe the right thing to do is make it faster.
>
> I haven't looked into this in great detail, but from my understanding
> the time taken is a combination of a) parsing the input of allowed
> graphs and b) generating a new SecurityContext (holding a hashmap of
> said graphs). If providing a set of allowed graphs in the proposed way
> is not a no-go, I'm happy to dig into where the cost is exactly.

We haven't seen the queries you were making. It is difficult to believethat Java takes >100ms to build a 6K entry hash map.

You mentioned that request line gets too long. True for GET but a SPARQLquery request could be sent as a HTML form(application/x-www-form-urlencoded) so listing the graph using?default-graph-uri=/?named-graph-uri= can be much larger than thepractical GET limits.


    Andy

Re: About JENA-2339 - security related

Reply via email to