Re: question about RDFPatch headers

Andy Seaborne Fri, 17 May 2019 08:40:51 -0700

Hi Chris,

If the "meta" part becomes complicated, it might be better to put a linkin the header that goes to another file. There is balance to be struckbetween arbitrary structures and simple processing.


It does make some sense to have a multi-valued patch header.

(all one line with or without separating comma)

H graph <http://purl.bdrc.io/graph/0686c69d-8f89-4496-acb5-744f0157a8db><http://purl.bdrc.io/graph/3ee0eca0-6d5f-4b4d-85db-f69ab1167eb1> .

Having the header as a map makes mixing header entries and reprocessingthem work better. No assumed order creeps in and no confusion aboutduplicates for things that must be unique (like id). c.f. HTTP headers.

If you think the meta data is going to get large, then a link toelsewhere may be better for other reasons like using the metadatawithout needing to access the patch in the log.


    Andy

On 16/05/2019 18:35, Chris Tomlinson wrote:

Hi,

We’re building an editing service for our RDF Linked Data Service and are 
thinking to use at least some of the features of RDFPatch/RDFDelta.

We use named graphs for the various Entities that we model: Works, Persons, 
Places, Lineages and so on. We are wanting to include in the patch some headers 
indicating the graphs that are being updated in the patch and the graphs that 
are created in the patch. We want this information to help the editing service 
have easy access to this information w/o analyzing the patch and doing other 
work to discover what’s being created and so on.

At first we thought of using a couple of keywords like, graph and create:

H graph <http://purl.bdrc.io/graph/0686c69d-8f89-4496-acb5-744f0157a8db> .
H graph <http://purl.bdrc.io/graph/3ee0eca0-6d5f-4b4d-85db-f69ab1167eb1> .
H create <http://purl.bdrc.io/graph/b1167eb1-85db-4b4d-6d5f-3ee0eca0f69a> .
H create <http://purl.bdrc.io/graph/0157a8db-acb5-4496-8f89-0686c69d744f> .
H id … .

but org.seaborne.patch.PatchHeader uses a Map so we can only one H graph … and 
one H create … in the patch. Two alternatives we’ve considered are to use a 
String of comma separated graphIds:

H graph "0686c69d-8f89-4496-acb5-744f0157a8db , 
3ee0eca0-6d5f-4b4d-85db-f69ab1167eb1” .
H create "b1167eb1-85db-4b4d-6d5f-3ee0eca0f69a , 
0157a8db-acb5-4496-8f89-0686c69d744f” .

which is plausible but in some cases the list of graphIds could become quite 
long and so this could be an issue down the line with very large strings.

A second idea was to add the notion of a preamble to the patch using PS, for 
preamble start, and PE, for preamble end, which would separate our extensions 
from the defined RDFPatch structure:

PS .
H graph <http://purl.bdrc.io/graph/0686c69d-8f89-4496-acb5-744f0157a8db> .
H graph <http://purl.bdrc.io/graph/3ee0eca0-6d5f-4b4d-85db-f69ab1167eb1> .
H create <http://purl.bdrc.io/graph/b1167eb1-85db-4b4d-6d5f-3ee0eca0f69a> .
H create <http://purl.bdrc.io/graph/0157a8db-acb5-4496-8f89-0686c69d744f> .
PE .
H id … .
TX .
…

We would then pre-parse the patch payload up to the PE and submit the remainder 
to RDFPatch, and so on.

A 3rd possibility is to consider some extension to RDFPatch to use a different 
signature for the Map in PatchHeader. This seems rather involved.

So we’re asking what approaches others might have taken for this sort of 
use-case or how best to accommodate this in RDFPatch as is.

Thanks very much,
Chris

Re: question about RDFPatch headers

Reply via email to