Re: [BioRDF] All about the LSID URI/URN

Carole Goble Fri, 28 Jul 2006 11:26:48 -0700


Sean

We will be joining you on Monday for the telecon. As you know, our projects use LSIDs heavily, and we find them invaluable.

Our practical experiences are, in a nutshell

Conceptually we like
1. decoupled naming from physical location (essential)
2. versioning (very useful)
3. separate data from metadata (very useful)
4. foreign authorities can add metadata to an LSID in a transparent way (useful)
5. can be retrofitted (useful - advantage over PURLs)
6. metadata is in RDF (useful)
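
To make points 1 and 2 concrete, here is a rough sketch of splitting an LSID into its parts. It assumes the usual urn:lsid:authority:namespace:object[:revision] layout; the example identifier, the Lsid type and the parse_lsid helper are made up for illustration and are not taken from any LSID toolkit.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    """Parts of an LSID; none of them says where the data physically lives."""
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str]  # optional version component


def parse_lsid(text: str) -> Lsid:
    """Split 'urn:lsid:authority:namespace:object[:revision]' into its parts."""
    parts = text.split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"not an LSID: {text!r}")
    # The 'urn' and 'lsid' prefixes are compared case-insensitively.
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID: {text!r}")
    authority, namespace, object_id = parts[2:5]
    if not (authority and namespace and object_id):
        raise ValueError(f"empty component in LSID: {text!r}")
    revision = parts[5] if len(parts) == 6 and parts[5] else None
    return Lsid(authority, namespace, object_id, revision)


# Hypothetical identifier, for illustration only.
example = parse_lsid("urn:lsid:example.org:proteins:P12345:2")
```

The name carries an authority and an optional revision but no host path or protocol, which is the decoupling we care about.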

However, we have problems with the implementation, specifically the use of SOAP within the resolution system, because:
1. it's not needed conceptually
2. it's costly
3. it's overkill, which affects performance
4. the main implementation is Axis based - not suitable for phones, PDAs and other thin clients

And we hardly ever type an LSID into a browser :-) We use them as distributed object ids that are rapidly adoptable by a very distributed service base.


Chat on Monday.

By the way, I have already lodged an objection with Susie about having such a telecon when many people who actually, like, use the stuff for, like, real are at ISMB2006 in Brazil and will not be able to participate. Like, Doh!


Carole

Professor Carole Goble
Director, myGrid project (http://www.mygrid.org.uk)
Chair, Open Middleware Infrastructure Institute-UK (http://www.omii.ac.uk)

Hello Dan,

> Thanks for continuing to explain the requirements. I haven't seen
> LSID requirements that can't be met with http/DNS yet, but that
> doesn't mean they're not there.
>
> Yes, it's easy to see how starting fresh simplified some things.
> But I am not convinced that starting fresh is the only option,
> nor that working within the constraints of http/DNS won't give
> a lot more benefit for approximately the same investment.
>
>
I don’t know exactly why a URN-style identifier was chosen over an http-style URI for LSIDs, as I was not involved at the time. My educated guess is that, for a number of the reasons I have detailed in my earlier posts, http URLs, as understood in common practice then & indeed now, were not seen to fit all the requirements exactly, and an exact fit was required for them to actually be useful to a fairly fractured community. Nuanced detail is important in this problem and I can see from your last reply that we are not meeting there at many levels.

In particular, though, I suspect the highly distributed nature of the Life Science community, the perceived fragility of URL links and the URL's ambiguous technical/social contracts were major motivating factors, as was the successful application of such a scheme internally by a number of the standard's initiators. In circumstances where there were (and perhaps still are) not enough obvious standards, guidance & best practices available to show how to make http URLs fit this particular bill, it is perfectly understandable why they fell back on the URN specifications. In those there was clear guidance about how to make persistent identifiers that would work. Given the existence of other substantial persistent identifier efforts (e.g. ARK and DOI), it seems to me they were neither alone nor unreasonable in their thinking.

I was involved in the decision making surrounding the choice of the dereferencing protocol for the LSID standard and know that it was purposely based on as many preexisting standards as possible, namely the DDDS RFCs (for URN dereferencing), DNS SRV records (for service discovery) and SOAP/WSDL (for communicating metadata/data endpoints), as well as the common web protocols for transport (http, ftp, file://, SOAP), given that much data that required naming was already accessible online. It was entirely deliberate that as much existing precedent be used as possible, and consequently little of the LSID protocol specification is completely new invention.

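As a rough illustration of the DNS SRV service-discovery step, the sketch below derives the record name a resolver would look up for an LSID's authority. It assumes the _lsid._tcp service label conventionally used for LSID authorities; the srv_query_name helper and the example identifier are hypothetical, and the DNS query itself and the SOAP/WSDL exchange that follows are not shown.

```python
def srv_query_name(lsid: str) -> str:
    """Return the DNS SRV record name to query for an LSID's authority.

    Assumes the 'urn:lsid:authority:namespace:object[:revision]' layout
    and the '_lsid._tcp' service label (an assumption of this sketch).
    """
    parts = lsid.split(":")
    if len(parts) < 5 or [p.lower() for p in parts[:2]] != ["urn", "lsid"]:
        raise ValueError(f"not an LSID: {lsid!r}")
    authority = parts[2].lower()  # DNS names are case-insensitive
    if not authority:
        raise ValueError(f"missing authority in LSID: {lsid!r}")
    return f"_lsid._tcp.{authority}"


# Hypothetical identifier, for illustration only.
query_name = srv_query_name("urn:lsid:example.org:proteins:P12345")
```

Only the authority component takes part in discovery; the namespace, object and revision are passed on to whichever service the SRV record points at.
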
That said, I can also see the great benefits of allowing direct http URL style access to information named using the LSID scheme, which is why I personally would be much inclined to back the suggestion by Henry Thompson that the LS community go the extra step in the standard to establish the necessary mechanisms to link LSIDs to the web using the pattern he suggested from the ARK group.

From my point of view, we finally have an extremely hard-won LSID standard which is already being usefully put into practice by various groups in the Life Sciences community. This is no small achievement. It has a fairly clear social/technical contract, although I believe there are improvements that can be made in the area of metadata. It seems to me that it is best used for uniquely naming LS digital objects - anything one can think of today as a file - but there are also other valid uses. In addition, it seems perfectly valid to use an LSID as a URI in RDF because it is a URN and URNs are URIs. We intend to go on doing so while we find it useful for our purposes. What would be marvelous would be to start defining the scope of the metadata returned so we can take the existing usefulness to the next level.

I will look forward to talking with you next week. Have a great weekend everyone.

Kindest regards, Sean

--
Sean Martin
IBM Corp
