Re: identifier to use

Hilmar Lapp Tue, 21 Aug 2007 20:59:32 -0700


On Aug 21, 2007, at 1:39 PM, Eric Jain wrote:

Hilmar Lapp wrote:
It seems to me that domain-specific resolution systems are rathera fact and we deal with them all the time.
We try to deal with it, but it's a pain, even though the number ofdifferent systems I need to deal with is limited compared tosomeone who is developing applications that must work across theentire life-sciences domain, or even outside of this domain as well-- completely impractical!

Right. That was one of the problems that was faced when the I3Cconsortium started (namely multiple identifier systems withidiosyncratic translation rules to convert to a resolvable URL), andwhich it tries to address by unifying the identifier and resolutionschemes.

My point was that domain-specific identifier and resolution schemesare a matter of fact, and some evidence shows that the fact that theyare domain specific doesn't diminish their ability to succeed andbecome de-facto standards.

As for being limited to a domain or not, would the LSID mechanism bemore appealing if it read urn:guid:foo.org:Foo:12345? There's nothingin the LSID spec that makes it LS-specific, or due to which it makeno sense outside of the LS.

For example, articles are referenced by DOI, entries in mostinstitutional repositories are referenced by Handles, and GenBanksequences are referenced by a GI number. Any generic tool thatwants to deal with statements made about or to articles(presumably almost all will want to) will need to know how todereference a DOI. Alternatively, for the time being we can prefixthe DOI with http://dx.doi.org/ and have a dereferancable HTTP URI.
That's the single best feature of that system, in my opinion :-)

Do you mean you would prefer if each journal set up URIs based on itsself-chosen domain-name and we reference articles through thatinstead of DOIs? Or did you want to say something else?

I'm not sure why we can't apply the same principle to LSIDs. Thelife science field isn't necessarily a small one, and it seemslike a small price to pay for a tool creator to implement a singleresolution system to resolve any life science identifier. Is thisbeing naive?
From what I see, tool creators haven't shown much interest inimplementing domain specific schemes, or even at least make it easyto plug in your own.
How many semantic web tools support LSID resolution, for example?

I'm not sure you are trying to advocate future standards based on theabilities or lack thereof of the current generation of semantic webtools?

Just as they will have to support DOIs to be practical, I don't seewhy they would shy away from supporting LSIDs, if they are widely used.

To make them widely used is upon the data providers, though, not thetool makers.

There seems to be a notion that all "life science databases" willbe there in perpetuity, but in reality there are plenty ofexamples of databases that lost funding and went "out ofbusiness", with PIR or BIND being some of the better known ones.I'm not quite following why after all these years of discussionthe validity of URIs should again be subject to the vagaries offunding, or the business acumen of commercial enterprises.
The going out of business problem is a big challenge, but in myexperience the majority of changes are nothing else but URLschanging from something like /cgi-bin/fetch.cgi?P00001 to /fetch.do?id=P00001 etc.

Well, yeah, but the big challenge is still a big challenge and a realone, and advocating stable HTTP URIs as a solution surely will notcontribute to solving the big challenge?

There are also some issues with such URLs that have nothing to dowith stability, such as the fact that there are no separate URLsfor concepts and their representations, see previous discussions onthis list...

Right. Does this advocate for or against an opaque identifier system?BTW there are standards to deal with that, such as OpenURL (howeverimperfect that may be).

Domain names are quickly bought, used, and sold to someone else,and this is not just theoretical. The proposed "ease" with whichHTTP URIs can be stably maintained first of all is clearlycontradicted by the empirical evidence that it's not happeningright now (why would a W3C recommendation change that? That wewant stable HTTP URIs can't be new to anyone), and second requirescontinued ownership of the domain name. This seems like a trivialissue but in reality it's not once funding is cut off.For example, the journal Phyloinformatics discontinued recentlyand the domain name phyloinformatics.org is now for sale. If theyhad used HTTP URIs using their domain name, the next owner of thedomain would probably choose not to maintain any of those, orworse, reassign them to something else.
What am I missing?
The time dimension? :-)
If you reference some resource on phyloinformatics.org, you do wellto note down the time when you accessed the resource


In an RDF document?

[...] This will later allow you to retrieve the same page e.g. viathe Internet Archive (if you are lucky).

And if the semantic web tool supports going to the internet archiveif dereferencing an HTTP URI returns RDF that doesn't quite makesense with respect to the statement through which you got to it.


And what if the internet archive chose not to archive that HTTP URI?

Don't know how this is best handled in the context of the SemanticWeb...


Would you mind elaborating?

        -hilmar

--
===========================================================
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===========================================================

Re: identifier to use

Reply via email to