Hilmar Lapp wrote:
It seems to me that domain-specific resolution systems are simply a fact of
life, and we deal with them all the time.
We try to deal with them, but it's a pain, even though the number of
different systems I need to deal with is small compared to what someone
faces who is developing applications that must work across the entire
life-sciences domain, or even outside of it as well -- for them it's
completely impractical!
For example, articles are referenced by DOI, entries in most
institutional repositories are referenced by Handles, and GenBank
sequences are referenced by a GI number. Any generic tool that wants to
deal with statements made about or to articles (presumably almost all
will want to) will need to know how to dereference a DOI. Alternatively,
for the time being we can prefix the DOI with http://dx.doi.org/ and
have a dereferenceable HTTP URI.
That's the single best feature of that system, in my opinion :-)
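To make the prefixing trick concrete, here is a minimal sketch in Python (the helper name is mine; dx.doi.org answers such a URI with an HTTP redirect to the publisher's page):

```python
from urllib.parse import quote

def doi_to_http_uri(doi: str) -> str:
    """Turn a bare DOI into a dereferenceable HTTP URI via the
    dx.doi.org proxy. Illustrative sketch only."""
    # DOIs may contain characters such as '#' or '<' that are not
    # safe in URIs; percent-encode everything except the '/' that
    # separates the DOI prefix from its suffix.
    return "http://dx.doi.org/" + quote(doi, safe="/")
```

So `doi_to_http_uri("10.1000/182")` yields an HTTP URI that any generic web client can follow, with no DOI-specific code on the client side.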
I'm not sure why we can't apply the same principle to LSIDs. The life
science field isn't necessarily a small one, and it seems like a small
price to pay for a tool creator to implement a single resolution system
to resolve any life science identifier. Is this being naive?
From what I see, tool creators haven't shown much interest in implementing
domain-specific schemes, or even in making it easy to plug in your own.
How many semantic web tools support LSID resolution, for example?
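For what it's worth, the first step of LSID resolution -- parsing the URN -- is trivial; here is a minimal sketch in Python (the function name is mine, and the follow-up step of locating the authority's resolution service, which the OMG specification handles via DNS lookups as I understand it, is omitted):

```python
def parse_lsid(lsid: str) -> dict:
    """Split an LSID of the form
    urn:lsid:authority:namespace:object[:revision]
    into its components. Sketch only; a real resolver would next
    contact the authority's resolution service."""
    parts = lsid.split(":")
    if len(parts) < 5 or parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError("not a valid LSID: %r" % lsid)
    return {
        "authority": parts[2],   # e.g. a DNS name identifying the issuer
        "namespace": parts[3],
        "object": parts[4],
        "revision": parts[5] if len(parts) > 5 else None,
    }
```

The parsing is easy; it's the per-authority service discovery and the data/metadata calls behind it that tool authors have been reluctant to implement.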
There seems to be a notion that all "life science databases" will be
there in perpetuity, but in reality there are plenty of examples of
databases that lost funding and went "out of business", with PIR or BIND
being some of the better known ones. I'm not quite following why after
all these years of discussion the validity of URIs should again be
subject to the vagaries of funding, or the business acumen of commercial
enterprises.
The going-out-of-business problem is a big challenge, but in my experience
the majority of changes are nothing more than URLs changing from something
like /cgi-bin/fetch.cgi?P00001 to /fetch.do?id=P00001, etc.
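To make that concrete, here is a minimal sketch (the URL patterns and names are hypothetical) of how a site could keep old links working across both URL generations, by extracting the stable accession and redirecting to the current layout:

```python
import re

# Hypothetical patterns for the two URL styles above. The point is
# that the *identifier* (P00001) is stable even when the server-side
# plumbing around it changes.
_PATTERNS = [
    re.compile(r"/cgi-bin/fetch\.cgi\?(?P<acc>\w+)$"),  # old style
    re.compile(r"/fetch\.do\?id=(?P<acc>\w+)$"),        # new style
]

def extract_accession(url: str):
    """Pull the stable accession out of either URL generation,
    or return None if the URL matches neither."""
    for pat in _PATTERNS:
        m = pat.search(url)
        if m:
            return m.group("acc")
    return None
```

A one-line redirect rule built on this kind of mapping would have preserved every old URI, which is exactly the discipline that empirically isn't being applied.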
There are also some issues with such URLs that have nothing to do with
stability, such as the fact that there are no separate URLs for concepts
and their representations, see previous discussions on this list...
Domain names are quickly bought, used, and sold to someone else, and
this is not just theoretical. The proposed "ease" with which HTTP URIs
can be stably maintained is, first of all, clearly contradicted by the
empirical evidence that it is not happening right now (why would a W3C
recommendation change that? That we want stable HTTP URIs can't be news
to anyone), and, second, it requires continued ownership of the domain
name. This seems like a trivial issue, but in reality it's not once
funding is cut off.
For example, the journal Phyloinformatics was recently discontinued, and
the domain name phyloinformatics.org is now for sale. If they had used HTTP
URIs using their domain name, the next owner of the domain would
probably choose not to maintain any of those, or worse, reassign them to
something else.
What am I missing?
The time dimension? :-)
If you reference some resource on phyloinformatics.org, you do well to note
down the time when you accessed the resource (this is something most print
journals do when showing web addresses). This will later allow you to
retrieve the same page e.g. via the Internet Archive (if you are lucky).
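As an illustration, a noted access date can be turned into a query against the Internet Archive's Wayback "available" API, which reports the snapshot closest to that date (the endpoint is real; the helper name is mine, and actually fetching and parsing the JSON response is left out):

```python
from urllib.parse import urlencode

def wayback_query(url: str, accessed: str) -> str:
    """Build a query URL for the Internet Archive's Wayback
    availability API, asking for the snapshot closest to the
    access date noted down when the resource was cited.
    `accessed` is a timestamp like '20070301' (YYYYMMDD)."""
    return "http://archive.org/wayback/available?" + urlencode(
        {"url": url, "timestamp": accessed})
```

Fetching that query URL returns JSON describing the closest archived snapshot, if one exists -- which is still an "if you are lucky" proposition.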
I don't know how this is best handled in the context of the Semantic Web...