Re: [CODE4LIB] Implementing OpenURL for simple web resources

Eric Hellman Mon, 14 Sep 2009 11:22:44 -0700

What the spec for z39.88 says is that rfr_id (and all the other _id's)must be URIs.

the info:sid samespace was defined to allow minting of identifiers forthe specific purpose of identifying referrers. the info uri wasdefined to allow non-resolving identifiers to have a place to livewithin URI-land.

Documents written by standards committees are often not as clear asthey should be, but its hard to get consensus across an industywithout getting a committee together. Social process is so much harderthan technology.



On Sep 14, 2009, at 1:57 PM, Jonathan Rochkind wrote:

Huh, I can't even FIND a section 9.1 in the z39.88 standard. Are welooking at the same z39.88 standard? Mine only goes up to chapter4. Oh wait, there it is in Chapter 3, section 9.1 okay.
While that example contains an http URI, I would say it's intendedas an unambiguous identifier URI that happens to use an http schema,not an end-user access URL. Although the weird thing is, in everyother context the docs use an info:sid uri for rfr_id, to the extentthat I thought you were REQUIRED to use an info:sid in rfr_id, Ididn't even know you could use an HTTP uri as that example does,weird. For instance, while chapter 3 Section 9.1 uses that exampleof rfr_id=http://www.sciencedirect.com, over on page 14 in Chapter1, they use this example for the same entity: rfr_id = info:sid/elsevier.com:ScienceDirect
It certainly doesn''t surprise anymore when the z3988 standardcontains ambiguity or confusing/conflicting examples.
I wonder if there's more on this that is conflicting or confusing inthe "scholarly format" application profiles, or in the "KEVimplementation guidelines." Probably. Yep, that's where I got therfr_id=sid idea from! The "KEV implementation guideilines" say:"Referrer Identifiers are defined in the source identifier Namespace`info:ofi/nam:info:sid:'. They are identified using the `info:sid/'scheme for the identification of collections." It is unclear howthe "KEV Implementation Guidelines" justify saying that a rfr_id isalways info:sid, when the actual z39.88 actually uses an http rfr_idexample. Who knows which one was the mistake.
Seriously, don't use OpenURL unless you really can't find anythingelse that will do, or you actually want your OpenURLs to be used bythe existing 'in the wild' OpenURL resolvers. In the latter case,don't count on them doing anything in particular or consistent with'novel' OpenURLs, like ones that put an end-user access URL inrft_id... don't expect actually existing in the wild OpenURLs to doanything in particular with that.
Jonathan

Rosalyn Metz wrote:
ok no one shoot me for doing this:
in section 9.1 Namespaces [Registry] of the OpenURL standard(z39.88) itactually provides an example of using a URL in the rfr_id field,and i
wonder why you couldn't just do the same thing for the rft_id
also there is a field called rft_val which currently has no use.this might
be a good one for it.

just my 2 cents.
On Mon, Sep 14, 2009 at 12:57 PM, Jonathan Rochkind<[email protected]>wrote:
Well, in the 'wild' I barely see any rft_id's at all, heh. Asidefrom theobvious non-http URIs in rft_id, I'm not sure if I've seen httpURIs thatdon't resolve to full text. BUT -- you can do anything with anhttp URIthat you can do with an info uri. There is no requirement orguarantee inany spec that an HTTP uri will resolve at all, let alone resolveto full
text for the document cited in an OpenURL.
The OpenURL spec says that rft_id is "An Identifier Descriptor
unambiguously specifies the Entity by means of a Uniform ResourceIdentifier
(URI)."  It doesn't say that it needs to resolve to full text.
In my own OpenURL link-generating software, I _frequently_ putidentifierswhich are NOT open access URLs to full text in rft_id. Becausethere's noother place to put them. And I frequently use http URIs even forthingsthat don't resolve to full text, because the conventional wisdomis toalways use http for URIs, whether or not they resolve at all, andcertainlyno requirement that they resolve to something in particular likefull text.
Examples that I use myself when generating OpenURL rft_ids, ofhttp URIsthat do not resolve to full text include ones identifying bibrecords in my
own catalog:
http://catalog.library.jhu.edu/bib/NUM [ Will resolve to mycatalog
record, but not to full text!]

Or similarly, WorldCat http URIs.
Or, an rft_id to unambigously identify something in terms of it'sGoogle
Books record:  http://books.google.com/books?id=tl8MAAAACAAJ

Also, URIs to unambiguously specify a referent in terms of sudoc:
http://purl.org/NET/sudoc/[sudoc] <http://purl.org/NET/sudoc/%5Bsudoc%5D> => will, as the purl is presently set up by rsinger, resolveto a GPO
catalog record, but there's no guarantee of online public full text.

I'm pretty sure what I'm doing is perfectly appropriate based on the
definition of rft_id, but it's definitely incompatible with areceiving linkresolver assuming that all rft_id http URIs will resolve to fulltext forthe rft cited. I don't think it's appropriate to assume that justbecause aURI is http, that means it will resolve to full text -- it'smerely anidentifier that unambiguously specifies the referent, same as anyother URIscheme. Isn't that what the sem web folks are always insisting inthearguments about how it's okay to use http URIs for any type ofidentifier atall -- that http is just an identifier (at least in a contextwhere allthat's called for is a URI to identify), you can't assume that itresolvesto anything in particular? (Although it's nice when it resolves toRDFsaying more about the thing identified, it's certainly notexpected that it
will resolve to full text).
Eric, out of curiosity, will your own link resolver softwareautomatically
take rft_id's and display them to the user as links?

Jonathan


Eric Hellman wrote:
Could you give us examples of http urls in rft_id that are likethat?
I've never seen such.

On Sep 14, 2009, at 11:58 AM, Jonathan Rochkind wrote:
In general, identifiers in URI form are put in rft_id that areNOT meantfor providing to the user as a navigable URL. So the receivingsoftwarecan't assume that whatever url is in rft_is represents anactual access
point (available to the user) for the  document.
Eric Hellman
President, Gluejar, Inc.
41 Watchung Plaza, #132
Montclair, NJ 07042
USA

[email protected]
http://go-to-hellman.blogspot.com/


Eric Hellman
President, Gluejar, Inc.
41 Watchung Plaza, #132
Montclair, NJ 07042
USA

[email protected]
http://go-to-hellman.blogspot.com/

Re: [CODE4LIB] Implementing OpenURL for simple web resources

Reply via email to