[whatwg] Conformance requirements for IRIs

Henri Sivonen Mon, 17 Apr 2006 09:14:24 -0700

In WA 1.0 and WF 2.0 some values are required to be IRIs and somevalues are required to be IRI references. I'm confused about whatexactly this means in terms of conformance checking. (WF 2.0 does saysomething about processing in a browser, though.)

First, I was amazed to learn that for pure non-infoset-augmentingvalidation xsd:anyURI datatype does not mean anything useful beyondtoken and that it is not exactly an IRI reference.

http://www.imc.org/atom-syntax/mail-archive/msg17990.html
http://www.mail-archive.com/rng-users@yahoogroups.com/msg00350.html


Having read
http://www.w3.org/TR/xlink/#link-locators

I started to suspect that just about every string indeed can beconsidered sort of an IRI reference that can munged into an IRIreference so there's nothing to check.


Then I found

http://jena.sourceforge.net/tmp/javadoc/com/hp/hpl/jena/iri/IRIFactory.htmlwhich provides a fascinating number of enforcement options. I couldwrite a custom datatype wrapper for it, but I don't know whichoptions to use.

I'd appreciate some guidance on which enforcement options to use.(E.g. should knowledge of the http scheme used? Should securityissues be flagged as non-conforming? Should "SHOULD" violations beflagged as non-conforming? Etc.)

(This is the first time I venture into the world of IRIs. I haveintuitively thought that they are trouble, so I have knowinglyavoided minting non-URI IRIs myself.

I suspected that bad stuff happens with IRIs containing decomposedcharacter sequences. (These can be found in the URI form due to HFS+-backed Apache setups.) Now that I've read the RFC, I think it is avery bad idea to allow decomposed characters in IRIs and that the RFCdoes not require percent encoding character sequences that are notinvariant under NFC.

This may have relevance to how the WF 2.0 url input works. That is,it probably SHOULD (MUST?) NOT percent-decode URIs that would resultin IRIs that are not invariant under NFC.)


--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/

[whatwg] Conformance requirements for IRIs

Reply via email to