Re: HTML 4 Profile for RDFa

Shelley Powers Sat, 23 May 2009 13:36:25 -0700

Philip Taylor wrote:

Shane McCarron wrote:
Julian Reschke wrote:
It's clear that if RDFa is to be used with prefix declarations done with xmlns, then mixing uppercase and lowercase declarations is not going to work.
I think restricting prefixes to be lower-case (insert proper Unicode terminology here) would be acceptable; it's easy to live with, and avoids introducing yet another prefix declaration mechanism.
I would not be opposed to adding text in the RDFa in HTML definition like "prefix names SHOULD be defined in lower-case to help ensure maximum portability among parsers, since it is common for DOM-based parsers to not preserve the case of attribute names."
If portability isn't guaranteed in a very simple case like this, then it sounds like the specification would have failed at the fundamental task of specifying behaviour that will be interoperably implemented.
(Once portability is guaranteed, it might be good to recommend against using non-lowercase prefixes because they might have surprising (but portable) behaviour, but that's a very different reason.)
I don't see there being any need to change the definition of XML-based languages like RDFa for XHTML. After all, in XML case is preserved. Or is it someone's goal that documents be able to be parsed as EITHER XML or HTML? It's not my goal. If I define a document using an HTML family language, I expect it to be parsed using an HTML family parser. If I define it using an XHTML family language then I expect it to be parsed using an XML-conforming parser. Such a parser would preserve the case of elements and attributes.
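
The case-handling difference described above can be sketched as follows. This is only an illustration under stated assumptions: Python's standard html.parser stands in for an HTML-family parser and xml.dom.minidom for an XML-conforming one; the example.org URIs and the helper names are hypothetical and do not come from any RDFa specification or implementation.

```python
from html.parser import HTMLParser
import xml.dom.minidom

# Hypothetical markup: two prefix declarations that differ only by case.
MARKUP = (
    '<div xmlns:Foo="http://example.org/upper#" '
    'xmlns:foo="http://example.org/lower#"></div>'
)


class AttributeCollector(HTMLParser):
    """Records attribute names as reported by the HTML parser."""

    def __init__(self):
        super().__init__()
        self.attribute_names = []

    def handle_starttag(self, tag, attrs):
        # html.parser hands over attribute names already lower-cased.
        self.attribute_names.extend(name for name, _value in attrs)


def html_attribute_names(markup):
    """Attribute names as seen by an HTML-style parser."""
    collector = AttributeCollector()
    collector.feed(markup)
    collector.close()
    return collector.attribute_names


def xml_attribute_names(markup):
    """Attribute names as seen by an XML parser (case preserved)."""
    document = xml.dom.minidom.parseString(markup)
    attributes = document.documentElement.attributes
    return sorted(attributes.item(i).name for i in range(attributes.length))


if __name__ == "__main__":
    print("HTML parser:", html_attribute_names(MARKUP))
    print("XML parser: ", xml_attribute_names(MARKUP))
```

In this sketch the HTML-style parser reports both declarations under the same lower-cased name, so a processor working from its output could not tell the two prefixes apart, while the XML parser keeps them distinct.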
People will read the RDFa-in-XHTML specs and guides and tutorials and examples, and use the same syntax in their own pages. Then they'll serve their pages as text/html and expect it to work the same.
A survey of random pages from dmoz.org about a year ago found that ~18% used an XHTML doctype, and ~0.03% were served as application/xhtml+xml. On the Alexa top 200 a bit earlier (http://lists.w3.org/Archives/Public/public-html/2007Aug/1248.html), a third used an XHTML doctype and three quarters of those were not well-formed XML. So: Any new markup will be overwhelmingly served as text/html, and most of it that claims to be XHTML won't be usable with an XML parser.
Thus, the XHTML syntax will mostly be processed using the RDFa-in-text/html processing rules. If those rules don't do what people expect (after they've read the XHTML-focused specs and guides and tutorials and examples), then they will be surprised and unhappy and it will be a bad situation.
To make the situation better, either (a) the RDFa-in-XHTML documentation should all be removed and replaced with RDFa-in-text/html documentation so that people won't be encouraged to use the wrong syntax in their pages; or (b) the RDFa-in-XHTML syntax should give the same results (as far as possible, given the backward-compatibility constraints) when processed with the RDFa-in-text/html processing rules.
I presume (a) isn't going to happen. That leaves (b), which would require coordination between RDFa-in-XHTML and RDFa-in-text/html, and seems likely to require changes to the RDFa-in-XHTML spec to smooth out the differences.

Wow, Philip, you're using an 8-gauge shotgun to hunt baby bunnies here.

Can I take a leap of faith and guess that, of the 18% of web pages served up with the XHTML doctype, those not using well-formed XML are probably also not using RDFa?

The RDFa in XHTML spec doesn't need to change if a new document covering RDFa in HTML is created. Does it? Maybe a cross-reference between the documents, with a general warning about differences between the two documents, would be good.

As it is, there's probably going to be confusion about XHTML versus HTML with the HTML5 spec. I'm rather waiting for someone to use <br> in XHTML5.


Shelley
