Re: [whatwg] Trying to work out the problems solved by RDFa

Calogero Alex Baldacchino Tue, 03 Feb 2009 19:15:54 -0800

Benjamin Hawkes-Lewis wrote:

On 12/1/09 20:26, Calogero Alex Baldacchino wrote:
I just mean that, as far as I know, there is no official standard
requiring UAs to support (parse and expose through the DOM) attributes
and elements which are not part of the HTML language but are found in
text/html documents.
Perhaps, but then prior to HTML5, much of what practical user agents must do with HTML has not been required by any official standard. ;)
RFC 2854 does say that "Due to the long and distributed development of HTML, current practice on the Internet includes a wide variety of HTML variants. Implementors of text/html interpreters must be prepared to be 'bug-compatible' with popular browsers in order to work with many HTML documents available the Internet."
http://tools.ietf.org/html/rfc2854
HTML 4.01 does recommend that "[i]f a user agent encounters an element it does not recognize, it should try to render the element's content" and "[i]f a user agent encounters an attribute it does not recognize, it should ignore the entire attribute specification (i.e., the attribute and its value)".
http://www.w3.org/TR/html401/appendix/notes.html#h-B.3.2
Clearly these suggestions are incompatible with respect to attributes; AFAIK all popular UAs insert unrecognized attributes into the DOM and plenty of web content depends on that behaviour.

Very, very true. HTML 4.01 also says the recommended behaviours are meant "to facilitate experimentation and interoperability between implementations of various versions of HTML", whereas the "specification does not define how conforming user agents handle general error conditions, including how user agents behave when they encounter elements, attributes, attribute values, or entities not specified in this document", and since "user agents may vary in how they handle error conditions, authors and users must not rely on specific error recovery behavior". I just think the last sentence defines a best practice everyone should follow, instead of relying on a common quirk that supports invalid markup. However, whether something is good or bad practice, there will always be authors doing whatever they please, so it is quite safe to assume UAs will always expose invalid/unrecognized attributes (that's unavoidable, given the need for backward compatibility).
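
To make the "expose unrecognized attributes" behaviour concrete, here is a minimal sketch in Python. It uses the standard library's html.parser as a stand-in for a UA's parser, which is an assumption: it is not a browser and implements no official parsing algorithm. The attribute names in the sample markup ("madeup", "data-rdfa-property") are invented for illustration.

```python
from html.parser import HTMLParser


class AttributeCollector(HTMLParser):
    """Records every start tag with all of its attributes, known or not."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; the parser does not
        # filter out attributes that are not part of HTML.
        self.seen.append((tag, dict(attrs)))


def collect_attributes(markup):
    parser = AttributeCollector()
    parser.feed(markup)
    parser.close()
    return parser.seen


# Hypothetical markup mixing an RDFa attribute, a data-* attribute and an
# entirely made-up one.
sample = '<p about="#me" data-rdfa-property="foaf:name" madeup="x">Alex</p>'
print(collect_attributes(sample))
```

All three attributes reach the script as plain strings; nothing in this sketch assigns them any meaning, which is the situation described above for scripted uses.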

Just like proprietary elements/attributes introduced with user agent behaviours (marquee, autocomplete, canvas), scripted uses of "data-*" might suggest new features to be added to HTML, which would then become requirements for UAs.
But unlike proprietary elements/attributes introduced with user agent behaviors, scripted uses of "data-*" do not impose new processing requirements on UAs.
Therefore, unlike proprietary elements/attributes introduced with user agent behaviors, scripted uses of "data-*" impose _no_ design constraints on new features.
Establishing user agent behaviours with "data-*" attributes, on the other hand, imposes almost as many design constraints as establishing them with proprietary elements and attributes. (There's just less pollution of the primary HTML "namespace".)
If no RDFa was in deployment, you could argue it would be less wrong (from this perspective) to abuse "data-*" than introduce new attributes.

Oh, well, I don't want to argue about that. For me the idea of using "data-rdfa-*" can rest in peace, since in practice it's no different from using RDFa attributes as they are, at least as far as they're handled by scripts, either client- or server-side. However, I think that:

* at present it is not clear enough what UAs not involved in a particular project should do with RDFa attributes, besides exposing their content for processing by a script. A precise behaviour should be defined, along with a clearly identified class of UAs (if any) not required to support it, and any caveats on possible problems and their solutions, before introducing any new elements/attributes in a formal specification;

* actual deployment might be harmed by the use of xml namespaces in html serialization.

Also, I see design suggestions more than impositions. If a new (and proprietary/private) attribute/element/convention is convincingly useful/needed, it is supported by other UAs and introduced in a specification; otherwise, if too few pages would be broken to matter, it might even be redefined for use with different semantics. And a possible process involving data-* attributes would/could be: experiment privately => extend the scale, involving other people who find it useful for their needs => get it into the primary namespace of an official specification (discarding the "data-" part and any other useless parts of the experimental name), so that existing pages may still work with their custom scripts or easily migrate to the new standard (and benefit from the new default support) by running a simple regex.
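
As a rough illustration of the "simple regex" migration step, the Python sketch below strips an experimental prefix from attribute names. The "data-rdfa-" prefix and the promote_attributes name are assumptions chosen for the example, not anything defined by a specification. Note that a plain regex cannot tell markup from text, so it would also rewrite look-alike strings inside attribute values or text content.

```python
import re

# Hypothetical experimental prefix; "data-rdfa-" is only an illustration.
# The lookbehind requires preceding whitespace, and the lookahead requires
# that what follows looks like an attribute name and an equals sign.
EXPERIMENTAL_PREFIX = re.compile(r'(?<=\s)data-rdfa-(?=[a-z][a-z0-9-]*\s*=)')


def promote_attributes(markup):
    """Drop the experimental prefix so the attribute uses its final name."""
    return EXPERIMENTAL_PREFIX.sub('', markup)


before = '<span data-rdfa-property="dc:title" data-other="1">T</span>'
print(promote_attributes(before))
```

Other data-* attributes are left alone, so pages that still rely on private scripts for them keep working.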

But to the extent that these attributes are already in use in text/html and standardized within the "http://www.w3.org/1999/xhtml" namespace, processing requirements are effectively already being imposed on user agents (such as not introducing conflicting treatment of the "about" attribute). All that adding user agent behaviours with "data-rdfa*" attributes would do at this point is add _more_ requirements, without rescuing the polluted attributes.

As far as html serialization is concerned, introducing xml namespaces (and, thus, xml extensibility - in whole or in part) might be worse than breaking current experimentation. Since xhtml, nearly all W3C output has converged towards XML, suggesting a direction the web didn't completely embrace, and instead raising objections to xml features that a good number of people felt were useless or unwanted, namespaces and extensibility among them; hence the need to evolve html serialization to address new demands without forcing a migration towards xml. Therefore, introducing pieces of xml inside text/html documents may be problematic; of course, other surrogate mechanisms might be defined to indicate a namespace for the sole purposes of RDFa, but this would raise consistency issues between html and xhtml (as reported by Henri Sivonen), perhaps solvable by specifying a double mechanism for xhtml (the html-specific one, and the "classic" xml one), but such a choice might add complexity to UAs and be confusing for authors.

As far as XHTML is concerned, I disagree with the introduction of RDFa attributes into the basic namespace, and I wouldn't encourage the same in the HTML5 spec. In the first place, I think there is a possible conflict with respect to the "content" attribute semantics, because it now requires different processing when used as an RDFa attribute and when used as a <meta> attribute associated with an "http-equiv" or a "name" value (for instance).
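
A small Python sketch of the ambiguity, under stated assumptions: the content_roles function and the role labels are made up for this example, and the pairing of "content" with "property" follows my reading of RDFa rather than any normative processing rule.

```python
def content_roles(attrs):
    """List the ways a processor might interpret the 'content' attribute.

    attrs maps attribute names to values for a single element. The role
    labels are hypothetical; they only name the competing interpretations.
    """
    if "content" not in attrs:
        return []
    roles = []
    if "http-equiv" in attrs:
        roles.append("pragma directive")
    if "name" in attrs:
        roles.append("document metadata")
    if "property" in attrs:
        roles.append("RDFa literal")
    return roles


# One element, two competing readings of the same attribute value.
print(content_roles({"name": "author", "property": "dc:creator", "content": "Alex"}))
```

When more than one role is returned, a processor has to decide which interpretation wins, or apply both, and nothing in the element itself says which.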

In the second place, it might be confusing for authors and lead to the misconception that every xhtml 1.x processor is also capable of processing rdfa metadata (this is a limit of namespace + dtd/schema based modularization, because one can define the structure of a document, but not "orthogonal" behaviours that require specific support not covered by the basic document model - such as collecting rdf triples declared by rdfa attributes, or calling a plugin and embedding its output - however, defining a proper namespace, maybe including its creation date somehow, may suggest what to expect from UAs).

In the third place, creating a different namespace would have made it far easier to introduce RDFa attributes into other xml languages without having to change the language to host them (by the way, the xhtml namespace and a related prefix can be used, but this requires more specific support because of the "content" attribute issue, especially in UAs not supporting DTDs or schemata - that is, what should happen if an element were declared with xhtml:name or xhtml:http-equiv together with xhtml:content and xhtml:datatype, in an xml document accepting any attributes from external namespaces? Of course, this is solvable, but rdfa:content, rdfa:datatype and so on would make things easier, or at least _cleaner_ and less confusing for authors, who otherwise have to understand that an XML and RDF processor can/must support the xhtml namespace and its _whole_ semantics, not just dom-related structures, yet only for RDFa attributes, so that no <meta> or <object> or <link> can be used in the hope that their semantics is supported, despite the support for the xhtml namespace...). Also, there might have been fewer attributes, each with its own distinct semantics (assuming someone might not find it useful to have a link with rel="stylesheet" representing a triple, for instance).


Of course, this is my opinion.

> I also guess that,
if microformats experience (or the "realworld semantics" they claim to
be based on) had suggested the need to add a new element/attribute to
the language, a new element/attribute would have been added.
I'm not really sure what you mean.
(It's watching the microformats community struggle with the problem of encoding machine data equivalents, for things like dates and telephone number types and measurements, that persuaded me HTML5 should include a generic machine data attribute, because it seems likely to me that the problem will be recurrent.)
--
Benjamin Hawkes-Lewis

If there were general agreement, a new element/attribute would be introduced as a result of a "bottom up" process (starting from experimentation) integrated with a "top down" community evaluation - for specific purposes, not generic machine exposure, I mean.

(I'm not sure a generic machine data attribute - in general, not just referring to rdfa - would solve that, because each new occurrence of the problem might require a "brand new" datatype that only newer, updated UAs would understand. Older ones would, at most, just parse the attribute and provide it as a string for further processing by a script, which might not be much better than using a data-* attribute for private script consumption. Therefore, that wouldn't necessarily be different from creating a new appropriate attribute/element as needed and providing the new feature in newer, compliant UAs.)
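
The concern can be sketched in Python as follows. Everything here is an assumption made for illustration: the datatype names, the KNOWN_DATATYPES table and the read_machine_value function do not come from any specification or proposal.

```python
from datetime import date

# Datatypes this hypothetical UA version understands. A newer UA would ship
# a larger table; an older one has no way to learn new entries.
KNOWN_DATATYPES = {
    "date": date.fromisoformat,
    "integer": int,
}


def read_machine_value(raw, datatype):
    """Convert raw using a known datatype, else hand back the plain string."""
    converter = KNOWN_DATATYPES.get(datatype)
    if converter is None:
        # Unknown datatype: the best the UA can do is expose the string,
        # exactly as it would for a data-* attribute.
        return raw
    return converter(raw)


print(read_machine_value("2009-02-03", "date"))
print(read_machine_value("12.5kg", "measurement"))
```

For the unknown "measurement" type the value comes back untouched, so a script still has to do the real work until UAs are updated.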



WBR, Alex


