Re: [whatwg] namespaces in html5

David Karger Mon, 18 Jul 2011 07:23:18 -0700

OK, per Ian's suggestion I'm starting a new thread on a problem that I'd hoped html5 would solve for us. As far as I know the problem still exists so I'm going to raise it here. I'm coming late to the discussion so will surely retread old territory (for example, http://www.pacificspirit.com/blog/2008/03/13/namespaces_in_html_readings); my apologies for that.

I am one of the PIs on the SIMILE project (http://simile.mit.edu/) that developed the Exhibit data visualization framework (http://simile-widgets.org/exhibit). The goal of Exhibit is to make it easy for non-programmers to embed interactive data visualizations in their web pages. Our approach is to leverage the willingness of many non-programmers to author html (a key contributor to the early growth of the web). To do so, Exhibit extends the html vocabulary with attributes that describe data, visualizations of that data, and interactions with that data. For example, a tag of the form <div ex:role="view" ex:viewclass="timeline"> embeds a simile timeline in the html document, while <div ex:role="facet"> embeds a facet that can be used to filter the data being viewed in the timeline. Exhibit offers a javascript library that interprets these tags and implements the requested widgets on the client side.
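
As a rough illustration of what that interpretation step involves, the sketch below picks the prefixed attributes out of an element's attribute list. This is not Exhibit's actual code: the helper name collectPrefixed and the plain array of name/value pairs (standing in for a DOM element's attributes) are assumptions made so the example runs without a browser.

```javascript
// Hypothetical helper, not part of Exhibit: gather the attributes that carry
// a given prefix. `attrs` is a plain array of { name, value } pairs standing
// in for a DOM element's attribute list.
function collectPrefixed(attrs, prefix) {
  const result = {};
  for (const { name, value } of attrs) {
    // HTML parsers lowercase attribute names, so compare in lowercase.
    const lower = name.toLowerCase();
    if (lower.startsWith(prefix)) {
      // Store the attribute under its name with the prefix stripped.
      result[lower.slice(prefix.length)] = value;
    }
  }
  return result;
}

// The attributes of <div ex:role="view" ex:viewclass="timeline" id="t1">
const timelineAttrs = [
  { name: "ex:role", value: "view" },
  { name: "ex:viewclass", value: "timeline" },
  { name: "id", value: "t1" },
];

// Only the ex: attributes are kept; the plain id attribute is ignored.
const timelineConfig = collectPrefixed(timelineAttrs, "ex:");
console.log(timelineConfig);
```

A library built this way reacts only to attributes with its own prefix and leaves everything else on the element alone, which is the property the prefix was chosen to provide.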

You will note that our special attributes use an "ex:" prefix. This decision was taken in 2006, when it appeared that prefix-based namespaces were in HTML's future. It addressed our concern that the new attributes we defined should not collide with those defined by other projects. Now that namespaces apparently will not be part of html5, we are wondering how we can properly offer our extended html vocabulary. In particular, it seems highly desirable for us to be able to write Exhibit pages using html that will validate. Below I'll outline some of the characteristics of our desired solution, while emphasizing that we'd be happy to adopt _any_ solution with these characteristics, and are not wedded to namespaces.

I first justify our approach of html vocabulary extension. A programmer can argue that a better approach is to offer our javascript library with a good api, and allow programmers to invoke our widgets programmatically in script tags. This works fine for programmers, but excludes the large population of users who are afraid of programming but are willing to fiddle with html. These users were a potent force in the early days of the web and we believe they continue to play an important role. They may not even "know" html; the simplicity and regularity of the syntax allows them to copy, paste, and even modify page elements they like without fully understanding them. Specifying data interactions in the more restricted html syntax instead of programmatic javascript also opens up the possibility for more effective semantics; for example, it is easier for a browser to offer an accessible version of a data-filtering facet if it is explicitly named as a facet rather than being arbitrary embedded javascript code.

If we accept the need for html language extensibility, there are several potential approaches. One is html polyglot. Permitting a blended html/xml representation, polyglot would allow us to extend the vocabulary via xml namespaces. But polyglot fails to meet our need in fatal ways. Polyglot restricts the html that can be used, for example excluding the use of <noscript> tags. Such tags are essential when using Exhibit, since we want to offer some information presentation for the case when our visualization javascript is unable to execute. More generally, polyglot appears to demand much more rigid fidelity to precise html/xml syntax, for example demanding tbody and colgroup tags where they are optional in html. This is something that the novice "programmers" we are targeting are particularly bad at. One of the real accomplishments of html has been the great efforts of the browser developers to robustly handle invalid html. We want to continue to benefit from that effort instead of having pages fail because xml parsing is performed much more rigidly than html parsing.

Another approach would be to use the catchall html5 data- prefix for attributes. We could certainly prefix all of our specialized attributes with the data- prefix, which would make those attributes valid html. This solution is unsatisfactory for two reasons. The first is that our attributes are not data attributes: we are not using microformat-oriented data attributes; rather, we are using attributes that describe visualizations. data- seems a poor choice of prefix. The second problem that concerns me is attribute collisions. If we use an attribute like data-role="view", how long will it be before an exhibit author runs into a situation where a different javascript library is using the same data-role attribute for a different purpose, which would make the two libraries incompatible with one another?
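
To make the collision concern concrete, here is a toy model of two libraries that both settled on the bare name data-role. Both libraries, their names, and their handlers are hypothetical, and the attribute set is a plain object rather than a real DOM element so the sketch runs anywhere.

```javascript
// Toy model of the collision; both libraries and their handlers are hypothetical.
// Each entry records which attribute a library scans for and how it reads the value.
const libraries = [
  { name: "exhibit-like", attr: "data-role", handle: (v) => `visualization:${v}` },
  { name: "other-widgets", attr: "data-role", handle: (v) => `ui-widget:${v}` },
];

// Return every library that would try to process an element with these attributes.
function claimants(attrs) {
  return libraries
    .filter((lib) => Object.prototype.hasOwnProperty.call(attrs, lib.attr))
    .map((lib) => ({
      library: lib.name,
      interpretation: lib.handle(attrs[lib.attr]),
    }));
}

// Attributes of <div data-role="view">
const claims = claimants({ "data-role": "view" });
console.log(claims.length); // 2: both libraries act on the same element
```

Nothing in the markup tells the two interpretations apart, so a page author who loads both libraries has no way to say which one the attribute was meant for.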

In 2006, the predicted namespace prefixes seemed an obvious solution to our problem: we would define a namespace for our Exhibit framework, and our javascript would only pay attention to attributes from that namespace. I have no specific loyalty to namespaces, but I am really hopeful that html5 will offer us a solution that reflects the issues I outlined above, namely:

* allow extension of the html5 vocabulary with attributes Exhibit will use to anchor visualizations,

* such that the resulting html will validate,

* without requiring rigid obedience to the challenging html polyglot syntax, which is beyond the capabilities of our target novice web authors,

* and protecting us from a future in which collisions on choice of attribute names make our library/vocabulary incompatible with others'.


On 7/18/2011 8:46 AM, Ian Hickson wrote:

On Mon, 18 Jul 2011, David Karger wrote:

   I wish to submit a comment regarding the (non) use of namespaces in
html5. But I hope you might help me track down the relevant issue off
which to hang that comment.  Some time ago I found a lengthy discussion
of whether html5 should use namespaces, with an over-simplified summary
being "we haven't seen any important use cases for them, so let's not
bother".  I would like to respond to that discussion by proposing a use
case, but I cannot find it. Searching the bugzilla database has failed.
Would you happen to recall participating in this discussion and know
where it is?

You can just post a new thread here.

I recommend describing the problem you wish to address separately from
your preferred solution. Also I recommend using a word other than
"namespaces" to describe your preferred solution, as that word is usually
used in the Web context to refer to some specific designs with known
problems, and it is likely that you actually want something different.
