Re: HTML 4 Profile for RDFa

Geoffrey Sneddon Sat, 23 May 2009 05:58:39 -0700


On 23 May 2009, at 13:34, Julian Reschke wrote:

For this to make sense in real HTML implementations, thedefinition should be in terms of the document layer rather thanthe byte layer.
Disagreed. Many implementations never build a DOM. We're not onlytalking about browsers here.
By "DOM" I generally mean any kind of tree structure of elementsand attributes, either as an explicit data structure (DOM, XOM,ElementTree) or implicit (SAX). Would any RDFa implementation *not*parse the input HTML into that kind of structure and operate overthe elements and attributes as distinct objects? (e.g. would theyjust use regular expressions over the input byte stream? That seemsquite infeasible to me...)
Depends on the definition of "tree structure". I've been involved incode that just uses a tokenizer and specialized stack, andimplementations like these will not do the re-arranging of elementsthe HTML5 spec specifies for some kinds of broken input.

Still specifying it relative to a DOM is still not problem, as you canincur the elements and text nodes from the token stream, until youreach the point where you are required by HTML 5 to throw a fatalerror (i.e., when you can no longer parse per spec with the stream, asyou can't reorder the elements).



--
Geoffrey Sneddon
<http://gsnedders.com/>

Re: HTML 4 Profile for RDFa

Reply via email to