Re: [whatwg] Possible bugs : Microdata Itemscope on and

Tim van Oostrom Sun, 29 Nov 2009 09:57:45 -0800

Tim van Oostrom wrote:

Philip Jägenstedt wrote:
On Sun, 29 Nov 2009 12:46:16 +0100, Tim van Oostrom <t...@depulz.nl> wrote:
Philip Jägenstedt wrote:
On Thu, 26 Nov 2009 22:30:41 +0100, Tim van Oostrom <t...@depulz.nl> wrote:
Hi, I made a forum post: http://forums.whatwg.org/viewtopic.php?t=4176, concerning a possible "microdata specification bug" and a bug in the james.html5.org microdata extractor.
It comes down to <link/> and <meta/> elements possibly being unfit for use with the itemscope attribute.
I made an example in the forum post with some nice UBB formatting.
There are some other issues with <link> and <meta> you might want to review first: [1]
Ok
Your second example was:

<div itemtype="http://url.to/geoVocab#country" itemscope>
   <span itemprop="http://xmlns.com/foaf/spec/index.rdf#name" lang="cn">中華人民共和國</span>
   <span itemprop="http://xmlns.com/foaf/spec/index.rdf#name" lang="en">China</span>
   <link itemprop="http://url.to/city" href="http://url.to/shanghai" itemscope itemref="city-shanghai" />
   <div id="city-shanghai">
      <span itemprop="http://xmlns.com/foaf/spec/index.rdf#name">Shanghai</span>
      <span itemprop="http://url.to/demoVocab#population">14.61 million people</span>
      <span itemprop="http://url.to/physicsVocab#time" datetime="2009-11-26 11:43">11:43 pm (CT)</span>
   </div>
</div>
<link>, <meta> and any other void elements are usually the wrong choice for itemprop+itemscope because they don't have child elements, so itemref is the only way to add properties.
Yes, see the forum post. Shouldn't this be noted in the spec then?
Yes, the spec certainly needs some notes on how to use <link> and <meta>.

And other void elements such as: area, base, br, col, command, embed, hr, img, input, link, meta, param, source (http://dev.w3.org/html5/markup/syntax.html)

Basically, microdata can't really be used on all elements, as stated in: HTML5 spec, 5.2.2 Items

According to this, an "itemref" attribute can never be added to an "item" within an itemscope of another "item" without the crawled prop/val pairs also applying to the ancestor's itemscope.
Ah, I think you've found the root of the problem. By allowing a property to be part of several items at once, we get different kinds of strange problems. Apart from messing up your example, it seems it is the real cause of the infinite recursion bug I wrote about in [1]. I was then so focused on the recursion that I suggested a rather complex solution to detect loops in the microdata, when it seems it could be solved simply by making sure that a property belongs to only 1 item. Detailed suggestion below.
Now, back to the problem of one property, multiple items. The algorithm for finding the properties of an item [2] is an attempt at optimizing the search for properties starting at an item element. I think we should replace this algorithm with an algorithm for finding the item of a property. This was previously the case with the spec before the itemref mechanism. I would suggest something along these lines:
1. let current be the element with the itemprop attribute
2. if current has an ID, for each element e in document order:
2.1. if e has an itemref attribute:
2.1.1. split the value of that itemref attribute on spaces. for each resulting token, ID:
2.1.1.1. if ID equals the ID of current, return e
3. reaching this step indicates that the item wasn't found via itemref on this element
4. let parent be the parent element of current
5. if parent is null, return null
6. if parent has the itemscope attribute, return parent
7. otherwise, let current be parent and jump to step 2.
This algorithm will find the parent item of a property, if there is one. itemref'ing takes precedence over "parent-child linking", so in Tim's example the properties of Shanghai would be applied to only the Shanghai sub-item. I'm not convinced writing markup like that is a good idea, but at least this way it has sane processing.
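
The steps above can be sketched roughly as follows. This is only an illustration, not MicrodataJS or spec text: it assumes the markup is written as well-formed XML (itemscope="" instead of a bare itemscope) so that Python's standard xml.etree.ElementTree can stand in for a DOM, and the names find_item and MARKUP are made up for this example. The markup is a shortened version of Tim's example.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Shortened version of the example above, written as well-formed XML
# (an assumption made so the stdlib parser can read it).
MARKUP = """
<div itemtype="http://url.to/geoVocab#country" itemscope="">
  <span itemprop="http://xmlns.com/foaf/spec/index.rdf#name" lang="en">China</span>
  <link itemprop="http://url.to/city" href="http://url.to/shanghai"
        itemscope="" itemref="city-shanghai" />
  <div id="city-shanghai">
    <span itemprop="http://xmlns.com/foaf/spec/index.rdf#name">Shanghai</span>
  </div>
</div>
"""


def find_item(prop: ET.Element, root: ET.Element) -> Optional[ET.Element]:
    """Return the item that owns `prop`, or None (hypothetical helper)."""
    # ElementTree has no parent pointers, so build a child -> parent map.
    parents = {child: parent for parent in root.iter() for child in parent}
    current = prop                                    # step 1
    while True:
        current_id = current.get("id")
        if current_id:                                # step 2
            for e in root.iter():                     # document order
                # steps 2.1 to 2.1.1.1: split itemref on whitespace and compare
                if current_id in e.get("itemref", "").split():
                    return e
        parent = parents.get(current)                 # step 4
        if parent is None:                            # step 5
            return None
        if parent.get("itemscope") is not None:       # step 6
            return parent
        current = parent                              # step 7


root = ET.fromstring(MARKUP.strip())
link = root.find("link")
china_name = root.find("span")
shanghai_name = root.find("div/span")

# The Shanghai name is reached through itemref, so it belongs only to <link>.
print(find_item(shanghai_name, root) is link)
# The China name and the <link> itself belong to the outer country item.
print(find_item(china_name, root) is root, find_item(link, root) is root)
```

Note that the itemref check in step 2 runs for every ancestor on the way up, which is what keeps the properties inside the city-shanghai div from also being attached to the country item.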


Which is important in the markup-souped web of non-linked-data :-)

HTMLPropertiesCollection on any given element would simply match all elements in the document for which the algorithm returns that very element. It should be invalid for there to be any elements in the document with itemprop where this algorithm returns null or the element itself.
I will try implementing this algorithm in MicrodataJS [3] and see if it works OK. While it may look less efficient than the current algorithm, consider that a browser won't implement either algorithm as written, only act as if it did. The expensive step of going through all elements with itemref attributes is actually no more expensive than e.g. document.querySelector('.classname') if implemented natively.
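
A rough sketch of how the collection and the validity rule could be derived from the algorithm, under the same assumptions as before: well-formed XML markup parsed with Python's xml.etree.ElementTree in place of a DOM. The helper names (find_item, properties_of, invalid_properties) and the sample markup are invented for this illustration and are not part of any spec or library.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List, Optional

# Made-up sample: one item with an itemref, plus a property with no item.
MARKUP = """
<body>
  <div itemscope="" itemref="extra">
    <span itemprop="name">Shanghai</span>
  </div>
  <p id="extra"><span itemprop="population">14.61 million people</span></p>
  <span itemprop="orphan">no item</span>
</body>
"""

ParentMap = Dict[ET.Element, ET.Element]


def find_item(prop: ET.Element, root: ET.Element,
              parents: ParentMap) -> Optional[ET.Element]:
    """Item that owns `prop` per the proposed algorithm, or None."""
    current = prop
    while True:
        current_id = current.get("id")
        if current_id:
            for e in root.iter():
                if current_id in e.get("itemref", "").split():
                    return e
        parent = parents.get(current)
        if parent is None:
            return None
        if parent.get("itemscope") is not None:
            return parent
        current = parent


def properties_of(item: ET.Element, root: ET.Element) -> List[ET.Element]:
    """All itemprop elements for which the algorithm returns `item`."""
    parents = {c: p for p in root.iter() for c in p}
    return [e for e in root.iter()
            if e.get("itemprop") is not None
            and find_item(e, root, parents) is item]


def invalid_properties(root: ET.Element) -> List[ET.Element]:
    """itemprop elements whose item is missing or is the element itself."""
    parents = {c: p for p in root.iter() for c in p}
    bad = []
    for e in root.iter():
        if e.get("itemprop") is not None:
            owner = find_item(e, root, parents)
            if owner is None or owner is e:
                bad.append(e)
    return bad


root = ET.fromstring(MARKUP.strip())
item = root.find("div")
print([e.get("itemprop") for e in properties_of(item, root)])
print([e.get("itemprop") for e in invalid_properties(root)])
```

This version rescans the whole document for every property, which is fine for a sketch; a native implementation would only need to behave as if it did.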

I did something like this in my experimental/unfinished/test/learn microdata extractor based on jQuery, which is here: http://www.depulz.nl/microdata/ (works at least in FF 3.5 and Opera 10.10).

[1] http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2009-November/024095.html
[2] http://www.whatwg.org/specs/web-apps/current-work/multipage/microdata.html#the-properties-of-an-item
[3] http://gitorious.org/microdatajs
