Re: Disambiguation [was RE: "aid" microformats? (was Re: [uf-discuss]ISBN mark-up)]

Scott Reynen Mon, 01 May 2006 05:48:09 -0700

On May 1, 2006, at 2:29 AM, Joe Andrieu wrote:

You have to at least start parsing the html document in order toknow which profiles are used.
Agreed.
The presumption here is that processing is cheap and undirected.

There's no way to download only the DOCTYPE or the <head> of adocument, and processing is cheaper than bandwidth. Once you'vealready downloaded a whole document, you might as well parse it allbecause the <head> might be wrong about what's in the <body>.

See Kaboodle[1] or Backpack[2] or Scrapbook[3] for examples where
realtime, directed parsing is useful.

[A] http://www.kaboodle.com
[B] http://www.backpackit.com
[C] http://amb.vis.ne.jp/mozilla/scrapbook/

Basically, all of these could be seen as variants on Live Clipboard.

Right, but these all work client-side, where the document is alreadycompletely loaded. Many microformat parsers work server-side.

If there are only a handful of Microformats and they are all well-known,(and we have effectively hijacked the "class" default namespace),then the
processing should be manageable.


It is manageable.  It's just not worth doing because:

1) the whole document is already downloaded, which is the largest burden
2) <head>s lie.

But if there are thousands or tens of thousands of Microformats--and yes, Iknow this presumption is at odds with some of the expectationsbehind asocially moderated namespace--in that scenario, it is easy tocalculate thedifference of running a single attribute check for "microformat"instead of
checking against the entire Microformats space.

This was what I meant when I asked "How do Microformats scale?"

Microformats scale by re-use. Thousands or tens of thousands ofmicroformats is an anti-goal.

I don't believe we are in the latter situation where we need tight
coordination as in a protocol.

We need tight coordination as in a dictionary. A formal definitionof a shared lexicon is what allows us to communicate with newsymbols. You can use whatever class names you want, but if I don'tknow what they mean, I can't parse them, and a profile doesn't tellme what they mean, it just tells me whether they follow a certainsyntactic structure. Profiles are like a grammar check, but we stillneed a dictionary.

Instead, what we need is a simple way for
human authors to say "This is what I mean".

Profiles don't do that. No technology does that. That's anincredibly complex problem that no one has yet solved. When they do,we'll have usable machine translation, artificial intelligence, andmicroformats will be the least of our worries.

There is value in forging a tight class of well understood, easilyhumanauthored, semantic tags. However, Allowing rich variation on theexistingclasses doesn't "split" the community--the community is the socialnetwork,
not the semantic space.

In practice, social networks require shared understanding of whatthings mean. The lack of this shared understanding leads to civilwar in the real world, and unused specs in the tech world.

Instead, it allows exploration and differentiation,
which ultimately can be incorporated back into the foundationclasses. More
importantly, it allows user-driven innovation.

You can already explore and develop your own specs. But if you wantsomeone else to understand them, you have to explain to them what thespec means in clear human language. Machines don't understand.People understand. Clear human language is easier to accomplish incommunity than in isolation.

I think it is hubris to expect that the first adopted version of a
microformat is the orthodox way to do it and that variations areheresy.

It would be if microformats, or dictionaries, did much more thandocument and formalize existing use. Do you find hubris indictionaries as well? Who is this Webster to tell us what "dog"means? He's someone who documented how a lot of people use the word"dog" and wrote it down in a dictionary, just like we're documentinghow a bunch of people mark up citations, and writing it down in a wiki.

If our mantra includes basing our developments on real-world
examples, then how does the spec evolve if we don't have real-worldexamples
of derivative implementations?


We have a web full of real-world data publishing examples.

Without variations, we risk stagnation.

No one is preventing anyone from using whatever class names theywant. No one is preventing anyone from telling others what theirclass names mean. But no one has invented a technology to automateshared meaning. We can't use it because it doesn't exist.

I think the type of disambiguation I am talking about can beaddressed with
a simple microformat="profile" attribute.


Have you looked at profile URIs?

http://microformats.org/wiki/profile-uris

That accomplishes exactly what your microformat="profile" would,except it's valid XHTML. But neither accomplish shared meaning,which despite great effort, is still a human problem that requireshuman solutions.


Peace,
Scott
_______________________________________________
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss

Re: Disambiguation [was RE: "aid" microformats? (was Re: [uf-discuss]ISBN mark-up)]

Reply via email to