(Sorry about my frustrated tone. I always get frustrated when I try to extract implementation directions from the wiki and fail. This isn't the first time. And I can read specs in general.)

On Mar 10, 2007, at 23:10, Paul Wilkins wrote:

Henri Sivonen wrote:

I needed a .bib-based bibliography generator for XHTML, so I wrote one with help from a friend who had developed a .bib parser. The output of my generator can be seen at http://hsivonen.iki.fi/thesis/html5-conformance- checker.xhtml#references

I've wrapped the values of .bib fields in elements whose class name is the .bib field name. I did it just in case. I don't have any consumer use case for those class names. It was just super- easy to generate them.

My use case (publishing an academic paper with a bibliography) is not mentioned as a use case at http://microformats.org/wiki/citation-brainstorming . More to the point, the wiki has no consumer use case for my publication use case.

Does this mean that hCite is not for me at all?

Not at all. You are using the BibTex format, which is covered in the citation formats http://microformats.org/wiki/citation-formats

Sure, but considering that I share my .bib, should I expect people to want to scrape my (X)HTML-formatted bibliography?

If hCite is for me, what's the elevator pitch convincing me to put more effort into my generator? What benefits should I expect if I do? Is hCite mature enough to be implemented yet?

The citation microformat is a work in progress at this stage, so it's not mature enough for programs to extract information from it,

I guess this means that I shouldn't try to support hCite on the generator side in my thesis considering that the document should go final on the first week of April.

Would it be of any use to anyone if I wrapped the name of each author/ editor in a <span class='fn'> if I otherwise leave my markup the way it is now?

The benefits are that people visitng your content with next generation tools wil be able to easily extract and use the information in more interesting and useful ways.

So basically, my effort would not be about catering to specific realistic foreseeable use cases. Instead, it would be about putting data out there in case someone figures out a use case later on.

Tantek has a recent presentation about the big picture of microformats at http://tantek.com/presentations/2007/02/microformats/

I think I know the base theory. I am interested in practical use cases and implementability in this particular case.

Moreover, is it even possible to generate hCite from my source data (http://hsivonen.iki.fi/thesis/dippa.bib) without sacrificing the presentation that I want and without potentially generating bogus markup for personal names?

One of the big ideas behind the use of microformats is that it will allow you to markup the content on your page without modifying the presentation of it.

Somehow, I was under the impression that hCite required bibliography items as <li>s instead of <dt>/<dd> pairs (which is what I use and what W3C and WHATWG specs use).

For example, my source data does not encode explicitly the given name, the family name and other stuff that isn't quite neither. As far as I can tell, it is impossible to tell heuristically that the middle token in these two names is semantically different:
Gavin Thomas Nicol
Henrik Frystyk Nielsen

Those issues haven't yet been covered for for the citation microformat.

What I'm trying to say is that I think hCite should allow names to be marked up as formatted names tossing the deformatting problem to the consumer. After all, one of the most popular bibliography data format, BibTeX, stores formatted names.

It may be possible for for a generator to parse through them and extract the appropriate information though. For example, honorific-prefix and honorific-suffix are a rather short list. Then after those, the given name, family name and additional name could be extracted in that particular order.

Using heuristics in the generator to make explicit metadata statements is generally a bad idea. If the result is wrong, it still pretends to be authoritative. If heuristics are involved, the input to the heuristic should be sent and consumers should be able to compete on how good their heuristics are.

--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/


_______________________________________________
microformats-discuss mailing list
microformats-discuss@microformats.org
http://microformats.org/mailman/listinfo/microformats-discuss

Reply via email to