Re: List parsing with Confluence parser wrong?

Lukas Theussl Thu, 25 Oct 2007 03:22:34 -0700


Vincent Massol wrote:

Hi,

The APT and Confluence parsers behave differently when parsing lists.
The APT parser generates paragraph()/paragraph_() events for each listitem whereas the Confluence parser doesn't.
So my questions are:
1) Who's right? This is very important since a Sink will outputdifferent results if the parsers behave differently

I think in this case the AptParser is wrong. I have recently modifiedthe xhtml sink [1] which, before, didn't emit paragraphs within listitems. I don't see a reason for that since paragraphs are legal andsignificant in list items (ie <li>item</li> is different from<li><p>item</p></li> and both are legal and meaningful). However, theAptParser behavior remains to be corrected, it's one of the few reasonswhy the apt module currently doesn't pass the identity test (see DOXIA-134).

2) How do we ensure parsers are correct in the events they send?

See related DOXIA-132. We don't have a mechanism yet to test parsingevents and since doxia is only about events (no object model), I don'tquite see how this can be done in general. In practice, I think thestandard is set by the AptParser, and the model emitted by theSinkTestDocument, all parsers should try to be consistent with that.

For 2), we should probably have an abstract test case similar to whatis done in AbstractSinkTest for Sinks.

There is already an AbstractParserTest, it currently only does a simplecheck with the WellformednessCheckingSink, but it should be extended.


HTH,
-Lukas

[1] https://svn.apache.org/viewvc?view=rev&revision=583579

For 1) I've checked and it seems TWiki also doens't output paragraph ()events for list items.
So is the AptParser wrong?

Thanks
-Vincent

Re: List parsing with Confluence parser wrong?

Reply via email to