My impression was that JTidy had to make a complete pass over the document in
order to tidy it. This would preclude using it for a SAX (stream-based) parser.
court
Wong Kok Wai wrote:
> Hi, Assaf,
>
> I believe you've suggested using Tidy/JTidy to preprocess the HTML before
> parsing. Possible
properly (or so it seems).
court
Assaf Arkin wrote:
> I think it's good enough and certainly seems to be compliant with the
> specs.
>
> arkin
>
> (HTML parser will be coming, just as soon as I finish some DSML stuff)
>
> Court Demas wrote:
> >
> >
I've hacked up an implementation of the HTMLDocument.write() and writeln()
methods. It's cheesy but functional. I'm wondering if it would be interesting
to have this added, or if somebody is already working on implementing it the
Right Way.
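
For illustration only, here is a minimal sketch of one way write() and
writeln() could accumulate their arguments in a buffer until the stream is
closed. This is not the actual patch: the class name DocumentWriteBuffer is
hypothetical, and it is an assumption of this sketch that close() hands the
collected markup back as a string for a parser to consume.

```java
// Hypothetical helper for illustration; not part of any DOM implementation.
final class DocumentWriteBuffer {
    // Holds everything written so far; null while no write stream is open.
    private StringBuilder buffer;

    /** Starts a fresh write stream, discarding anything buffered earlier. */
    void open() {
        buffer = new StringBuilder();
    }

    /** Appends text, implicitly opening a stream if none is open yet. */
    void write(String text) {
        if (buffer == null) {
            open();
        }
        buffer.append(text);
    }

    /** Same as write(), followed by a newline character. */
    void writeln(String text) {
        write(text);
        buffer.append('\n');
    }

    /** Ends the stream and returns the accumulated markup (assumed to be parsed by the caller). */
    String close() {
        String markup = (buffer == null) ? "" : buffer.toString();
        buffer = null;
        return markup;
    }
}
```

Nothing is parsed until close() is called, so the caller sees the whole
string at once, much like the complete-pass behaviour discussed for JTidy
earlier in this thread.
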
The idea is to just build up a string of what was writt