Yes, it should be faster.
Also, I had to "patch" JTidy because it wasn't correctly cleaning HTML
that contains duplicate attributes.
I submitted a bug report to them, and they told me to wait for the next C
version to be ported to Java.
This says it all...
--
Nicola Ken Barozzi [EMAIL PROTECTED]
- verba volant, scripta manent -
(discussions get forgotten, just code remains)
---------------------------------------------------------------------
----- Original Message -----
From: "Jörn Heid" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, May 14, 2002 2:16 PM
Subject: AW: NekoHTML in Cocoon....
I haven't tested Tidy yet. From Sourceforge I got
"JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty
printer. Like its non-Java cousin, JTidy can be used as a tool for
cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM
parser for real-world HTML.".
So is Tidy purely DOM-based? If so, and NekoHTML supports SAX, NekoHTML
should be faster...
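
For what it's worth, the DOM/SAX difference behind that guess can be sketched with the JDK's own JAXP parsers on well-formed XML. This is only an illustration under assumptions: it uses neither JTidy nor NekoHTML, the class name SaxVsDom and the sample markup are made up, and it says nothing about the actual speed of either library. It just shows that a SAX consumer handles events as they stream by, while a DOM consumer waits for the whole tree to be built.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class; not part of Cocoon, JTidy or NekoHTML.
public class SaxVsDom {

    /** Counts elements from streaming SAX events; no tree is kept in memory. */
    public static int countWithSax(String xml) throws Exception {
        final int[] count = {0};
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                count[0]++; // called once per opening tag, as it is read
            }
        };
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
        return count[0];
    }

    /** Counts elements only after the complete DOM tree has been built. */
    public static int countWithDom(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        return doc.getElementsByTagName("*").getLength();
    }

    public static void main(String[] args) throws Exception {
        // Sample input is assumed to be well-formed already.
        String xml = "<html><body><p>one</p><p>two</p></body></html>";
        System.out.println("SAX: " + countWithSax(xml));
        System.out.println("DOM: " + countWithDom(xml));
    }
}
```

Both methods return the same count; the difference is only in how much has to be held in memory before the consumer can start working.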
-----Original Message-----
From: Reinhard Pötz [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, May 14, 2002 2:07 PM
To: [EMAIL PROTECTED]
Subject: RE: NekoHTML in Cocoon....
Are there any differences from HTML Tidy (speed, functionality)?
Reinhard
> -----Original Message-----
> From: Jörn Heid [mailto:[EMAIL PROTECTED]]
> Sent: Tuesday, May 14, 2002 1:27 PM
> To: [EMAIL PROTECTED]
> Subject: NekoHTML in Cocoon....
>
> NekoHTML is an HTML parser based on Xerces that can parse 'normal' HTML
> (not XHTML) and produces pure XML (SAX).
> http://www.apache.org/~andyc/nekohtml/doc/index.html
>
> Just wondering whether it could be useful... (I haven't tried it yet ;).
>
> One possible use case would be the ability for Cocoon developers to
> use old HTML files and change them with XSLT, e.g. including news
> pages not based on XML in Cocoon, filtering information from external
> pages, and so on.
>
> What do you think? I will ask Andy Clark for permission if you think it
> would be useful for Cocoon.
>
> JOERN
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, email: [EMAIL PROTECTED]