Hi André,
Florent André schrieb:
I would like to parse localy downloaded (via <xi:include parse="text">)
html pages.
I'm afraid this approach will only cause a lot of headache. I'd rather
recommend to use the HTMLGenerator [1] to parse the files. In your
XInclude statement you can just call the HTMLGenerator pipeline using
the cocoon:/ protocol.
[1] http://cocoon.apache.org/2.1/userdocs/html-generator.html
HTH,
-- Andreas
After download, <xi:include> give me an "escape" html file.
I suppress <!Doctype ... > with regex, but now the unescape transformer
throw this error :
" Caused by: org.xml.sax.SAXParseException: The entity "nbsp" was
referenced, but not declared."
I found this on internet : "To allow the use off   in you stylesheet,
you have to declare it first : <!DOCTYPE xsl:stylesheet [<!ENTITY nbsp
" ">]> "
How I can add this declaration in the java unescape transformer ?
I think that I can remove all   with a regex, but I would like to more
understand how work java transformer.
Thanks and have a good day.
Florent
--
Andreas Hartmann, CTO
BeCompany GmbH
http://www.becompany.ch
Tel.: +41 (0) 43 818 57 01
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]