Hi André,

Florent André schrieb:
I would like to parse localy downloaded (via <xi:include parse="text">)
html pages.

I'm afraid this approach will only cause a lot of headache. I'd rather recommend to use the HTMLGenerator [1] to parse the files. In your XInclude statement you can just call the HTMLGenerator pipeline using the cocoon:/ protocol.

[1] http://cocoon.apache.org/2.1/userdocs/html-generator.html

HTH,

-- Andreas


After download, <xi:include> give me an "escape" html file.

I suppress <!Doctype ... > with regex, but now the unescape transformer
throw this error : " Caused by: org.xml.sax.SAXParseException: The entity "nbsp" was
referenced, but not declared."
I found this on internet : "To allow the use off &nbsp in you stylesheet,
you have to declare it first :  <!DOCTYPE xsl:stylesheet [<!ENTITY nbsp
"&#160;">]> "

How I can add this declaration in the java unescape transformer ?

I think that I can remove all &nbsp with a regex, but I would like to more
understand how work java transformer.

Thanks and have a good day.

Florent


--
Andreas Hartmann, CTO
BeCompany GmbH
http://www.becompany.ch
Tel.: +41 (0) 43 818 57 01


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to