Is there any html parser in cocoon that can strip a html page of all the tags and return only text content?
I want to read a mail, and print only the text contents and avoid all the html content.
how can this be done?
hi Smith
use html generator to convert this page to xhtml (xml well formed
document) and the parse this with an .xsl tha will produce the output you
want
-- stavros
On Tue, 7 Sep 2004, Adriano Smith wrote:
Is there any html parser in cocoon that can strip a html page of all the tags
HTMLGenerator followed by a stylesheet:
?xml version=1.0 encoding=ISO-8859-1?
xsl:stylesheet version=1.0
xmlns:xsl=http://www.w3.org/1999/XSL/Transform;
/xsl:stylesheet
should do it. The default behaviour of XSLT is to strip tags. So that
simple stylesheet should give you what you