HTMLParser

2004-09-07 Thread Adriano Smith
Is there any html parser in cocoon that can strip a html page of all the tags and return only text content? I want to read a mail, and print only the text contents and avoid all the html content. how can this be done?

Re: HTMLParser

2004-09-07 Thread gounis
hi Smith use html generator to convert this page to xhtml (xml well formed document) and the parse this with an .xsl tha will produce the output you want -- stavros On Tue, 7 Sep 2004, Adriano Smith wrote: Is there any html parser in cocoon that can strip a html page of all the tags

Re: HTMLParser

2004-09-07 Thread Upayavira
HTMLGenerator followed by a stylesheet: ?xml version=1.0 encoding=ISO-8859-1? xsl:stylesheet version=1.0 xmlns:xsl=http://www.w3.org/1999/XSL/Transform; /xsl:stylesheet should do it. The default behaviour of XSLT is to strip tags. So that simple stylesheet should give you what you