Date: 2005-02-01T21:41:12
Editor: HossMan
Wiki: Jakarta Lucene Wiki
Page: LuceneFAQ
URL: http://wiki.apache.org/jakarta-lucene/LuceneFAQ

from lucene-users "Tue, 01 Feb 2005 08:48:36 -0500" by "Michael Giles"

Change Log:

------------------------------------------------------------------------------
@@ -506,6 +506,8 @@
 [http://jtidy.sourceforge.net/ JTidy] cleans up HTML, and can provide a DOM interface to the HTML files through a Java API.

+The author of [http://furl.net FURL] recommends [http://www.tagsoup.info TagSoup].
+
 ==== How can I index XML documents? ====