problems with HTML Parser

Keith Gunn Wed, 14 Aug 2002 09:25:10 -0700

Has anyone noticed that the HTML parser that comes with
Lucene joins terms together when parsing a file?
I used to think it was my PDFParser, but after fixing that
I found out it was the HTMLParser.
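
To show what I mean by joined terms, here is a minimal sketch. It is not the Lucene parser's code; the class TagStripDemo and both methods are hypothetical, and it assumes the joining comes from dropping tags without leaving any whitespace behind, which I have not confirmed is the actual cause.

```java
import java.util.regex.Pattern;

public class TagStripDemo {
    // Crude pattern for anything that looks like an HTML tag (illustration only).
    private static final Pattern TAG = Pattern.compile("<[^>]*>");

    // Drops tags outright, so text on either side of a tag is glued together.
    public static String stripTagsJoining(String html) {
        return TAG.matcher(html).replaceAll("");
    }

    // Replaces each tag with a space, then collapses runs of whitespace.
    public static String stripTagsSeparating(String html) {
        return TAG.matcher(html).replaceAll(" ").replaceAll("\\s+", " ").trim();
    }

    public static void main(String[] args) {
        String html = "<tr><td>apple</td><td>banana</td></tr>";
        System.out.println(stripTagsJoining(html));    // applebanana
        System.out.println(stripTagsSeparating(html)); // apple banana
    }
}
```

With the first method, "apple" and "banana" end up indexed as the single term "applebanana". The second keeps them apart, though it would also split a word that has an inline tag in the middle of it, so it is only a rough workaround.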


I managed to find a replacement parser that doesn't join terms.

Just wondered if anyone else had come across this problem?



