maybe you should encode the html code ...
Patrick Burleson wrote:
?? is < \> a valid tag? I think it should be < />Why oh why did you send this to the tomcat lists?
Don't cross post! Especially when the question doesn't even apply to one of the lists.
Patrick
On Tue, 7 Sep 2004 16:35:35 -0400, hui liu <[EMAIL PROTECTED]> wrote:
Hi,
I have such a problem when creating lucene index for many html files:
It shows "aborted, expected<tagname>....<tagend>" for those html files
which contain java scripts. It seems it cannot parse the tags < \>.
Do you want to index the whole HTML file, or just the information i this files?
Maybe you should use a HTML2TXT converter, and then index the resulting text.
All the best,
Sergiu
Does anyone has any solution?
Thank you very very much...!!!
Ivy.
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]