List Fellows:
Lacking any knowledge of JavaCC, I solicted help in hacking the
HTMLParser.jj included in the demo. I retreat from this solication, for two
reasons: 1) I'm using other ideas gleaned from the list archives, 2) I'm not
prepared to dive into the world of complier compliers. The mere so
List fellows:
I did not see an answer to my further questioning of the use of the Entities
class found in the Lucene demo, possibly because it was before I knew what I
was looking for.
I have reviewed the code carefully, but need help with the HTMLParser.jj.
This JavaCC piece appears to be at th
"?".
> -Original Message-
> From: Joshua O'Madadhain [mailto:[EMAIL PROTECTED]]
> Sent: Monday, September 02, 2002 20:36
> To: Lucene Users List
> Subject: Re: Newbie quizzes further...
>
>
> On Mon, 2 Sep 2002, Stone, Timothy wrote:
>
> >
On Mon, 2 Sep 2002, Stone, Timothy wrote:
> I have noted that Lucene fails to interpret numerous HTML entities,
> specifically entities in the 82xx range, i.e. — (en-dash) and
> many others. Now this may not be a Lucene issue, I'm looking at the
> code as I post, but I'm curious to its origins an