Demo provided HTML parser bug (was RE: Newbie quizzes further...)

2002-09-06 Thread Stone, Timothy
List Fellows: Lacking any knowledge of JavaCC, I solicited help in hacking the HTMLParser.jj included in the demo. I retreat from this solicitation, for two reasons: 1) I'm using other ideas gleaned from the list archives, 2) I'm not prepared to dive into the world of compiler compilers. The mere so

RE: Newbie quizzes further...

2002-09-04 Thread Stone, Timothy
List fellows: I did not see an answer to my further questioning of the use of the Entities class found in the Lucene demo, possibly because it was before I knew what I was looking for. I have reviewed the code carefully, but need help with the HTMLParser.jj. This JavaCC piece appears to be at th

RE: Newbie quizzes further...

2002-09-03 Thread Stone, Timothy
"?". > -Original Message- > From: Joshua O'Madadhain [mailto:[EMAIL PROTECTED]] > Sent: Monday, September 02, 2002 20:36 > To: Lucene Users List > Subject: Re: Newbie quizzes further... > > > On Mon, 2 Sep 2002, Stone, Timothy wrote: > > >

Re: Newbie quizzes further...

2002-09-02 Thread Joshua O'Madadhain
On Mon, 2 Sep 2002, Stone, Timothy wrote: > I have noted that Lucene fails to interpret numerous HTML entities, > specifically entities in the 82xx range, e.g. &#8212; (em dash) and > many others. Now this may not be a Lucene issue; I'm looking at the > code as I post, but I'm curious as to its origins an
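
For readers following the thread, here is a minimal sketch of how numeric character references in the 82xx range can be decoded before text is handed to an indexer. It is not the demo's HTMLParser.jj or its Entities class; the class name NumericEntityDecoder and the method decode are hypothetical, and the sketch assumes only decimal and hexadecimal numeric references need handling (named entities such as &mdash; are left untouched).

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene or its demo.
final class NumericEntityDecoder {

    // Matches decimal (&#8212;) and hexadecimal (&#x2014;) numeric references.
    // The digit limits keep the parsed value within int range.
    private static final Pattern NUMERIC_REF =
            Pattern.compile("&#(?:[xX]([0-9a-fA-F]{1,6})|([0-9]{1,7}));");

    static String decode(String html) {
        Matcher m = NUMERIC_REF.matcher(html);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            int codePoint = (m.group(1) != null)
                    ? Integer.parseInt(m.group(1), 16)
                    : Integer.parseInt(m.group(2), 10);
            // Leave the reference as-is if it is not a valid Unicode code point.
            String replacement = Character.isValidCodePoint(codePoint)
                    ? new String(Character.toChars(codePoint))
                    : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        // 8212 decimal is U+2014; 2013 hex is U+2013.
        System.out.println(decode("Lucene &#8212; search &#x2013; demo"));
    }
}
```

Running decode over the raw text before tokenizing would turn such references into the characters they stand for; whether that belongs in the parser or in a separate pre-processing step is a design choice this sketch does not settle.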