Re: Need pointers on using a very small part of Lucene

2002-03-14 Thread Kelvin Tan
Robert, > > I just have one more question - how do I remove repeated words? Does > anyone have a filter for doing this? > > For example, here's the result of one of my files being worked on: > "todai customer.formattedmailingaddress3 dear customer.dearnam respond > request inform productlongnam s

Indexing and Duplication

2002-03-14 Thread Kelvin Tan
I've got a little problem with indexing that I'd like to throw to everyone. My objects have a unique identifier. When indexing, before I create a new document, I'd like to check if a document has already been created with this identifier. If so, I'd like to retrieve the document corresponding to

Re: Need pointers on using a very small part of Lucene

2002-03-14 Thread Robert A. Decker
I must say, lucene is pretty damn cool. I now have it working and filtering stuff using a custom analyzer I built named FragmentAnalyzer. It works like a StandardAnalyzer but also uses the PorterStemFilter. I'm very impressed with its speed. I just have one more question - how do I remove repea
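A minimal sketch of one way to approach the repeated-words question above, assuming the analyzed output is already available as a whitespace-separated string of terms. It uses only the Java standard library; `RepeatedWords` and `removeRepeats` are hypothetical names for illustration, not Lucene classes, and this is not necessarily what the thread ended up recommending.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class RepeatedWords {

    // Hypothetical helper (not part of Lucene): keeps the first occurrence
    // of each term and drops later repeats, preserving the original order.
    public static String removeRepeats(String analyzedText) {
        Set<String> seen = new LinkedHashSet<>();
        for (String term : analyzedText.trim().split("\\s+")) {
            if (!term.isEmpty()) {
                seen.add(term); // a Set silently ignores terms already present
            }
        }
        return String.join(" ", seen);
    }

    public static void main(String[] args) {
        // Sample input is made up for illustration.
        System.out.println(removeRepeats("dear customer respond request dear customer"));
    }
}
```

A LinkedHashSet is used rather than a HashSet so that the surviving terms stay in the order they first appeared.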

Re: Need pointers on using a very small part of Lucene

2002-03-14 Thread Joshua O'Madadhain
On Thu, 14 Mar 2002, Robert A. Decker wrote: > Yes, unique terms. I've started looking at the StandardAnalyzer, and > related classes, and I'll see if I can use them for what I want. > > Also, I'd like to massage the text based a bit more than just the unique > terms. For example, common words shou

Re: Need pointers on using a very small part of Lucene

2002-03-14 Thread Robert A. Decker
Yes, unique terms. I've started looking at the StandardAnalyzer, and related classes, and I'll see if I can use them for what I want. Also, I'd like to massage the text based a bit more than just the unique terms. For example, common words should be removed (some of which are found in the StandardAn

StandardTokenizer.jj and queryparser.jj

2002-03-14 Thread Aruna Raghavan
Hi, I am trying to build the 1.2 -rc4 version of lucene in debug mode using JBuilder. I downloaded javacc2.1, ran javacc on StandardTokenizer.jj and QueryParser.jj. I am getting mismatch compiler errors with Token class, the ctor generated by javacc does not match the ctor lucene analysis code. An

Re: Using tagLibs with Lucene, Tomcat and NetBeans.

2002-03-14 Thread Eric Thoman
Thanks Peter, I thought someone here in the Lucene list might have already configured Lucene with taglibs, and possibly with Tomcat and NetBeans. If so, please feel free to forward any comments. Thanks again, Eric Peter Carlson wrote: > Hi Eric, > > I think that this is the wrong list to ask you

Re: Need pointers on using a very small part of Lucene

2002-03-14 Thread Peter Carlson
Hi, I am a little confused by your request. Asking to get the text that lucene would normally put into the index doesn't really make sense, since lucene is term based. What data are you trying to get? The set of unique terms for each document? If you are trying to use lucene to normalize the da

Need pointers on using a very small part of Lucene

2002-03-14 Thread Robert A. Decker
I would like to use a very small part of the functionality of Lucene, but need some pointers on which classes I should start looking at first. What I want to do is pass to a Lucene method some text, and have it return the text that it would normally put into the index. (I'll then take that text

Re: Using tagLibs with Lucene, Tomcat and NetBeans.

2002-03-14 Thread Peter Carlson
Hi Eric, I think that this is the wrong list to ask your question. Try out the netbeans list. Www.netbeans.org If you do create some lucene taglibs, please contribute them to the lucene contribution page. Do this by sending them to this list with [contrib] in the subject. Thanks --Peter On 3

Using tagLibs with Lucene, Tomcat and NetBeans.

2002-03-14 Thread Eric Thoman
Hi all, I've been told that the way to go with using JSP is to use tagLibraries. I've read thru the documentation on using jsp with tld's -- it makes a lot of sense. I'm currently trying to use NetBeans with Lucene and Tomcat using tagLibraries. My questions are as follows: 1) Considering debug

RE: Phone number Searches

2002-03-14 Thread Aruna Raghavan
Thanks, I am trying to do that. But the JBuilder IDE I am using does not recognize the .jj files. How do I link these in? -Original Message- From: Norbert Pabis [mailto:[EMAIL PROTECTED]] Sent: Thursday, March 14, 2002 8:41 AM To: Lucene Users List Subject: Re: Phone number Searches Re

Re: Phone number Searches

2002-03-14 Thread Norbert Pabiś
Recompile Lucene with debug on, then you will see exactly what it does. > -Original Message- > From: Aruna Raghavan [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, March 13, 2002 10:48 AM > To: Lucene Users List > Subject: Phone number Searches > > Hello All, > I tried doing a search for

RE: Phone number Searches

2002-03-14 Thread Aruna Raghavan
Hi, I have just noticed that "1-954-612-1276" (phrase query) works but a search for 1-954-612-1276 is returning all documents I have probably because in the latter case, lucene searcher is treating the "-" as exclusion. Is this correct? Thanks, Aruna. -Original Message- From: Aruna Raghav
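A small sketch of the workaround this message describes, assuming the goal is simply to send hyphenated input to the query parser in the quoted (phrase) form that the poster reports as working. `PhoneQuery` and `quoteIfHyphenated` are hypothetical names, not Lucene API, and whether the unquoted "-" is really parsed as exclusion is the poster's guess, which this sketch does not confirm.

```java
public class PhoneQuery {

    // Hypothetical helper (not part of Lucene): wraps input containing a
    // hyphen in double quotes so it is submitted as a phrase query string.
    public static String quoteIfHyphenated(String userInput) {
        String s = userInput.trim();
        boolean alreadyQuoted =
                s.length() >= 2 && s.startsWith("\"") && s.endsWith("\"");
        if (s.contains("-") && !alreadyQuoted) {
            return "\"" + s + "\"";
        }
        return s; // no hyphen, or already quoted: leave unchanged
    }

    public static void main(String[] args) {
        // The resulting string would then be handed to the query parser.
        System.out.println(quoteIfHyphenated("1-954-612-1276"));
    }
}
```

The helper only covers the single case raised in the message; input with embedded quotes or other query operators would need separate handling.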

Re: Accessing the "contents" field

2002-03-14 Thread William W
Hi Roshan, You will only have the contents in a field if you add it. Ex.: doc.add(Field.Text("contents", "My text")); "contents" is not a default name. William. >From: "ROSHAN NAVENDRA" <[EMAIL PROTECTED]> >Reply-To: "Lucene Users List" <[EMAIL PROTECTED]> >To: <[EMAIL PROTECTED]> >Subject: