Re: Modifying Length Normalization calculation

2011-06-13 Thread Lahiru Samarakoon
m(String field, FieldInvertState state). There is comprehensive info in the javadocs. Note that values are calculated at indexing and stored in the index encoded, with some loss of precision. -- Ian.

Re: Modifying Length Normalization calculation

2011-06-13 Thread Lahiru Samarakoon
of precision. -- Ian. On Mon, Jun 13, 2011 at 7:31 AM, Lahiru Samarakoon wrote: > Hi All, > I want to change the length normalization calculation specific to my application. By changing the "*number of terms*" accordin

Modifying Length Normalization calculation

2011-06-12 Thread Lahiru Samarakoon
Hi All, I want to change the length normalization calculation specific to my application by changing the "*number of terms*" according to my requirement. The "*StandardTokenizer*" works perfectly for my application. However, the *number of terms* calculated by the tokenizer is not the effective n
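The idea in this question can be illustrated with a minimal plain-Java sketch. It assumes the classic default length normalization of 1/sqrt(number of terms) and swaps in an adjusted term count. The class name, the `effectiveTermCount` helper, and the rule of skipping a set of ignored tokens are hypothetical, since the message is cut off before it says what the effective count should be. No Lucene classes are used here, so this only shows the arithmetic, not how to plug it into an index.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Plain-Java sketch of length normalization with an adjusted term count.
// Not Lucene code: names and the "ignored tokens" rule are assumptions.
final class LengthNormSketch {

    // Classic default normalization: 1 / sqrt(numTerms).
    // Returning 0 for an empty field is a choice made for this sketch only.
    static float lengthNorm(int numTerms) {
        if (numTerms <= 0) {
            return 0f;
        }
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    // Hypothetical rule: count only the tokens that are not in the ignored set.
    static int effectiveTermCount(List<String> tokens, Set<String> ignored) {
        int count = 0;
        for (String token : tokens) {
            if (!ignored.contains(token)) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("when", "null_1", "my", "null_1", "money", "fund");
        Set<String> ignored = new HashSet<>(Arrays.asList("null_1"));

        int raw = tokens.size();
        int effective = effectiveTermCount(tokens, ignored);

        System.out.println("raw terms = " + raw + ", norm = " + lengthNorm(raw));
        System.out.println("effective terms = " + effective + ", norm = " + lengthNorm(effective));
    }
}
```

A smaller effective count gives a larger norm, so a field padded with ignored tokens is no longer penalized for its raw length.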

Re: Lemmatization

2011-06-08 Thread Lahiru Samarakoon
Hi, > > Is there something in Lucene that supports lemmatization of the following > form: > > Mexican --> Mexico (from adjective to name/noun) > > Lemmatization does not change part of speech. I think you are looking for a stemming algorithm. http://nlp.stanford.edu/IR-book/html/htmledition/stemmi

Re: Finding the term positions of a document

2011-03-03 Thread Lahiru Samarakoon
Hi Ian, Yes, it does the job for me :). Thank you, Best Regards, Lahiru

Finding the term positions of a document

2011-03-03 Thread Lahiru Samarakoon
Hi All, I have a document and a list of terms. For each term, I want to find the position of that term in the given document. Please advise. Thanks, Lahiru
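As a rough illustration of what is being asked, here is a plain-Java sketch that maps each term to its token positions in a document. It assumes lowercase matching and whitespace tokenization, which is much simpler than what a Lucene analyzer does. The class and method names are hypothetical, and it does not use the Lucene API or reflect the answer given in the thread.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Plain-Java sketch: positions of given terms in a whitespace-tokenized document.
// Not Lucene code; tokenization here is an assumption made for illustration.
final class TermPositionsSketch {

    // Returns, for every requested term, the zero-based token positions where it occurs.
    // Terms that never occur map to an empty list.
    static Map<String, List<Integer>> positions(String document, Collection<String> terms) {
        Map<String, List<Integer>> result = new LinkedHashMap<>();
        for (String term : terms) {
            result.put(term.toLowerCase(Locale.ROOT), new ArrayList<Integer>());
        }

        String trimmed = document.trim();
        if (trimmed.isEmpty()) {
            return result;
        }

        String[] tokens = trimmed.toLowerCase(Locale.ROOT).split("\\s+");
        for (int i = 0; i < tokens.length; i++) {
            List<Integer> hits = result.get(tokens[i]);
            if (hits != null) {
                hits.add(i);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        String doc = "the quick brown fox jumps over the lazy dog";
        System.out.println(positions(doc, Arrays.asList("the", "fox", "cat")));
    }
}
```

Punctuation is not stripped in this sketch, so a token such as "fox," would not match the term "fox".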

BM 25 scoring with lucene

2011-03-01 Thread Lahiru Samarakoon
Hi All, Is there a BM25 scoring implementation that can be used with Lucene? How can I find and use the implementation mentioned in the following JIRA entry? https://issues.apache.org/jira/browse/LUCENE-2091 Thanks, Lahiru
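For readers unfamiliar with the formula behind the question, the following plain-Java sketch computes the BM25 contribution of a single term. It is not the patch attached to the JIRA entry and does not use any Lucene class. The class name, method names, the parameter values k1 = 1.2 and b = 0.75, and the non-negative idf variant ln(1 + (N - n + 0.5) / (n + 0.5)) are assumptions chosen for illustration.

```java
// Plain-Java sketch of the BM25 score for one term in one document.
// Not the LUCENE-2091 implementation; parameters and idf variant are assumptions.
final class Bm25Sketch {

    static final double K1 = 1.2;  // term-frequency saturation (commonly used value)
    static final double B = 0.75;  // strength of length normalization (commonly used value)

    // Non-negative idf variant: ln(1 + (N - n + 0.5) / (n + 0.5)).
    static double idf(long numDocs, long docFreq) {
        return Math.log(1.0 + (numDocs - docFreq + 0.5) / (docFreq + 0.5));
    }

    // BM25 contribution of a term occurring tf times in a document of docLen terms.
    static double termScore(int tf, int docLen, double avgDocLen, long numDocs, long docFreq) {
        if (tf <= 0) {
            return 0.0;
        }
        double lengthFactor = 1.0 - B + B * (docLen / avgDocLen);
        double tfPart = (tf * (K1 + 1.0)) / (tf + K1 * lengthFactor);
        return idf(numDocs, docFreq) * tfPart;
    }

    public static void main(String[] args) {
        // Same term frequency, documents shorter and longer than average.
        System.out.println("short doc: " + termScore(2, 50, 100.0, 1000, 10));
        System.out.println("long doc:  " + termScore(2, 200, 100.0, 1000, 10));
    }
}
```

With equal term frequency, the shorter document scores higher, which is the length normalization effect that B controls.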

Re: finding the length of a field

2011-03-01 Thread Lahiru Samarakoon
Thanks Nick, will try that. On Tue, Mar 1, 2011 at 12:00 PM, Nick Pellow wrote: > Have you considered storing the length of the field in a Payload? > You could do that during analysis. > > Cheers, > Nick > > On 01/03/2011, at 5:06 PM, Lahiru Samarakoon wrote: > > >

Re: finding the length of a field

2011-02-28 Thread Lahiru Samarakoon
Hi Anshum, I am trying to implement a custom length normalization mechanism, which needs the length of the field being searched. Yes, the field is stored. Thanks, Lahiru.

finding the length of a field

2011-02-28 Thread Lahiru Samarakoon
Hi all, Is there a way to find the length of a field of a Lucene index document? Thanks, Lahiru

Re: How to rebuild index

2011-01-21 Thread Lahiru Samarakoon
Hi, You have been using a system for two years, and it uses an index created with Lucene and the StandardAnalyzer, so there must be index creation code in your system. Anyway, since you have the book “*Lucene in Action*”, you can find out how to create an index by reading chapter 2 (Indexing). Please

Re: Lucene Ranking Problem

2011-01-18 Thread Lahiru Samarakoon
Hi Ian & Umesh, This is what I was looking for. Thanks a lot. Regards, Lahiru

Lucene Ranking Problem

2011-01-18 Thread Lahiru Samarakoon
Dear All, I have two documents. Their analyzed and tokenized contents are listed below. *Document 1 :* *when*, null_1, *my*, null_1, money, fund, amount, payment, creditcard, credit, card, *bank, account*, debit, deduct, *charge*, null_1, my, mobile, usage, *service*, connection *

Re: How to see results of an analyzer

2010-12-23 Thread Lahiru Samarakoon
Hi Umesh, Thanks for the descriptive answer. Using your method I can test my custom analyzers without creating an index. Best Regards, Lahiru

Re: How to see results of an analyzer

2010-12-22 Thread Lahiru Samarakoon
Hi Ian, Luke works for me. Thanks, Lahiru.

How to see results of an analyzer

2010-12-22 Thread Lahiru Samarakoon
Dear All, How can I see (or print) the result of a Lucene analyzer? I believe the analysis happens when the *writer.addDocument* or *writer.updateDocument* methods are called. Can anyone suggest a way to see the analysis process as it happens? If it is not possible in real time, how can

Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-14 Thread Lahiru Samarakoon
Uwe Schindler > > H.-H.-Meier-Allee 63, D-28213 Bremen > > http://www.thetaphi.de > > eMail: u...@thetaphi.de > > > > > >> -Original Message- > >> From: Lahiru Samarakoon [mailto:lahir...@gmail.com] > >> Sent: Tuesday, December 14, 201

Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
Dear Uwe, I removed the additional jar files and it is working now. Thanks a lot for the guidance. Regards, Lahiru

Re: java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
ttp://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: Lahiru Samarakoon [mailto:lahir...@gmail.com] > > Sent: Tuesday, December 14, 2010 7:05 AM > > To: java-user@lucene.apache.org > > Subject: java.lang.NoClassDefFoundError:

java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils

2010-12-13 Thread Lahiru Samarakoon
Hi All, I am using *lucene-3.0.2* with *spring framework-3.0.5*. Spring context initialization failed due to a *java.lang.NoClassDefFoundError: org/apache/lucene/util/CharacterUtils*, but that class cannot be found in the jars shipped with *lucene-3.0.2*. I am stumped here. Please advise. Thank