AUTOMATIC REPLY
LUX is closed until 5th January 2009
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
AUTOMATIC REPLY
LUX is closed until 5th January 2009
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
gt; From: Chris Hostetter
> To: java-user@lucene.apache.org
> Sent: Saturday, December 20, 2008 2:12:21 PM
> Subject: Re: Inquiry on Lucene Stemming
>
>
> : Well some client inquiries if it's possible to expand such simple words
> : and does Lucene have an API for this
: Well some client inquiries if it's possible to expand such simple words
: and does Lucene have an API for this logic? Because all I read was the
: stemming logic for Lucene was the other way around which is, example
: "flashing" it will be trimmed to the root word "flash" when searched.
ther
>
>
>
> From: Erick Erickson
> To: java-user@lucene.apache.org
> Sent: Tuesday, December 16, 2008 10:14:13 PM
> Subject: Re: Inquiry on Lucene Stemming
>
> Why do you want to do this? The reason I ask is that you're
> making each
read was the stemming
> logic for Lucene was the other way around which is, example "flashing" it
> will be trimmed to the root word "flash" when searched.
>
>
> Regards,
> Jay Malaluan
>
>
>
>
>
> ____
> From
flash" when searched.
Regards,
Jay Malaluan
From: Erick Erickson
To: java-user@lucene.apache.org
Sent: Tuesday, December 16, 2008 10:14:13 PM
Subject: Re: Inquiry on Lucene Stemming
Why do you want to do this? The reason I ask is that you're
m
Why do you want to do this? The reason I ask is that you're
making each clause very complex.
For a single term, it's not very complex, but for something like
((A AND B) OR (C AND D)) NOT X
expanding A, B, C, D and X to, possibly many terms is...er...ugly.
You could think about ngrams, althou
you stem the search query and while indexing, so only "flash" is indexed
when "flashing" is read.
If you don't wont to hurt your index with half word, you can use a second
index, just like for spelling :
http://blog.garambrogne.net/index.php?post/2008/03/07/A-lexicon-approach-for-Lucene-index
M.
Hi,
Can anyone comment if my understanding of the stemming process in Lucene is
correct. From my testing using the SnowballAnalyzer, if I passed this word
"flashing" it will be trimmed to a root word "flash" and this root word
("flash") will be the one searched not the original word "flashing"
10 matches
Mail list logo