Hi,
The logic you are looking for is Lemmatization -
http://en.wikipedia.org/wiki/Lemmatisation.
I don't think Lucene has a built-in lemmatizer but you can use GATE which is an
open source project:
http://gate.ac.uk
http://gate.ac.uk/gate/doc/plugins.html
Enjoy!
-Original Message-
to normal... So cool!
-Original Message-
From: Yuval Kesten [mailto:ykes...@yahoo-inc.com]
Sent: Wednesday, February 22, 2012 7:29 PM
To: java-user@lucene.apache.org
Subject: RE: Custom lucene scoring - Dot product between field boost and query
boost
Hi all,
Inspired by another thread here
,
You can just override Similarity, rather than DefaultSimilarity - that way you
don't burn any CPU cycles on TF/IDF calculations.
Alan
On 22 Feb 2012, at 07:17, Yuval Kesten wrote:
Hi Em,
1. Regarding the performances - the similarity class (And my subtype as well)
gets the IDF and TF
Hi,
I want to use Lucene with the following scoring logic:
When I index my documents I want to set for each field a score/weight.
When I query my index I want to set for each query term a score/weight.
I will NEVER index or query with many instances of the same field - In each
query (document)
The same question is formatted nicer here:
http://stackoverflow.com/questions/9380188/custom-lucene-scoring-dot-product-between-field-boost-and-query-boost
Thanks!
-Original Message-
From: Yuval Kesten [mailto:ykes...@yahoo-inc.com]
Sent: Tuesday, February 21, 2012 5:18 PM
To: java-user
worried about?
2. The score I get from the TopScoreDocCollector is not the same as I
get from the Explanation.
Here is part of my code:
Could you provide us the code where you are setting the Similarity, please?
Kind regards,
Em
Am 21.02.2012 16:18, schrieb Yuval Kesten:
Hi,
I want to use Lucene