RE: Indexing with Semantics

2012-05-03 Thread Yuval Kesten
Hi, The logic you are looking for is Lemmatization - http://en.wikipedia.org/wiki/Lemmatisation. I don't think Lucene has a built-in lemmatizer but you can use GATE which is an open source project: http://gate.ac.uk http://gate.ac.uk/gate/doc/plugins.html Enjoy! -Original Message-

RE: Custom lucene scoring - Dot product between field boost and query boost

2012-02-23 Thread Yuval Kesten
to normal... So cool! -Original Message- From: Yuval Kesten [mailto:ykes...@yahoo-inc.com] Sent: Wednesday, February 22, 2012 7:29 PM To: java-user@lucene.apache.org Subject: RE: Custom lucene scoring - Dot product between field boost and query boost Hi all, Inspired by another thread here

RE: Custom lucene scoring - Dot product between field boost and query boost

2012-02-22 Thread Yuval Kesten
, You can just override Similarity, rather than DefaultSimilarity - that way you don't burn any CPU cycles on TF/IDF calculations. Alan On 22 Feb 2012, at 07:17, Yuval Kesten wrote: Hi Em, 1. Regarding the performances - the similarity class (And my subtype as well) gets the IDF and TF

Custom lucene scoring - Dot product between field boost and query boost

2012-02-21 Thread Yuval Kesten
Hi, I want to use Lucene with the following scoring logic: When I index my documents I want to set for each field a score/weight. When I query my index I want to set for each query term a score/weight. I will NEVER index or query with many instances of the same field - In each query (document)

RE: Custom lucene scoring - Dot product between field boost and query boost

2012-02-21 Thread Yuval Kesten
The same question is formatted nicer here: http://stackoverflow.com/questions/9380188/custom-lucene-scoring-dot-product-between-field-boost-and-query-boost Thanks! -Original Message- From: Yuval Kesten [mailto:ykes...@yahoo-inc.com] Sent: Tuesday, February 21, 2012 5:18 PM To: java-user

RE: Custom lucene scoring - Dot product between field boost and query boost

2012-02-21 Thread Yuval Kesten
worried about? 2. The score I get from the TopScoreDocCollector is not the same as I get from the Explanation. Here is part of my code: Could you provide us the code where you are setting the Similarity, please? Kind regards, Em Am 21.02.2012 16:18, schrieb Yuval Kesten: Hi, I want to use Lucene