Hello.
I want to use LMJelinekMercerSimilarity (with lambda set to, say, 0.6) for
similarity calculations in Luke. By default, Luke uses DefaultSimilarity.
Can anyone help with this? I am using Lucene 4.10.4 and the Luke release
built for that version of the Lucene index.
Dwaipayan
Stemming is an inherently limited process. It doesn't know about the
word 'news'; it just has a rule about 's'.
Some of us sell commercial products that do more complex linguistic
processing and know which words are which.
There may be open source implementations of similar technology.
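To make the point about the 's' rule concrete, here is a minimal sketch in plain Java. It is not the Porter algorithm or any Lucene class; the class name NaiveSuffixRule and the single rule it applies are assumptions made up for illustration. It only shows why a purely rule-based step treats 'news' the same way it treats an ordinary plural.

```java
public class NaiveSuffixRule {

    // Hypothetical single rule: drop one trailing 's' unless the word
    // ends in "ss" or is three letters or shorter. There is no dictionary,
    // so the rule cannot tell a plural from a word that merely ends in 's'.
    public static String stripPlural(String word) {
        if (word.length() > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    public static void main(String[] args) {
        for (String w : new String[] {"cars", "news", "glass"}) {
            System.out.println(w + " -> " + stripPlural(w));
        }
    }
}
```

With this rule, "cars" becomes "car" as intended, but "news" is cut down to "new" in exactly the same way, because the rule only looks at the suffix.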
Hi Dwaipayan,
Another way is to use KeywordMarkerFilter. Stemmer implementations respect the
keyword attribute that it sets and leave marked tokens unchanged.
If you want to supply your own mappings, StemmerOverrideFilter could be
used as well.
ahmet
On Monday, March 14, 2016 4:31 PM, Dwaipayan Roy wrote:
I am using EnglishAnalyzer wi
Hi - if you don't want specific words passed through a stemmer, you need to
supply a CharArraySet with exclusions as the second argument to the
EnglishAnalyzer constructor.
Markus
-Original message-
> From: Dwaipayan Roy
> Sent: Monday 14th March 2016 15:31
> To: java-user@lucene.apache.org
> Subject: Pr
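
The two ideas suggested in the replies above (protecting specific words from the stemmer, and supplying your own word-to-stem mappings) can be sketched without Lucene as follows. This is a toy model only: the class ProtectedStemming, its methods, and the one-rule "stemmer" are hypothetical and are not Lucene's implementation. In a real analyzer these roles would be played by the stem exclusion set or a keyword-marking filter, and by an override dictionary.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class ProtectedStemming {

    // Stand-in for a real stemmer: a single made-up rule that drops a trailing 's'.
    static String stripPlural(String word) {
        if (word.length() > 3 && word.endsWith("s") && !word.endsWith("ss")) {
            return word.substring(0, word.length() - 1);
        }
        return word;
    }

    // An override mapping wins first, then the exclusion set protects the
    // word as-is, and only unprotected words reach the stemming rule.
    public static String stem(String word, Set<String> exclusions,
                              Map<String, String> overrides) {
        if (overrides.containsKey(word)) {
            return overrides.get(word);
        }
        if (exclusions.contains(word)) {
            return word;
        }
        return stripPlural(word);
    }

    public static void main(String[] args) {
        Set<String> exclusions = new HashSet<>(Arrays.asList("news"));
        Map<String, String> overrides = new HashMap<>();
        overrides.put("mice", "mouse"); // hypothetical custom mapping

        for (String w : new String[] {"news", "cars", "mice"}) {
            System.out.println(w + " -> " + stem(w, exclusions, overrides));
        }
    }
}
```

Here "news" is left untouched because it is in the exclusion set, "cars" still goes through the rule, and "mice" is mapped directly by the override table.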