Great! For my favorite example of "invest", "invests", etc. it shows:
SnowballEnglish:
•investment
•invest
•invests
•investing
•invested
kStem:
•investors
•invest
•investor
•invests
•investing
•invested
minimalStem:invest
•invest
•invests
That highlights the distinctions between these stemmers quite well, without
highlighting the actual indexed term, which can be quite ugly.
-- Jack Krupansky
-----Original Message-----
From: Elmer van Chastelet
Sent: Wednesday, November 21, 2012 8:49 AM
To: java-user@lucene.apache.org
Subject: Re: Which stemmer?
I've just created a small web application which you might find useful.
You can see which words are matched by a query word when using different
analyzers (phonetic and stemming analyzers).
These include snowball, kstem and minimal stem (the ones on the right).
http://dutieq.st.ewi.tudelft.nl/wordsearch/
I can extend the app with more analyzers. Please let me know :)
--Elmer
Example
On 11/14/2012 07:55 PM, Scott Smith wrote:
Does anyone have any experience with the stemmers? I know that Porter is
what "everyone" uses. Am I better off with KStemFilter (better
performance) or ?? Does anyone understand the differences between the
various stemmers and how to choose one over another?
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org