Great! For my favorite example of "invest", "invests", etc. it shows:

SnowballEnglish:
•investment
•invest
•invests
•investing
•invested

kStem:
•investors
•invest
•investor
•invests
•investing
•invested

minimalStem:invest
•invest
•invests

That highlights the distinctions between these stemmers quite well, without highlighting the actual indexed term, which can be quite ugly.

-- Jack Krupansky

-----Original Message----- From: Elmer van Chastelet
Sent: Wednesday, November 21, 2012 8:49 AM
To: java-user@lucene.apache.org
Subject: Re: Which stemmer?

I've just created a small web application which you might find useful.
You can see which words are matched by a query word when using different
analyzers  (phonetic and stemming analyzers).
These include snowball, kstem and minimal stem (the ones on the right).

http://dutieq.st.ewi.tudelft.nl/wordsearch/

I can extend the app with more analyzers. Please let me know :)

--Elmer

Example

On 11/14/2012 07:55 PM, Scott Smith wrote:
Does anyone have any experience with the stemmers? I know that Porter is what "everyone" uses. Am I better off with KStemFilter (better performance) or ?? Does anyone understand the differences between the various stemmers and how to choose one over another?



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to