This is likely one of the many subtleties of the Porter stemmer. Dr. Porter chose a particular way of doing things, but it isn't necessarily right for everyone. You really have to measure the net benefit across all your searches, not just this one case. If you can't live with this particular case, you can implement a protected-words approach (a list of words that bypass the stemmer) or try some other stemmers.
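
A minimal sketch of what a protected-words approach could look like, in plain Java so that it does not depend on any particular Lucene version. Everything here is an assumption for illustration: ProtectedWordStemmer, protect() and override() are hypothetical names, not Lucene or Snowball API, and the stand-in stemmer in main() simply returns the word unchanged. The override() table goes one step beyond pure protection, since protecting "loveless" would only keep it as "loveless" rather than map it to "love".

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.function.UnaryOperator;

// Hypothetical helper, not a Lucene class: checks an exception table
// before handing a word to whatever stemmer you already use.
class ProtectedWordStemmer {
    private final Map<String, String> exceptions = new HashMap<>();
    private final UnaryOperator<String> stemmer;

    ProtectedWordStemmer(UnaryOperator<String> stemmer) {
        this.stemmer = stemmer;
    }

    // Protect a word: it is emitted unchanged and never reaches the stemmer.
    ProtectedWordStemmer protect(String word) {
        return override(word, word);
    }

    // Force a specific term for a word the stemmer handles badly.
    ProtectedWordStemmer override(String word, String term) {
        exceptions.put(word.toLowerCase(Locale.ROOT), term.toLowerCase(Locale.ROOT));
        return this;
    }

    // Table lookup first; otherwise fall back to the wrapped stemmer,
    // which receives the lowercased word.
    String stem(String word) {
        String key = word.toLowerCase(Locale.ROOT);
        String forced = exceptions.get(key);
        return forced != null ? forced : stemmer.apply(key);
    }

    public static void main(String[] args) {
        // Stand-in for the real stemmer, for the demo only: returns the word as-is.
        // Replace it with a call into the stemmer you actually use.
        UnaryOperator<String> standIn = w -> w;

        ProtectedWordStemmer s = new ProtectedWordStemmer(standIn)
                .override("loveliness", "love")
                .override("loveless", "love");

        for (String w : new String[] {"love", "loveliness", "loveless"}) {
            System.out.println(w + " -> " + s.stem(w));
        }
    }
}
```

Whatever form the table takes, it has to be applied identically at index time and at query time, otherwise the terms in the index and the terms in the query will not match.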

If you go to the Snowball site and peruse their archives, you will find much discussion of these kinds of issues.

Sorry I can't offer more in terms of a solution.

-Grant


On Dec 19, 2008, at 5:33 AM, Jay Malaluan wrote:


Hi,

I'm using the SnowballAnalyzer for my stemming processing.
search words: love, loved, loveliness, loveless, lovely, and loving

My index contains the word love. When searching, the two words loveliness and loveless are not stemmed to love. The odd thing is that loveliness is stemmed to "loveli" and loveless is not stemmed at all.

Has anyone already encountered this, and do you have suggestions for other Analyzers?


Regards,
Jay Malaluan
--
View this message in context: 
http://www.nabble.com/Stemming-behavior-tp21089115p21089115.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org


--------------------------
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
