Hi David,

I like KStem more than Porter / Snowball. It performs better because it has a dictionary to augment the rules, but it still has limitations.
Note that KStem will also treat "print" and "printer" as two distinct terms, probably treating them as a verb and a noun respectively.

Cheers
Pete Lewis

----- Original Message -----
From: "David Spencer" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Tuesday, September 14, 2004 7:19 PM
Subject: Re: PorterStemfilter

> Honey George wrote:
>
> > Hi,
> > This might be more of a question related to the
> > PorterStemmer algorithm rather than to Lucene, but
> > if anyone has the knowledge please share.
>
> You might want to also try the Snowball stemmer:
>
> http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/
>
> And KStem:
>
> http://ciir.cs.umass.edu/downloads/
>
> > I am using the PorterStemFilter that comes with Lucene
> > and it turns out that searching for the word 'printer'
> > does not return a document containing the text
> > 'print'. To narrow down the problem, I have tested the
> > PorterStemFilter in a standalone program and it turns
> > out that the stem of printer is 'printer' and not
> > 'print'. That is, 'printer' is not equal to 'print' +
> > 'er'; the whole of the word is the stem. Can somebody
> > explain the behavior?
> >
> > Thanks & Regards,
> > George
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
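
On George's original question of why Porter leaves 'printer' alone: as far as I understand the published algorithm, the rule that strips "-er" only fires when the remaining stem has a "measure" m greater than 1, where m counts the vowel-to-consonant transitions in the stem. 'print' has m = 1, so the suffix stays. The sketch below is my own illustration, not Lucene's PorterStemmer code; the class and method names (PorterMeasure, measure, stripEr) are made up, it assumes lower-case input, and it implements only that single "-er" rule rather than the whole stemmer.

```java
final class PorterMeasure {

    // Porter's definition: a, e, i, o, u are vowels; 'y' counts as a vowel
    // only when the letter before it is a consonant. Assumes lower-case input.
    static boolean isConsonant(String w, int i) {
        switch (w.charAt(i)) {
            case 'a': case 'e': case 'i': case 'o': case 'u':
                return false;
            case 'y':
                return i == 0 || !isConsonant(w, i - 1);
            default:
                return true;
        }
    }

    // Measure m of a stem written as [C](VC)^m[V]: count each place where
    // a consonant directly follows a vowel.
    static int measure(String stem) {
        int m = 0;
        boolean prevVowel = false;
        for (int i = 0; i < stem.length(); i++) {
            boolean consonant = isConsonant(stem, i);
            if (consonant && prevVowel) {
                m++;
            }
            prevVowel = !consonant;
        }
        return m;
    }

    // Hypothetical helper covering only the "-er" rule:
    // strip "er" when the remaining stem has m > 1, otherwise keep the word.
    static String stripEr(String word) {
        if (word.endsWith("er")) {
            String stem = word.substring(0, word.length() - 2);
            if (measure(stem) > 1) {
                return stem;
            }
        }
        return word;
    }

    public static void main(String[] args) {
        for (String w : new String[] {"printer", "airliner"}) {
            String stem = w.substring(0, w.length() - 2);
            System.out.println(w + ": m(" + stem + ") = " + measure(stem)
                    + ", result = " + stripEr(w));
        }
    }
}
```

With this rule 'airliner' loses its suffix (the stem 'airlin' has m = 2) while 'printer' does not, which matches what George saw in his standalone test. If 'print' and 'printer' need to match at search time, that probably has to come from something other than Porter's rules, for example a synonym or dictionary-based approach.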