Hi David

I like KStem more than Porter / Snowball - but still has limitations
although performs better as it has a dictionary to augment the rules.

Note that KStem will also treat "print" and "printer" as two distinct terms,
probably treating it as verb and noun respectively.

Cheers

Pete Lewis

----- Original Message ----- 
From: "David Spencer" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Tuesday, September 14, 2004 7:19 PM
Subject: Re: PorterStemfilter


> Honey George wrote:
>
> > Hi,
> >  This might be more of a questing related to the
> > PorterStemmer algorithm rather than with lucene, but
> > if anyone has the knowledge please share.
>
> You might want to also try the Snowball stemmer:
>
> http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/
>
> And KStem:
>
> http://ciir.cs.umass.edu/downloads/
> >
> > I am using the PorterStemFilter that some with lucene
> > and it turns out that searching for the word 'printer'
> > does not return a document containing the text
> > 'print'. To narrow down the problem, I have tested the
> > PorterStemFilter in a standalone programs and it turns
> > out that the stem of printer is 'printer' and not
> > 'print'. That is 'printer' is not equal to 'print' +
> > 'er', the whole of the word is stem. Can somebody
> > explain the behavior.
> >
> > Thanks & Regards,
> >    George
> >
> >
> >
> >
> >
> > ___________________________________________________________ALL-NEW
Yahoo! Messenger - all new features - even more fun!
http://uk.messenger.yahoo.com
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to