About stemmers you can read on http://snowball.tartarus.org/
-Original Message-
From: Koji Sekiguchi [mailto:[EMAIL PROTECTED]
Sent: Monday, November 21, 2005 2:37 AM
To: java-user@lucene.apache.org
Subject: RE: What is stemming?
Gekkokid, Daniel, Giovanni,
Thank you very much for
Gekkokid, Daniel, Giovanni,
Thank you very much for your explanation.
Now I'm very clear!
Thank you again,
Koji
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
[Afaik] Lucene stemming is based on Snowball (http://snowball.tartarus.org/)
and snowball is an implementation of Porter's algorithm (
http://www.tartarus.org/~martin/PorterStemmer/) so, if I'm not wrong, you
should refer to them.
On Sonntag 20 November 2005 16:48, Koji Sekiguchi wrote:
> Could someone explain what "stemming" is?
Stemming usually means to cut off characters from the end of the word, e.g.
walked -> walk, walking -> walk. However, this does not necessarily
produce a real word, e.g. a stemmer could also cha
m a stemmed index/database.
I hope that helps. Do a search for "porter stemmer" for more information.
_gk
----- Original Message -
From: "Koji Sekiguchi" <[EMAIL PROTECTED]>
To:
Sent: Sunday, November 20, 2005 3:48 PM
Subject: What is stemming?
Hell
Hello, Luceners!
What is "stemming"?
I have Lucene in Action and found the following definitions on page 103:
- reducing words to a root form (stemming)
- changing words into the basic form (lemmatization)
but I cannot see the difference between them.
I'm also confused by the