On Jun 29, 2005, at 4:03 AM, Paul Libbrecht wrote:
Le 29 juin 05, à 00:57, Erik Hatcher a écrit :
Paul - if stemming is what you're looking for, then grab the
SnowballAnalyzer code from Subversion under contrib/snowball. Or
you could get a binary copy of the JAR from the source code
distribution of Lucene in Action at http://www.lucenebook.com (and
see examples of its use in the sample code there). Lucene's JAR
does have a built-in PorterStemFilter, but none of the built-in
analyzers use it, and the SnowballAnalyzer is more commonly used
anyway.
What is the differences between, PorterStemFilter, GermanFilter and
the filters in SnowballAnalyzers ?
GermanStemFilter (not GermanFilter) is specific to German.
PorterStemFilter is for English. SnowballAnalyzer is an engine that
runs any of the available stemmers (there is German, a few different
English ones, and several other language-specific stemmers).
Your best bet would be to give each of them a try on some sample text
and see which one fits best for your needs.
Erik
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]