Italian stemmer (from SnowballAnalyzer) does not work
-----------------------------------------------------
Key: LUCENENET-459
URL: https://issues.apache.org/jira/browse/LUCENENET-459
Project: Lucene.Net
Issue Type: Bug
Components: Lucene.Net Contrib
Affects Versions: Lucene.Net 2.9.2, Lucene.Net 2.9.4
Reporter: Santiago M. Mola
Italian stemmer does not work.
Consider this code:
var englishAnalyzer = new SnowballAnalyzer("English");
var tk = englishAnalyzer.TokenStream("text", new
StringReader("horses"));
var ta =
(TermAttribute)tk.GetAttribute(typeof(TermAttribute));
tk.IncrementToken();
Console.WriteLine("English stemmer: horses -> " +
ta.Term());
var italianAnalyzer = new SnowballAnalyzer("Italian");
tk = italianAnalyzer.TokenStream("text", new
StringReader("abbandonata"));
ta =
(TermAttribute)tk.GetAttribute(typeof(TermAttribute));
tk.IncrementToken();
Console.WriteLine("Italian stemmer: abbandonata -> " +
ta.Term());
It outputs:
English stemmer: horses -> hors
Italian stemmer: abbandonata -> abbandonata
While Java Lucene 2.9.4 outputs:
English stemmer: horses -> hors
Italian stemmer: abbandonata -> abbandon
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira