[
https://issues.apache.org/jira/browse/SOLR-2769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114071#comment-13114071
]
Robert Muir commented on SOLR-2769:
-----------------------------------
I think we should be more cautious about recommending Hunspell on the wiki here,
for these reasons:
* The algorithm relies entirely on the quality of the dictionary. For many of
these languages the dictionary is not good for this purpose: no affix rules,
just a list of words, etc.
* Even where a particular dictionary is pretty good, there are a number of
problems: the primary use case of these dictionaries is spellchecking, and that
doesn't necessarily imply that the rules+affix combinations yield good results
for stemming.
* Finally, there are the usual problems of any dictionary-based technique:
languages are not static, and there is absolutely no handling for OOV
(out-of-vocabulary) words.
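
To make the first and last points concrete, here is a toy sketch of a
dictionary-plus-suffix stemmer. It is not Hunspell's algorithm or the Lucene
implementation; the word list, the suffix rules, and the name dictionary_stem
are all made up for illustration. It only shows how a technique of this kind
degrades when the affix rules are missing or the word is not in the dictionary.

```python
# Toy illustration only: NOT Hunspell's algorithm or Lucene's implementation.
# The word list and suffix rules are hypothetical stand-ins for .dic/.aff data.

DICTIONARY = {"walk", "talk", "index"}
SUFFIX_RULES = ["ing", "ed", "s", "es"]


def dictionary_stem(word, dictionary=DICTIONARY, suffixes=SUFFIX_RULES):
    """Return every dictionary entry reachable from `word`, or [] if none."""
    token = word.lower()
    stems = []
    # The surface form itself may already be a dictionary entry.
    if token in dictionary:
        stems.append(token)
    # Strip each known suffix and keep the remainder only if the dictionary
    # confirms it; nothing is ever guessed for unknown words.
    for suffix in suffixes:
        if token.endswith(suffix) and len(token) > len(suffix):
            candidate = token[: -len(suffix)]
            if candidate in dictionary and candidate not in stems:
                stems.append(candidate)
    return stems


if __name__ == "__main__":
    print(dictionary_stem("walking"))               # known word, known suffix
    print(dictionary_stem("walking", suffixes=[]))  # word list only, no affixes
    print(dictionary_stem("googling"))              # out-of-vocabulary word
```

With a word list and no suffix rules, inflected forms are never reduced, and a
word missing from the dictionary yields no stem at all, whereas an algorithmic
stemmer would still produce something for it.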
> HunspellStemFilterFactory
> -------------------------
>
> Key: SOLR-2769
> URL: https://issues.apache.org/jira/browse/SOLR-2769
> Project: Solr
> Issue Type: New Feature
> Components: Schema and Analysis
> Reporter: Jan Høydahl
> Labels: stemming
> Fix For: 3.5, 4.0
>
> Attachments: SOLR-2769-branch_3x.patch, SOLR-2769-branch_3x.patch,
> SOLR-2769.patch, SOLR-2769.patch, SOLR-2769.patch, SOLR-2769.patch
>
>
> Now that Hunspell stemmer is added to Lucene (LUCENE-3414), let's make a
> Factory for it in Solr