[ 
https://issues.apache.org/jira/browse/SOLR-2769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114071#comment-13114071
 ] 

Robert Muir commented on SOLR-2769:
-----------------------------------

I think we should be more cautious on recommending Hunspell on the wiki here, 
for these reasons:
* The algorithm relies entirely on the quality of the dictionary, for many of 
these languages the dictionary is not good for this purpose: no affix rules, 
just a list of words, etc
* Even in the case where a particular dictionary is pretty good, there are a 
number of problems: the primary use case of these dictionaries is spellchecking 
and that doesn't necessarily imply that the rules+affix combinations yield good 
results here.
* Finally, the usual problems of having a dictionary-based technique, languages 
are not static and there absolutely no  handling for OOV words.

> HunspellStemFilterFactory
> -------------------------
>
>                 Key: SOLR-2769
>                 URL: https://issues.apache.org/jira/browse/SOLR-2769
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Jan Høydahl
>              Labels: stemming
>             Fix For: 3.5, 4.0
>
>         Attachments: SOLR-2769-branch_3x.patch, SOLR-2769-branch_3x.patch, 
> SOLR-2769.patch, SOLR-2769.patch, SOLR-2769.patch, SOLR-2769.patch
>
>
> Now that Hunspell stemmer is added to Lucene (LUCENE-3414), let's make a 
> Factory for it in Solr

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to