[ 
https://issues.apache.org/jira/browse/LUCENE-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376853#comment-15376853
 ] 

Robert Muir commented on LUCENE-7378:
-------------------------------------

For the Hungarian language, hunspell has a lot of internal special-case logic. 
We don't implement any of that logic, or even decompounding at all.
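
To make the difference concrete, here is a toy sketch in Python. It is not Lucene or hunspell code: the word list, the strippable suffix, and the `filtered_stems` rule are all invented for illustration and are not taken from hu_HU.dic or hu_HU.aff. It only shows how a stemmer that returns every affix-stripped candidate can report more stems than one that applies an extra filtering step afterwards.

```python
# Toy illustration only. The word list and suffix rule are hypothetical,
# not taken from hu_HU.dic / hu_HU.aff.
DICTIONARY = {"kakaó", "kaka"}
SUFFIXES = ["ó"]  # hypothetical strippable suffix


def naive_stems(word):
    """Return every dictionary entry reachable by stripping at most one suffix."""
    stems = []
    if word in DICTIONARY:
        stems.append(word)
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            candidate = word[: -len(suffix)]
            if candidate in DICTIONARY and candidate not in stems:
                stems.append(candidate)
    return stems


def filtered_stems(word):
    """Stand-in for extra filtering: keep only the exact dictionary form
    when one exists. This is an assumed rule, not hunspell's actual logic."""
    stems = naive_stems(word)
    return [word] if word in stems else stems


print(naive_stems("kakaó"))     # ['kakaó', 'kaka']
print(filtered_stems("kakaó"))  # ['kakaó']
```

The first call mirrors the reported HunspellStemFilter output and the second mirrors the reported command-line output, but the real reason for the difference may be something else entirely.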

> HunspellStemFilter returns different values than Hunspell on the console / 
> command line with the same dictionaries.
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-7378
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7378
>             Project: Lucene - Core
>          Issue Type: Sub-task
>          Components: core/other
>    Affects Versions: 5.4.1
>         Environment: Apache Solr 5.4.1
>            Reporter: Barta Tamás
>         Attachments: hu_HU.aff, hu_HU.dic
>
>
> HunspellStemFilter for the Hungarian language returns different results than 
> the hunspell command for the same dictionary.
> - input text is "kakaó"
> - HunspellStemFilter results are "kakaó" and "kaka"
> - "kaka" is wrong, it shouldn't be returned
> - hunspell command returns "kakaó" only.


