[ https://issues.apache.org/jira/browse/LUCENE-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376853#comment-15376853 ]
Robert Muir commented on LUCENE-7378:
-------------------------------------
For the Hungarian language, hunspell has a lot of internal, language-specific special handling.
We don't implement any of that logic, or even decompounding at all.
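
One way to picture how an extra stem can show up is a stemmer that applies affix rules generically and accepts every candidate found in the dictionary. The following is only a toy sketch of that idea, not Lucene's or hunspell's actual algorithm. The dictionary entries and the single suffix rule are hypothetical and are not taken from the attached hu_HU.aff / hu_HU.dic; which real rule produces "kaka" is not stated in this issue.

```python
# Toy model only: the entries and the suffix rule below are hypothetical
# and are NOT taken from hu_HU.aff / hu_HU.dic.
DICTIONARY = {"kakaó", "kaka"}
SUFFIX_RULES = [("ó", "")]  # (suffix to strip, text to append in its place)


def candidate_stems(word):
    """Return the word itself (if listed) plus every dictionary entry
    reachable by applying one suffix rule, with no further filtering."""
    stems = []
    if word in DICTIONARY:
        stems.append(word)
    for strip, add in SUFFIX_RULES:
        if strip and word.endswith(strip):
            candidate = word[: -len(strip)] + add
            if candidate in DICTIONARY and candidate not in stems:
                stems.append(candidate)
    return stems


print(candidate_stems("kakaó"))  # prints ['kakaó', 'kaka']
```

Under these assumptions both candidates are returned, because nothing prunes the second one. A stemmer with additional language-specific checks could reject it and return only the first.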
> HunspellStemFilter returns different values than Hunspell in console / command
> line with the same dictionaries.
> ---------------------------------------------------------------------------------------------------------
>
> Key: LUCENE-7378
> URL: https://issues.apache.org/jira/browse/LUCENE-7378
> Project: Lucene - Core
> Issue Type: Sub-task
> Components: core/other
> Affects Versions: 5.4.1
> Environment: Apache Solr 5.4.1
> Reporter: Barta Tamás
> Attachments: hu_HU.aff, hu_HU.dic
>
>
> HunspellStemFilter for the Hungarian language returns different results than
> the hunspell command for the same dictionary.
> - input text is "kakaó"
> - HunspellStemFilter results are "kakaó" and "kaka"
> - "kaka" is wrong, it shouldn't be returned
> - hunspell command returns "kakaó" only.