[ 
https://issues.apache.org/jira/browse/LUCENE-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260490#comment-13260490
 ] 

Luca Cavanna commented on LUCENE-3976:
--------------------------------------

We found out that some recent dutch dictionaries contain rule like the one 
mentioned (Starting from version 2.00 if I'm correct). I'm going to look at 
that specific problem and see how we can parse those affix rules.
                
> Improve error messages for unsupported Hunspell formats
> -------------------------------------------------------
>
>                 Key: LUCENE-3976
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3976
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>            Reporter: Chris Male
>         Attachments: LUCENE-3976.patch
>
>
> Our hunspell implementation is never going to be able to support the huge 
> variety of formats that are out there, especially since our impl is based on 
> papers written on the topic rather than being a pure port.
> Recently we ran into the following suffix rule:
> {noformat}SFX CA 0 /CaCp{noformat}
> Due to the missing regex conditional, an AOE was being thrown, which made it 
> difficult to diagnose the problem.
> We should instead try to provide better error messages showing what we were 
> unable to parse.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to