[ 
https://issues.apache.org/jira/browse/LUCENE-3414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Male updated LUCENE-3414:
-------------------------------

    Attachment: LUCENE-3414.patch

Patch with a port of the code.

Because most of the dictionaries are licensed under the L/GPL, I've written my 
own dumb dictionary for test purposes.

During testing I discovered a long-standing bug in the recursive 
application of rules. This has now been fixed.

The code is now also version-aware, as required by the CharArray* data structures.

> Bring Hunspell for Lucene into analysis module
> ----------------------------------------------
>
>                 Key: LUCENE-3414
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3414
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Chris Male
>         Attachments: LUCENE-3414.patch
>
>
> Some time ago, I, along with Robert and Uwe, wrote a stemmer that uses the 
> Hunspell algorithm.  It has the benefit of supporting dictionaries for a wide 
> array of languages.
> It still seems to be in use but has fallen out of date.  I think it would 
> benefit from being inside the analysis module, where additional features such 
> as decompounding support could be added.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
