[ 
https://issues.apache.org/jira/browse/CODEC-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matthew Pocock updated CODEC-125:
---------------------------------

    Attachment: performanceAndBugs.patch

This patch addresses the following:

* Performance issues: The guts now use some custom CharSequence implementations 
and TreeSet, avoiding a lot of cost from repeatedly making strings and 
performing an expensive sort prior to stringification

* Encoding hyphenated and prefixed strings: Hyphonated words and words starting 
with prefixes e.g. d' now encode correctly

* Malformed rules: I found some rules that where malformed in 
ash_approx_common.txt - I have fixed these. I think this particular file is 
free of fancy chars so should patch

> Implement a Beider-Morse phonetic matching codec
> ------------------------------------------------
>
>                 Key: CODEC-125
>                 URL: https://issues.apache.org/jira/browse/CODEC-125
>             Project: Commons Codec
>          Issue Type: New Feature
>            Reporter: Matthew Pocock
>            Priority: Minor
>         Attachments: acz.patch, bm-gg.diff, bmpm.patch, bmpm.patch, 
> bmpm.patch, bmpm.patch, bmpm.patch, bmpm.patch, bmpm.patch, bmpm.patch, 
> handleH.patch, majorFix.patch, performanceAndBugs.patch, testEncodeGna.patch
>
>
> I have implemented Beider Morse Phonetic Matching as a codec against the 
> commons-codec svn trunk. I would like to contribute this to commons-codec.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to