I have made some modifications to
org.apache.commons.codec.language.DoubleMetaphone in order to support
the three additional Norwegian and Danish vowels. The current
implementation at Jakarta does not provide any methods to specify the
language of the input text.
Is it all right to modify DoubleMetaphone to support the Scandinavian
vowels (Swedish, Danish and Norwegian) and possibly other languages
or have I completely misunderstood the idea behind the double
metaphone algorithm? That is, should double metaphone detect various
language constructs automatically or is it perhaps a better idea to
have a factory which returns a double metaphone implementation
appropriate for the language?
Any suggestions?
I would like to contribute any changes back to Jakarta commons-codec,
of course.
Steinar Cook
[EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]