[ http://issues.apache.org/jira/browse/LANG-285?page=comments#action_12442931 ] Guillaume Coté commented on LANG-285: -------------------------------------
Reagarding Stephen Colebourne post : I had a look at the Caracter class in JDK 1.5, I found nothing to identify accented caracters. Regarding Aldrin Leal post : Thanks for your submission. How ever you don't seem to covert all ISO8859-1, for exemple you seem to be missing Å . I am working on patch with full test case. I expect to submit a first version in one week. Reagarding Gary Gregory post : As I understand it, the class sun.text.Normalizer doesn't unaccent a String, it only replace single caracter accent by double caracter so you could remove them. That approch required you to do two pass over a String. I would prefer ro do it in one pass. > Wish : method unaccent > ---------------------- > > Key: LANG-285 > URL: http://issues.apache.org/jira/browse/LANG-285 > Project: Commons Lang > Issue Type: New Feature > Reporter: Guillaume Coté > Priority: Minor > > I would like to add a method that replace accented caracter by unaccented > one. For example, with the input String "L'été où j'ai dû aller à l'île > d'Anticosti commenca tôt", the method would return "L'ete ou j'ai du aller à > l'ile d'Anticosti commenca tot". > I suggest to call that method unaccent and to add it in StringUtils. > If we cannot covert all case, the first version could only covert iso-8859-1. > If you are willing to go forward with that idea, I am willing to contribute a > patch. -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]