On Fri, Nov 1, 2013 at 4:37 PM, Jennifer Wong <jennifer.w...@workday.com> wrote: > Hi, > > I would like to ask for advice on removing accents from characters. While > the normalization process is straight forward (NFD, remove accents), it does > not take into account of special cases. For example, Danish, "å" should be > mapped to "aa", not "a". Likewise, in German, "ä" "ö" "ü" should be mapped > to "ae", "oe" and "ue" respectively, not "a", "e", "u". Are there common > practices on how to handle these special cases? Thank you.
As Jukka K. Korpela said, it depends from the language. In Italian, for example there are mandatory and optional accents. One can just drop the optional accents, but if the mandatory ones are dropped then what remains is not correct Italian anymore Cheers P.