Asmus, are you including the case where an accented character maps to two 
unaccented characters?

E.g. Å to AA or Ä to AE.
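
To make the distinction concrete, here is a minimal sketch of the two cases, assuming Python's standard unicodedata module. The EXPANSIONS table and both function names are hypothetical, and the two-letter mappings are language-specific conventions rather than anything defined by Unicode data.

```python
import unicodedata

# Hypothetical, illustrative table. These expansions are language-specific
# spelling conventions (e.g. Å -> AA, Ä -> AE), not Unicode properties.
EXPANSIONS = {"Å": "AA", "å": "aa", "Ä": "AE", "ä": "ae"}


def strip_marks(text):
    """Decompose to NFD and drop combining marks.

    Each accented letter maps to exactly one unaccented base letter.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def expand_then_strip(text, table=EXPANSIONS):
    """Apply two-letter expansions first, then strip any remaining marks."""
    # Compose first so the table lookup sees precomposed characters.
    composed = unicodedata.normalize("NFC", text)
    expanded = "".join(table.get(ch, ch) for ch in composed)
    return strip_marks(expanded)
```

With this sketch, strip_marks("São Tomé") gives "Sao Tome" and strip_marks("Å") gives "A", while expand_then_strip("Å") gives "AA". Simple mark removal cannot produce the second result on its own, because the extra letter comes from a spelling convention and not from the character's decomposition.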

 

From: Unicode [mailto:[email protected]] On Behalf Of Asmus Freytag 
(c) via Unicode
Sent: Wednesday, July 17, 2019 11:07 AM
To: Norbert Lindenberg
Cc: Unicode Mailing List
Subject: Re: Removing accents and diacritics from a word

 

On 7/17/2019 11:02 AM, Norbert Lindenberg wrote:

“Misspelling”?

Not helpful. Anybody have a serious suggestion?

A./

On Jul 17, 2019, at 10:37, Asmus Freytag via Unicode <[email protected]> wrote:
 
A question has come up in another context:
 
Is there any linguistic term for describing the process of removing accents and 
diacritics from a word to create its “base form”, e.g. São Tomé to Sao Tome?
 
The linguistic term "string normalization" appears not that preferable in a 
computing context.
 
Any ideas?
 
A./