Re: The Case Against Autodecode

Marc Schütz via Digitalmars-d Fri, 13 May 2016 03:51:41 -0700

On Thursday, 12 May 2016 at 23:16:23 UTC, H. S. Teoh wrote:

Therefore, autodecoding actually only produces intuitivelycorrect results when your string has a 1-to-1 correspondencebetween grapheme and code point. In general, this is only truefor a small subset of languages, mainly a few common Europeanlanguages and a handful of others. It doesn't work for Korean,and doesn't work for any language that uses combiningdiacritics or other modifiers. You need byGrapheme to have thecorrect results.

In fact, even most European languages are affected if NFDnormalization is used, which is the default on MacOS X.

And this is actually the main problem with it: It was introducedto make unicode string handling correct. Well, it doesn't,therefore it has no justification.

Re: The Case Against Autodecode

Reply via email to