On 4/8/2025 11:07 AM, Markus Scherer wrote:
Encoding characters that look the same but behave differently is a bad
idea.
^^^ That.
We have tried this, for example with letter-behavior clones of some of
the typographic quotes (U+02BB, U+02BC). People use them
inconsistently, because they can't tell the difference while typing or
reading. The result is a string of problems: having to treat both as
equivalent in some places, text search, spoofing, "why does it say I
am using an invalid character?", etc.
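To make the difference concrete, here is a small Python sketch using only the standard unicodedata module. It pairs the two modifier letters above with the typographic quotes I take to be their look-alikes (U+2018, U+2019; that pairing is my assumption, the message does not name them). The helper names are made up for illustration.

```python
import unicodedata

# Assumed look-alike pairs: (modifier letter, typographic quote).
PAIRS = [("\u02BB", "\u2018"), ("\u02BC", "\u2019")]

def describe(ch):
    """Return (code point, character name, General_Category) for ch."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))

def same_after_nfkc(a, b):
    """True if compatibility normalization makes the two strings equal."""
    return unicodedata.normalize("NFKC", a) == unicodedata.normalize("NFKC", b)

if __name__ == "__main__":
    for letter, quote in PAIRS:
        print(describe(letter))
        print(describe(quote))
        # Letters count as alphabetic, quotes do not, so word-level
        # processing treats the two differently.
        print("isalpha:", letter.isalpha(), quote.isalpha())
        # Normalization does not unify the pair, so a plain string
        # search for one will miss the other.
        print("equal after NFKC:", same_after_nfkc(letter, quote))
```

The modifier letters come out as General_Category Lm (letters) and the quotes as Pi/Pf (punctuation), and normalization leaves them distinct. Anything that wants to match them "equally" has to fold them explicitly.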
Unicode also has some magic invisible control characters that were
supposed to change the behavior of affected characters in ways that
violated their identity. These control codes are Deprecated with
prejudice.
markus