On Nov 11, 2019, at 1:49 PM, Jose Isaias Cabrera <[email protected]> wrote:
>
> If there is a combination, is just like the accented e, é, why not use the
> one character vs the combination?
Big “if.” There isn’t always a pre-composed character.
Typically, pre-composed characters exist in Unicode for compatibility with
legacy encodings so that you can have lossless mappings from e.g. ISO 8859-1 to
Unicode and back. In an ideal world, Unicode would have no pre-composed
characters, only base characters and accents.
That is, in fact, the way the macOS native file systems HFS+ and APFS handle
Unicode in file names. It’s called Normalization Form D: input is decomposed
and stored that way, always. It’s done to ensure that sorting happens
predictably.
See:
https://www.unicode.org/standard/where/#Duplicates
https://www.unicode.org/reports/tr15/
_______________________________________________
sqlite-users mailing list
[email protected]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users