On Sunday, 3 July 2022 at 18:33:29 UTC, rikki cattermole wrote:
On 04/07/2022 6:10 AM, Ola Fosheim Grøstad wrote:
People who are willing to use 4 bytes per code point are
probably using third party C-libraries that have their own
representation, so you have to convert anyway?
If you use Unicode and follow their recommendations, you are
going to be using dstrings at some point.
I hardly ever use anything outside UTF-8, and if I do then I use
a well tested unicode library as it has to be correct and up to
date to be useful. The utility of going beyond UTF-8 seems to be
limited:
https://en.wikipedia.org/wiki/UTF-32#Analysis