On 2025-07-24 09:36, Corinna Vinschen via Cygwin wrote:
On Jul 24 09:28, Brian Inglis via Cygwin wrote:
On 2025-07-24 04:30, Corinna Vinschen via Cygwin wrote:
Or shall simply go along with CESU-8 when converting back to multibyte
to keep the string the same as with wcstombs?
There are 15 * SMP as BMP characters, so many non-Western and emoji
characters will be expanded from 4 UTF-8 bytes to 6 CESU-8 bytes, and this
is not supported anywhere as a string representation, designed for internal
use only per the TR.
We're only talking about invalid sequences, not using CESU-8 throughout.
That was not clear, so then as they say "go along to get along"? ;^>
--
Take care. Thanks, Brian Inglis Calgary, Alberta, Canada
La perfection est atteinte Perfection is achieved
non pas lorsqu'il n'y a plus rien à ajouter not when there is no more to add
mais lorsqu'il n'y a plus rien à retrancher but when there is no more to cut
-- Antoine de Saint-Exupéry
--
Problem reports: https://cygwin.com/problems.html
FAQ: https://cygwin.com/faq/
Documentation: https://cygwin.com/docs.html
Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple