Thanks, all! That's a relief to know. Six bytes always seemed too long,
but my reptile coder brain was also reptile-coder-lazy and I never dug
into it.
/be
Phillips, Addison wrote:
Hi Mark, thanks for this post.
Mark Davis ☕ wrote:
UTF-8 represents a code point as 1-4 8-bit code units
"1-6".
No. 1 to *4*. Five- and six-byte "UTF-8" sequences are illegal and invalid.
UTF-16 represents a code point as 2 or 4 16-bit code units
"1 or 2".
Yes, 1 or 2 16-bit code units (that's 2 or 4 bytes, of course).
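
For anyone who wants to check these counts themselves, here is a minimal
sketch in Python. The helper name code_unit_counts is made up for this
illustration, and it assumes the input is a single code point that is not
a lone surrogate. It relies only on the standard str.encode method.

```python
def code_unit_counts(ch):
    """Return (UTF-8 code units, UTF-16 code units) for one code point.

    Hypothetical helper for illustration; assumes `ch` is a single
    code point that is not a lone surrogate.
    """
    # UTF-8 code units are 8 bits wide, so the byte count is the unit count.
    utf8_units = len(ch.encode("utf-8"))
    # UTF-16 code units are 16 bits wide, so divide the byte count by 2.
    # "utf-16-le" is used so that no byte order mark is prepended.
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_units, utf16_units


# One sample from each UTF-8 length class, up to the last code point.
for ch in ["A", "\u00e9", "\u20ac", "\U0001F600", "\U0010FFFF"]:
    print(f"U+{ord(ch):04X}", code_unit_counts(ch))
```

Even U+10FFFF, the highest code point, needs only 4 UTF-8 code units and
2 UTF-16 code units.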
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.