Thanks, all! That's a relief to know, six bytes always seemed to long but my reptile coder brain was also reptile-coder-lazy and I never dug into it.

/be

Phillips, Addison wrote:
Hi Mark, thanks for this post.

Mark Davis ☕ wrote:
UTF-8 represents a code point as 1-4 8-bit code units
"1-6".

No. 1 to *4*. Five and six byte "UTF-8" sequences are illegal and invalid.

UTF-16 represents a code point  as 2 or 4 16-bit code units
"1 or 2".

Yes, 1 or 2 16-bit code units (that's 2 or 4 bytes, of course).

Addison

Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)

Internationalization is not a feature.
It is an architecture.




_______________________________________________
es-discuss mailing list
es-discuss@mozilla.org
https://mail.mozilla.org/listinfo/es-discuss

Reply via email to