On Tue, Feb 21, 2012 at 3:11 PM, Brendan Eich <bren...@mozilla.com> wrote: > Hi Mark, thanks for this post. > Mark Davis ☕ wrote: >> >> UTF-8 represents a code point as 1-4 8-bit code units > > "1-6". ... > Lock up your encoders, I am so not a Unicode guru but this is what my > reptile coder brain remembers.
Only theoretically. UTF-8 has been locked down to the same range that UTF-16 has (RFC 3629), so the largest real character you'll see is 4 bytes, as that gives you exactly 21 bits of data. ~TJ _______________________________________________ es-discuss mailing list es-discuss@mozilla.org https://mail.mozilla.org/listinfo/es-discuss