Andrew Oakley wrote:
> Issues only arise in code that tries to treat a string as an array of
> 16-bit integers, and I don't think we should be particularly bothered by
> performance of code which misuses strings in this fashion (but clearly
> this should still work without opt-in to new string handling).

This is all strings in JS and the DOM, today.

That is, we do not have any measure of how much code treats strings as arrays of uint16s, forges strings using "\uXXXX" escapes, etc., but the ES and DOM specs have allowed such code for more than 14 years. Based on bitter experience, it's likely that if we switch by fiat from 16-bit code units to 21-bit code points, some code on the Web will break.
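
To be concrete, all of this is legal ES5 today (nothing hypothetical in it), and it's exactly the kind of code we can't measure:

    // Forge a non-BMP "character" out of two 16-bit code units.
    var s = "\uD83D\uDE00";                    // U+1F600 spelled as two \uXXXX escapes
    s.charCodeAt(0) === 0xD83D;                // true -- the high surrogate is directly observable
    s === String.fromCharCode(0xD83D, 0xDE00); // true -- round-trips as 16-bit code units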

And as noted in the o.p. and in the thread on Allen's proposal last year, browser implementations definitely count on strings being represented as arrays of 16-bit integers, with the length property and related methods counting those same 16-bit units.
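
To make the difference visible (countCodePoints here is an illustrative helper only, not anything proposed):

    // What implementations count today vs. what a code-point count would report.
    function countCodePoints(s) {
      var n = 0;
      for (var i = 0; i < s.length; i++) {
        var c = s.charCodeAt(i);
        // If this is the high half of a surrogate pair, skip the low half
        // so the pair counts as one code point.
        if (c >= 0xD800 && c <= 0xDBFF &&
            i + 1 < s.length &&
            s.charCodeAt(i + 1) >= 0xDC00 && s.charCodeAt(i + 1) <= 0xDFFF) {
          i++;
        }
        n++;
      }
      return n;
    }

    var s = String.fromCharCode(0xD83D, 0xDE00); // one non-BMP character
    s.length;           // 2 today -- 16-bit code units
    countCodePoints(s); // 1 -- what a "full Unicode" length would say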

Breaking the Web is off the table. Breaking implementations, less so. I'm not sure why you bring up UTF-8. It's good for encoding and decoding, but for JS, unlike C, we want strings to be a high-level "full Unicode" abstraction, not bytes with bits optionally set to indicate that more bytes follow to spell a code point.

> I think this is a nicer and more flexible model than string
> representations being dependent on which heap they came from - all
> issues related to encoding can be contained in the String object
> implementation.

You're ignoring the compatibility break here. Browser vendors can't afford to ignore it.

> While this is being discussed, for any new string handling I think we
> should make any invalid strings (according to the rules in Unicode)
> cause some kind of exception on creation.

This is future-hostile if done for all code points. If done only for the surrogate code points in [U+D800, U+DFFF], both for literals using "\u{...}" and for constructive methods such as String.fromCharCode, then I agree.
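
Roughly this check, sketched in current JS just to pin down the range (a sketch, not a spec draft):

    // Reject only lone surrogates; accept everything else up to U+10FFFF.
    function checkCodePoint(cp) {
      if (cp >= 0xD800 && cp <= 0xDFFF) {
        throw new RangeError("lone surrogate U+" + cp.toString(16).toUpperCase());
      }
      if (cp > 0x10FFFF) {
        throw new RangeError("code point out of range");
      }
      return cp;
    }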

/be