RE: Internationalization: Support for IANA time zones

2013-03-04 Thread Shawn Steele
fallback though :) -Shawn -Original Message- From: Norbert Lindenberg [mailto:ecmascr...@lindenbergsoftware.com] Sent: Saturday, March 2, 2013 11:09 AM To: Shawn Steele Cc: Norbert Lindenberg; Dean Landolt; Mark Davis ☕; Eric Albright; es-discuss; Phillips, Addison Subject: Re

RE: Internationalization: Support for IANA time zones

2013-03-04 Thread Shawn Steele
I still think the stability issue should be addressed in the IANA time zone database itself, not by adopting a IANA-derived alternate registry. Has that been tried? I agree, we should be chatting with the IANA folks. -Shawn ___ es-discuss

RE: Internationalization: Support for IANA time zones

2013-03-02 Thread Shawn Steele
Re: CLDR names being “better” than tzdb names. I’m uncomfortable using the CLDR names, although perhaps they could be aliases, because other standards use the tzdb names and we have to be able to look up the tzdb names. It might be nice to get more stability for the tzdb names, like aliases

RE: Internationalization: Support for IANA time zones

2013-03-01 Thread Shawn Steele
Can we work with iana to get the names stabilized? Sent from my Windows Phone 8 From: Norbert Lindenbergmailto:ecmascr...@lindenbergsoftware.com Sent: ‎3/‎1/‎2013 12:41 PM To: Mark Davis ☕mailto:m...@macchiato.com Cc: Shawn Steelemailto:shawn.ste...@microsoft.com;

RE: Internationalization: Support for IANA time zones

2013-02-28 Thread Shawn Steele
For #5 I might prefer falling back to English or something. I don't think UTC offset is a good idea because that doesn't really represent a Timezone very well. (If a meeting gets moved to a following week, that offset might change or be wrong) -Shawn -Original Message- From:

RE: String.fromCodePoint and surrogate pairs?

2013-01-14 Thread Shawn Steele
things the developer thinks works. Assuming a use case for illegal unicode were ever found, it could be added later. -Shawn -Original Message- From: Norbert Lindenberg [mailto:ecmascr...@norbertlindenberg.com] Sent: Monday, January 14, 2013 4:35 PM To: Shawn Steele Cc: Norbert

RE: String.fromCodePoint and surrogate pairs?

2012-12-12 Thread Shawn Steele
IMO String.fromCodePoint should disallow U+D800-U+DFFF. There's already fromCharCode that does that, and a according to The Unicode Standard, isolated surrogates have no meaning on their own and goes on to compare them to illegal UTF-8 sequences. IMO CodePoint is a 21 Unicode code point, and

RE: String.fromCodePoint and surrogate pairs?

2012-12-12 Thread Shawn Steele
...@norbertlindenberg.com] Sent: Wednesday, December 12, 2012 2:40 PM To: Shawn Steele Cc: Norbert Lindenberg; Erik Arvidsson; es-discuss@mozilla.org Subject: Re: String.fromCodePoint and surrogate pairs? The Unicode standard defines code point as any value in the range of integers from 0

RE: `free` operator

2012-10-25 Thread Shawn Steele
I sort of have a fundamental problem with the solution. Eg: If it were actually unused, it'd be GC'd. Since it isn't GC'd, something must be holding a reference to it. So if you force it to be gone, or clear all the properties, or whatever, seems to me that then you'd just start throwing

RE: `free` operator

2012-10-25 Thread Shawn Steele
On the contrary, TypeError: Cannot read property 'prop' of undefined, with a stack trace, is WAY easier to track down than The RSS on my server gradually rises, until the operating system kills it, which happens about every 4 hours. I'm not so sure. Now you know where you're using it that

RE: Globalization API: supportedLocalesOf vs. getSupportedLocales

2011-11-28 Thread Shawn Steele
[mailto:es-discuss-boun...@mozilla.org] On Behalf Of Norbert Lindenberg Sent: Monday, November 28, 2011 5:30 PM To: Eric Albright; Peter Constable; Shawn Steele Cc: es-discuss Subject: Re: Globalization API: supportedLocalesOf vs. getSupportedLocales We invented the supportedLocalesOf method to let

RE: Full Unicode strings strawman

2011-05-19 Thread Shawn Steele
- Java kept their strings encoded exactly as they were (a sequence of 16-bit code units) and provided extra APIs for the cases where you want to extract a code point. Bloaty. ? defining UTF-16 instead of UCS-2 introduces zero bloat. In fact, it pretty much works anyway, it's just not

RE: Full Unicode strings strawman

2011-05-19 Thread Shawn Steele
The crucial win of Allen's proposal comes down the road, when someone in a certain locale *can* do s.indexOf(nonBMPChar) and win. s.indexOf(\U+1), Ok, but \U+... does not work today. Yes, that would be worth adding (IMO) as a convenience, regardless of whether the backend were UTF-16

UTF-16 Strings not-strawman

2011-05-19 Thread Shawn Steele
I don’t have time to make a real strawman, but what would people need if we went the UTF-16 route (instead of full-Unicode)? (This thread is to collect requirements, which are somewhat getting lost in the merits of UTF-16 vs 32 bit thread). Basically, just replace UCS-2 with UTF-16, allowing

RE: Full Unicode strings strawman

2011-05-19 Thread Shawn Steele
There are several sequences in Unicode which are meaningless if you have only one character and not the other. Eg: any of the variation selectors by themselves are meaningless. So if you break a modified character from its variation selector you've damaged the string. That's pretty much

RE: Full Unicode strings strawman

2011-05-19 Thread Shawn Steele
, mayo 19, 2011 3:00 PM To: Shawn Steele Cc: Waldemar Horwat; es-discuss@mozilla.org; Peter Constable Subject: Re: Full Unicode strings strawman On May 19, 2011, at 2:06 PM, Shawn Steele wrote: There are several sequences in Unicode which are meaningless if you have only one character

RE: Full Unicode strings strawman

2011-05-18 Thread Shawn Steele
Hmm... I proposed break iterators for 'character/grapheme', word, line and sentence as a part of i18n API, but it's shot down (at least for version 0.5). Are you open to adding them now ? Once this discussion is settled and the proposal to support the full unicode range is in place, we can

RE: Full Unicode strings strawman

2011-05-17 Thread Shawn Steele
I would much prefer changing UCS-2 to UTF-16, thus formalizing that surrogate pairs are permitted. That'd be very difficult to break any existing code and would still allow representation of everything reasonable in Unicode. That would enable Unicode, and allow extending string literals and

RE: Full Unicode strings strawman

2011-05-17 Thread Shawn Steele
Right - but they are still legitimate code points, and they fill out the space required to let us treat String as uint16[] when defining the backing store as something that maps to the set of all Unicode code points. That said, you can encode these code points with utf-8; for example,

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
Thanks for making a strawman Unicode Escape Sequences Is it possible for U+ to accept either 4, 5, or 6 digit sequences? Typically when I encounter U+ notation the leading zero is omitted, and I see BMP characters quite often. Obviously BMP could use the U notation, however it seems like

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
, 2011 12:53 PM To: Shawn Steele Cc: es-discuss@mozilla.org Subject: Re: Full Unicode strings strawman On May 16, 2011, at 11:34 AM, Shawn Steele wrote: Thanks for making a strawman (see my very last sentence below as it may impact the interpreation of some of the rest of these responses

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
to match, as the return value of [[Get]] would be a code point. Shawn Steele, I don't understand this comment: Also, the “trick” I think, is encoding to surrogate pairs (illegally, since UTF8 doesn’t allow that) vs decoding to UTF16. Why do we care about the UTF-16 representation of particular

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
PM To: Shawn Steele Cc: Jungshik Shin (신정식, 申政湜); Markus Scherer; es-discuss@mozilla.org Subject: Re: Full Unicode strings strawman In terms of implementation capabilities, there isn't really a significant practical difference between * a UCS-2 implementation, and * a UTF-16

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
The problem is that “\UD800\UDC00” === “\U+01”. And if the internal representation is UTF-32, then they’d have to continue to be the same. And it’s really hard for them to have the same length if one’s 2 code points and the other’s 1 code point. -Shawn From:

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
Not in my proposal! \ud800\udc00=== \u+01 is false in my proposal. That’s exactly my problem. I think the engine’s (or at least the applications written in JavaScript) are still UTF-16-centric and that they’ll have d800, dc00 === 1. For example, if they were different, then d800,

RE: Full Unicode strings strawman

2011-05-16 Thread Shawn Steele
I think you'll find that the actual JS engines are currently UCS-2 centric. The surrounding browser environments are doing the UTF-16 interpretation. That why you see instead of �� in browser generated display output. There’s no difference. I wouldn’t call Windows C++ WCHAR “UCS-2”, however

RE: Collation API not complete for search

2011-03-28 Thread Shawn Steele
expect comparison to match that substring. Similarly, if one is using Turkish I, we expect all of them to do so. - Shawn From: Nebojša Ćirić [mailto:c...@google.com] Sent: Monday, March 28, 2011 1:36 PM To: Mark Davis ☕ Cc: es-discuss@mozilla.org; Shawn Steele; Phillips, Addison Subject: Re

RE: Collation API not complete for search

2011-03-28 Thread Shawn Steele
, but, likely, fuzzy and exact ideas are still useful. - Shawn From: Nebojša Ćirić [mailto:c...@google.com] Sent: Monday, March 28, 2011 2:26 PM To: Phillips, Addison Cc: Mark Davis ☕; es-discuss@mozilla.org; Shawn Steele Subject: Re: Collation API not complete for search How do you solve search

RE: Collation API not complete for search

2011-03-28 Thread Shawn Steele
Yes, sort, comparison and in-text search seem like reasonable buckets to me. Although in-text can further be broken into, I think, exact and non-exact cases. - Shawn -Original Message- From: es-discuss-boun...@mozilla.org [mailto:es-discuss-boun...@mozilla.org] On Behalf Of Axel

RE: Summary of i18n ad hoc group meeting - 21st March 2011

2011-03-25 Thread Shawn Steele
I think we meant allowing the -u extensions for, at least, collation variants. - Shawn   http://blogs.msdn.com/shawnste Selfhost a custom locale from \\scratch2\scratch\shawnste\customlocaledrop\install.batfile:///\\scratch2\scratch\shawnste\customlocaledrop\install.bat (Selfhost

RE: Summary of i18n ad hoc group meeting - 21st March 2011

2011-03-25 Thread Shawn Steele
PM To: Shawn Steele; Nebojša Ćirić; es-discuss@mozilla.org Subject: RE: Summary of i18n ad hoc group meeting - 21st March 2011 Those would be BCP 47 language tags too, though. I guess the question is what level of support the spec should contain (MUST/SHOULD/MAY support RFC 6067). Addison From

Scientific Notation in Number Formatting

2011-03-25 Thread Shawn Steele
There was a tiny bit of discussion around scientific notation and number formatting a while back. I thought this included some comments around globalization differences between cultures on how scientific notation worked. Does anyone have any examples of such differences? Thanks, - Shawn

RE: A proposal to add String.prototype.format

2011-03-09 Thread Shawn Steele
I would postpone any formatting stuff until the i18n stuff was better understood. - Shawn   http://blogs.msdn.com/shawnste -Original Message- From: es-discuss-boun...@mozilla.org [mailto:es-discuss-boun...@mozilla.org] On Behalf Of Gillam, Richard Sent: Wednesday, March 09,

RE: Stupid i18n use cases question

2011-01-30 Thread Shawn Steele
-discuss-boun...@mozilla.org] on behalf of Erik Corry [erik.co...@gmail.com] Sent: Sunday, January 30, 2011 12:32 AM To: Mark Davis ☕ Cc: Mads Ager; Shawn Steele; es-discuss@mozilla.org Subject: Re: Stupid i18n use cases question On Jan 29, 2011 8:37 PM, Mark Davis ☕ m...@macchiato.commailto:m

RE: RE: Stupid i18n use cases question

2011-01-30 Thread Shawn Steele
Names of the locales? (We seem to be digressing somewhat). Downloading the code too is how jQuery's i18n works. -Shawn   http://blogs.msdn.com/shawnste From: Erik Corry [erik.co...@gmail.com] Sent: Sunday, January 30, 2011 10:48 AM To: Shawn Steele

Stupid i18n use cases question

2011-01-29 Thread Shawn Steele
On the phone yesterday we mentioned word/line breaking and grapheme clusters. It didn't occur to me to ask about the use cases. Why does someone need word/line breaking in js? It seems like that would better be done by my rendering engine, like the HTML layout engine or my edit control or

RE: Stupid i18n use cases question

2011-01-29 Thread Shawn Steele
] on behalf of Phillips, Addison [addi...@lab126.com] Sent: Saturday, January 29, 2011 4:19 PM To: Mark Davis ☕; Shawn Steele Cc: es-discuss@mozilla.org Subject: RE: Stupid i18n use cases question ... For line breaking, the main thing would be to preformat some text. We (the Lab126 we) use line and word

RE: i18n objects

2011-01-26 Thread Shawn Steele
problems for you. -Shawn   http://blogs.msdn.com/shawnste From: Phillips, Addison [addi...@lab126.com] Sent: Tuesday, January 25, 2011 10:09 PM To: Shawn Steele; Mark Davis ☕ Cc: es-discuss@mozilla.org; Gillam, Richard Subject: RE: i18n objects Hi, I’ve

Intent of i18n inferred values

2011-01-24 Thread Shawn Steele
My understanding of “inferred” values for the LocaleInfo object is that, when inferred is allowed/used, that values which are not explicitly specified in the constructor (eg: calendar), will be inferred as best as possible from the other inputs. For example: If I construct it with: var

RE: i18n objects

2011-01-24 Thread Shawn Steele
that it creates it the way it intends. -Shawn From: es-discuss-boun...@mozilla.org [mailto:es-discuss-boun...@mozilla.org] On Behalf Of Mark Davis ? Sent: Monday, January 24, 2011 4:09 PM To: Shawn Steele Cc: es-discuss@mozilla.org Subject: Re: i18n objects I don't understand. * If you want

RE: i18n objects

2011-01-24 Thread Shawn Steele
? Mark — Il meglio è l’inimico del bene — On Mon, Jan 24, 2011 at 15:53, Shawn Steele shawn.ste...@microsoft.commailto:shawn.ste...@microsoft.com wrote: But assume I have a “black-box” API that prints a report or something. If the caller correctly sets the LocaleInfo() for inferred or not inferred

RE: 2nd day meeting comments on the latest i18n API proposal

2011-01-21 Thread Shawn Steele
IMO that’s going overboard in the other direction ☺ It’d be nice to find some middle ground. Sometimes inferring can be very bad. Sometimes it can be very good. The problem isn’t that one is “right” or “wrong” for all apps, but rather that it might be simple for developers to accidentally

RE: 2nd day meeting comments on the latest i18n API proposal

2011-01-21 Thread Shawn Steele
infer” in the input options would help avoid accidentally not making a choice, yet still making it fairly easy to use. -Shawn From: mark.edward.da...@gmail.com [mailto:mark.edward.da...@gmail.com] On Behalf Of Mark Davis ? Sent: Friday, January 21, 2011 4:01 PM To: Shawn Steele Cc: Axel Hecht

i18n collator object

2011-01-20 Thread Shawn Steele
The i18n group said we’d figure out collator options by email. This is an email ☺ First I’d like to say something about the scope of collator’s in EcmaScript: I don’t think it’s reasonable to write a database in Javascript. I think people should instead use a database engine in some form,

FW: i18n collator options

2011-01-20 Thread Shawn Steele
Forwarding for Mark From: mark.edward.da...@gmail.com [mailto:mark.edward.da...@gmail.com] On Behalf Of Mark Davis ? Sent: Poʻahā, Ianuali 20, 2011 4:05 hours To: Shawn Steele Cc: es-discuss@mozilla.org; Peter Constable; Derek Murman Subject: Re: i18n collator options (BTW I haven't gotten

RE: i18n collator options

2011-01-20 Thread Shawn Steele
For UTF-16 order do you use like the Turkish casing if it was a turkish locale? -Shawn From: mark.edward.da...@gmail.com [mailto:mark.edward.da...@gmail.com] On Behalf Of Mark Davis ? Sent: Poʻahā, Ianuali 20, 2011 4:05 hours To: Shawn Steele Cc: es-discuss@mozilla.org; Peter Constable; Derek

RE: 2nd day meeting comments on the latest i18n API proposal

2011-01-19 Thread Shawn Steele
The list didn’t like my email address…, trying again. From: Shawn Steele Sent: Poʻakolu, Ianuali 19, 2011 6:12 hours To: 'Nebojša Ćirić'; es-discuss@mozilla.org Subject: RE: 2nd day meeting comments on the latest i18n API proposal I think a “common” case is to tweak a part of a LocaleInfo object