This is what Markus had to say (he implemented most of the collation for
ICU):

"http://www.unicode.org/reports/tr10/#Avoiding_Normalization

Step 1 of the algorithm: http://www.unicode.org/reports/tr10/#Step_1
which has a note:

   - Conformant implementations may skip this step *in certain
   circumstances: *see *Section 6.5, Avoiding
Normalization<http://www.unicode.org/reports/tr10/#Avoiding_Normalization>
   * for more information.

See also http://www.unicode.org/reports/tr10/#Parametic_Tailoring
-> attribute "normalization", see the description there
(this whole table 14 will soon move to the LDML spec, leaving only a link
in this place)"

So the question is:

1. Do we change i18n API default for normalization to always be true, with
some performance penalty?
2. Update ES 262 spec with info Markus passed (if possible)?


2012/8/30 Mark Davis ☕ <m...@macchiato.com>

> ICU *is* always able to compare them as being equal, just by setting the
> parameter.
>
> Even if the parameter isn't set, it uses an FCD sort (see
> http://unicode.org/notes/tn5/) and canonical closure, which handles most
> cases of canonical equivalence. The default is turned on for languages
> where the normal+auxiliary exemplar sets contains characters that would
> show a difference even with an FCD+closure sort, and can be turned on
> always if desired (at some cost in performance; 30% sounds high though).
>
> Mark <https://plus.google.com/114199149796022210033>
> *
> *
> *— Il meglio è l’inimico del bene —*
> **
>
>
>
> On Thu, Aug 30, 2012 at 6:30 PM, Norbert Lindenberg <
> ecmascr...@norbertlindenberg.com> wrote:
>
>> In particular, a conformant implementation must be able to compare any
>> two canonical-equivalent strings as being equal, for all Unicode characters
>> supported by that implementation."
>
>
>


-- 
Nebojša Ćirić
_______________________________________________
es-discuss mailing list
es-discuss@mozilla.org
https://mail.mozilla.org/listinfo/es-discuss

Reply via email to