On Thu, Jul 26, 2012 at 6:19 PM, Richard Wordingham <
richard.wording...@ntlworld.com> wrote:

> On Thu, 26 Jul 2012 17:01:53 -0700
> "Steven R. Loomis" <s...@icu-project.org> wrote:
>
> >  May I ask if you have a specific example in mind? Is the U in ICU
> > misleading somehow, or CLDR recommending a policy for plain text?
>
> I had no examples of untrue or misleading statements about Unicode
> standards and ICU in mind.
>
> I believe the plain text example was hyperbole.  There is an issue
> resulting from the use of non-characters in CLDR root collation, and as
> you seem unaware of it, I have raised the matter as
> http://unicode.org/cldr/trac/ticket/5115 .  I fear there may be a
> practical reason for the forbidden behaviour, so fixing the issue may
> well end up being deferred to allow CLDR users time to improve their
> compliance with the LDML specification.


I was totally unaware of it. Thanks for being specific!

I suspect it was simply an oversight and not indicative of any systemic
issue.  UTS#35 gives the example of <cp hex="0"> for representing NULL as
an example of a character not to be used in XML. Note that there's nothing
wrong with processing non-characters in memory- I have to deal with
non-characters all the time. Thanks for filing the bug.

Regards,
Steven

Reply via email to