Hello, I have raised a bug for icu[1], not sure whether it will be fixed in icu4j 3.6. :-)
[1] http://bugs.icu-project.org/cgi-bin/icu-bugs/incoming?findid=5391 On 9/12/06, Richard Liang <[EMAIL PROTECTED]> wrote:
Hello, I will clarify this issue with ICU team. ;-) Best regards, Richard Tony Wu wrote: > I encounter a problem when implement isWhiteSpace(int) in j.l.Character. > There is a difference between RI and ICU. > > RI spec says, > > >> It is a Unicode szpace character (SPACE_SEPARATOR, LINE_SEPARATOR, or >> PARAGRAPH_SEPARATOR) but is not also a non-breaking space ('\u00A0', >> '\u2007', '\u202F'). > > but ICU spec says, > >> It is a Unicode space separator (category "Zs"), but is not a no-break >> space (\u00A0 or \u202F or \uFEFF). > > RI excludes U+2007 however ICU excludes U+FEFF > > And I looked up the definition of these 4 related characters on > unicode.org: > >> 00A0;NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;NON-BREAKING SPACE;;;; >> 2007;FIGURE SPACE;Zs;0;WS;<noBreak> 0020;;;;N;;;;; >> 202F;NARROW NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;;;;; >> FEFF;ZERO WIDTH NO-BREAK SPACE;Cf;0;BN;;;;;N;BYTE ORDER MARK;;;; > > > I consider it is a bug of ICU because the U+FEFF is not in category > *Zs* as > ICU spec described. And I purposed to report that to ICU team. > Should I handle the U+2007 by ourselves to follow RI or just document > this > problem in testcase? > -- Richard Liang China Software Development Lab, IBM --------------------------------------------------------------------- Terms of use : http://incubator.apache.org/harmony/mailing.html To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
-- Richard Liang China Development Lab, IBM --------------------------------------------------------------------- Terms of use : http://incubator.apache.org/harmony/mailing.html To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]