If you have questions about particular normalizations, I'd suggest looking at the normalization charts on the Unicode website.
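
For the specific pair in the quoted question, a quick check is also possible in code. The following is only a sketch, assuming Python 3 and its standard unicodedata module (neither is mentioned in this thread); the variable names and the code_points helper are made up for illustration. It prints the decomposition mapping and the NFD and NFC forms of U+03AC and U+1F71. In the Unicode data shipped with current Python versions, U+1F71 maps to the single character U+03AC, and such singleton mappings are not recomposed, so NFC should give U+03AC for both inputs.

```python
import unicodedata

# Hypothetical names, chosen only for this illustration.
alpha_tonos = "\u03ac"  # GREEK SMALL LETTER ALPHA WITH TONOS
alpha_oxia = "\u1f71"   # GREEK SMALL LETTER ALPHA WITH OXIA


def code_points(s):
    """Format a string as a space-separated list of U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in s)


for ch in (alpha_tonos, alpha_oxia):
    print(
        code_points(ch),
        "| mapping:", unicodedata.decomposition(ch),
        "| NFD:", code_points(unicodedata.normalize("NFD", ch)),
        "| NFC:", code_points(unicodedata.normalize("NFC", ch)),
    )
```

The same loop can be pointed at the other Greek tonos/oxia pairs to see whether they behave the same way.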

Mark
________
[EMAIL PROTECTED]
IBM, MS 50-2/B11, 5600 Cottle Rd, SJ CA 95193
(408) 256-3148
fax: (408) 256-0799

----- Original Message -----
From: "David J. Perry" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Saturday, March 15, 2003 09:36
Subject: Normalisation and Greek characters

> U+03AC and U+1F71 both have canonical decompositions to U+03B1 followed
> by U+0301. (There are other similar pairs in the Greek blocks.) If an
> application applies normalisation form C both decompose to the same
> string; will the resulting recomposed character be 03AC or 1F71? I
> suspect the former, but I'd like to know if this is correct and if so,
> how this is determined.
>
> Thanks - David