Re: CJK stroke order data: kRSUnicode v. kRSKangXi

2014-03-10 Thread Richard COOK
Mr. Nohejl, About the property data you mention below. kRSUnicode property data permits multiple/variant (space-delimited) radical/stroke values, and I think we will see important variants added in the future. Where a specific value attested in a specific Kangxi edition is missing from

Normalization test

2014-03-10 Thread Markus Doppelbauer
Hello, I am working on an Unicode Normalization implemenation. I have a question about a specific toNFC test rule. toNFC(0061 0305 0315 0300 05AE 0062) = (0061 05AE 0305 0300 0315 0062) expected: (0061 05AE 0305 0300 0315 0062) -/ = (00E0 05AE 0305 0315 0062) Why doesnt

Re: Normalization test

2014-03-10 Thread Philippe Verdy
toNFC(0061 0305 0315 0300 05AE 0062) - From DerivedCombiningClass.txthttp://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedCombiningClass.txt: 05D0..05EA; 0 # Lo [27] HEBREW LETTER ALEF..HEBREW LETTER TAV In other words, 05EA with combining class 0 is blocking the composition

Re: Normalization test

2014-03-10 Thread Philippe Verdy
Sorry, I took the wrong line (because I typed 05EA instead of 05AE) 05AE ; 228 # Mn HEBREW ACCENT ZINOR You're right, the combining class 228 does not block the composition. 2014-03-10 20:28 GMT+01:00 Philippe Verdy verd...@wanadoo.fr: toNFC(0061 0305 0315 0300 05AE 0062) -

Re: Normalization test

2014-03-10 Thread Markus Scherer
The U+0300 ( ̀ ) COMBINING GRAVE ACCENT is blocked by the U+0305 ( ̅ ) COMBINING OVERLINE which has the same ccc=230. Could you use an existing library rather than roll your own? markus ___ Unicode mailing list Unicode@unicode.org

Re: ?MP = Multi*lingual* plane?

2014-03-10 Thread Richard COOK
On Feb 27, 2014, at 7:23 AM, Michael Everson wrote: On 27 Feb 2014, at 02:32, Shriramana Sharma samj...@gmail.com wrote: Given that Unicode encodes scripts and not languages, how appropriate is it to call the BMP and the SMP as the multi*lingual* planes? You are more than two decades