Mr. Nohejl,
About the property data you mention below. kRSUnicode property data permits
multiple/variant (space-delimited) radical/stroke values, and I think we will
see important variants added in the future. Where a specific value attested in
a specific Kangxi edition is missing from
Hello,
I am working on an Unicode Normalization implemenation. I have a question
about a specific toNFC test rule.
toNFC(0061 0305 0315 0300 05AE 0062) =
(0061 05AE 0305 0300 0315 0062)
expected:
(0061 05AE 0305 0300 0315 0062)
-/ =
(00E0 05AE 0305 0315 0062)
Why doesnt
toNFC(0061 0305 0315 0300 05AE 0062) -
From
DerivedCombiningClass.txthttp://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedCombiningClass.txt:
05D0..05EA; 0 # Lo [27] HEBREW LETTER ALEF..HEBREW LETTER TAV
In other words, 05EA with combining class 0 is blocking the
composition
Sorry, I took the wrong line (because I typed 05EA instead of 05AE)
05AE ; 228 # Mn HEBREW ACCENT ZINOR
You're right, the combining class 228 does not block the composition.
2014-03-10 20:28 GMT+01:00 Philippe Verdy verd...@wanadoo.fr:
toNFC(0061 0305 0315 0300 05AE 0062) -
The U+0300 ( ̀ ) COMBINING GRAVE ACCENT is blocked by the U+0305 ( ̅ )
COMBINING OVERLINE which has the same ccc=230.
Could you use an existing library rather than roll your own?
markus
___
Unicode mailing list
Unicode@unicode.org
On Feb 27, 2014, at 7:23 AM, Michael Everson wrote:
On 27 Feb 2014, at 02:32, Shriramana Sharma samj...@gmail.com wrote:
Given that Unicode encodes scripts and not languages, how appropriate is it
to call the BMP and the SMP as the multi*lingual* planes?
You are more than two decades
6 matches
Mail list logo