Doug Ewell wrote:
> And no, I did not intend to make this big a deal out of it, and I
> apologize for doing so.

Nor did I.

I'm a genuine student of Unicode, here to learn. It seems many of the regular 
contributors to the Unicode and Unicore mailing lists are the Unicode experts 
themselves, many of whom are developers of the Unicode Standard. As such, these 
mailing lists are fantastic! There are very few technology mailing lists 
like them anymore. How cool is it to post an inquiry to the Unicode mailing 
list and have Unicode luminaries like Mark Davis, Asmus Freytag, Markus 
Scherer, 
Martin Dürst and Doug Ewell ALL reply? (The answer: Pretty darn cool!)

When I asked for clarification about my use of the term "CJK text" instead of 
"kana and Hangul text", I was earnest. If there was something wrong with my 
understanding of the standard terminology, I genuinely wanted to know what it 
was. You're the experts, I'm the initiate.

> The answer to Jim's question, then, is that for those examples
> of "CJK text" which are encoded differently in NFC and NFD (a group
> that excludes ideographs, thus immediately putting that side issue
> to rest), there are indeed some combinations of OS + app + rendering
> engine + font that can display those examples properly.

And this was the valuable lesson I learned. Until this exchange on the Unicode 
mailing, I'd had a biased and wrong impression of the state of the art with 
respect to Unicode normalization and modern software based on my own personal 
experience. I'm glad I asked the question, and I'm grateful for all the 
excellent and thorough answers.

When I type the ideograph 漢 (U+FA47) into BabelPad, highlight it, and then 
click 
the button labeled "Normalize to NFC", the character becomes 漢 (U+6F22). Does 
BabelPad not conform to the Unicode Standard in this case? Is this not truly 
Unicode normalization?

Jim Monty


Reply via email to