From: "Jonathan Coxhead" <[EMAIL PROTECTED]> > John Tisdale wrote: > > > I've created an illustration to accompany my MSDN article to provide a > > high-level overview of Unicode encoding. I would appreciate any feedback > > related to accuracy and clarity. > > > > http://www.ocean.org/JT/Writings/MSDN/Images/UnicodeEncodingSmall.gif > > > > Thanks, John
Additionally the labels are incorrect or misleading: - Add a layer above the first level showing grapheme clusters, that are made of one or multiple abstract characters. - There's a missing layer between the first (currently CCS) and second (CCF) box. The content of the first box is not showing a "Coded" character set, only examples of abstract characters, with a normative reference (U+....) and a representative glyph. So the top box should be labelled "The Unicode-ISO/IEC 10646 character repertoire" of abstract characters. The 3 examples shown should be labelled "abstract characters", and by themselves they are not encoded, only named by the U+xxxx reference.- The CEF box should have labels "Code Unit(s)" with the optional plural. - Add a layer below it showing codepoints (numeric values), which is appropriate to represent the Unicode CCS. Label the example boxes with "Code point(s)" with the optional plural. - The bottom box should not contain the term "Code Unit" but "byte(s)". Code units only exist in CEF, not in CES.

