Pim Blokland scripsit:

> Then why does UnicodeData break them down as (e.g.) 0064 030C rather than
> 0064 0315?

To keep the upper case and lower case characters in sync for decomposition,
they always have the same combining characters.  For another example, G with
cedilla gets the cedilla on top when it's a capital, but it still decomposes
to the ordinary combining cedilla.  These are essentially font-ligaturing
issues.

-- 
John Cowan          http://www.ccil.org/~cowan        [EMAIL PROTECTED]
To say that Bilbo's breath was taken away is no description at all.  There are
no words left to express his staggerment, since Men changed the language that
they learned of elves in the days when all the world was wonderful. --The Hobbit

Reply via email to