Pim Blokland scripsit: > Then why does UnicodeData break them down as (e.g.) 0064 030C rather than > 0064 0315?
To keep the upper case and lower case characters in sync for decomposition, they always have the same combining characters. For another example, G with cedilla gets the cedilla on top when it's a capital, but it still decomposes to the ordinary combining cedilla. These are essentially font-ligaturing issues. -- John Cowan http://www.ccil.org/~cowan [EMAIL PROTECTED] To say that Bilbo's breath was taken away is no description at all. There are no words left to express his staggerment, since Men changed the language that they learned of elves in the days when all the world was wonderful. --The Hobbit

