On Mon, Mar 5, 2012 at 19:35, Denis Jacquerye wrote:
> According to ftp://std.dkuug.dk/jtc1/sc2/WG2/docs/n2463.doc the
> Cyrillic Selkup OE is mapped to Latin OE:
> CYRILLIC SMALL LETTER SELKUP O E to U+0153 LATIN SMALL LIGATURE OE
> CYRILLIC CAPITAL LETTER SELKUP O E to U+0152 LATIN CAPITAL LIGATURE OE
> Several other of those missing Cyrillic characters are simply mapped
> to Latin ones or sort of decomposed. 

N2463 also maps twelve characters from ISO 10754 that have been disunified 
since 2002, namely:
04/06 CYRILLIC SMALL LETTER KURDISH QA is now U+051B CYRILLIC SMALL LETTER QA
04/09 CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK is now U+0521 CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK
04/10 CYRILLIC SMALL LETTER MORDVIN EL KA is now U+0515 CYRILLIC SMALL LETTER LHA
04/14 CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK is now U+0523 CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
05/06 CYRILLIC CAPITAL LETTER KURDISH QA is now U+051A CYRILLIC CAPITAL LETTER QA
05/09 CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK is now U+0520 CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK
05/10 CYRILLIC CAPITAL LETTER MORDVIN EL KA is now U+0514 CYRILLIC CAPITAL LETTER LHA
05/14 CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK is now U+0522 CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK
06/03 CYRILLIC SMALL LETTER ER KA is now U+0517 CYRILLIC SMALL LETTER RHA
06/08 CYRILLIC SMALL LETTER KURDISH WE is now U+051D CYRILLIC SMALL LETTER WE
07/03 CYRILLIC CAPITAL LETTER ER KA is now U+0516 CYRILLIC CAPITAL LETTER RHA
07/08 CYRILLIC CAPITAL LETTER KURDISH WE is now U+051C CYRILLIC CAPITAL LETTER WE
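
For anyone who wants to double-check the list, here is a minimal sketch that compares each code point against the character names in Python's unicodedata module. The table is simply copied from the list above; the names DISUNIFIED and mismatches are my own, and the check assumes a Python build whose Unicode data is at version 5.2 or later.

```python
import unicodedata

# ISO position -> (UCS code point, expected character name), copied from the list.
DISUNIFIED = {
    "04/06": (0x051B, "CYRILLIC SMALL LETTER QA"),
    "04/09": (0x0521, "CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK"),
    "04/10": (0x0515, "CYRILLIC SMALL LETTER LHA"),
    "04/14": (0x0523, "CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK"),
    "05/06": (0x051A, "CYRILLIC CAPITAL LETTER QA"),
    "05/09": (0x0520, "CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK"),
    "05/10": (0x0514, "CYRILLIC CAPITAL LETTER LHA"),
    "05/14": (0x0522, "CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK"),
    "06/03": (0x0517, "CYRILLIC SMALL LETTER RHA"),
    "06/08": (0x051D, "CYRILLIC SMALL LETTER WE"),
    "07/03": (0x0516, "CYRILLIC CAPITAL LETTER RHA"),
    "07/08": (0x051C, "CYRILLIC CAPITAL LETTER WE"),
}


def mismatches(table=DISUNIFIED):
    """Return entries whose actual character name differs from the expected one."""
    bad = []
    for position, (code_point, expected) in table.items():
        # Fall back to a marker string if this Unicode version lacks the character.
        actual = unicodedata.name(chr(code_point), "<unassigned>")
        if actual != expected:
            bad.append((position, f"U+{code_point:04X}", actual))
    return bad


print(mismatches())
```

An empty list means every code point carries the name given above.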

There is a clear precedent here that the unifications of N2463 are not 
necessarily the final fate of any of these characters. If the О Е letter for 
Selkup should be disunified from U+0152/U+0153, then a proposal needs to be 
submitted calling for the addition of the two letters to the UCS.

It is worth noting that N2463 also decomposes four characters using U+0335, a 
practice which hasn't been used for decompositions since Unicode 1.1.
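
To illustrate the point about U+0335, here is a small sketch showing that already-encoded stroked letters carry no decomposition mapping in the Unicode Character Database. The three letters are my own choice of examples, not the four characters from N2463, and the helper name has_decomposition is hypothetical.

```python
import unicodedata

# Stroked letters already in the UCS; chosen only for illustration.
STROKED = ["\u0493", "\u049F", "\u0142"]


def has_decomposition(char):
    """True if the character has any decomposition mapping in the UCD."""
    return unicodedata.decomposition(char) != ""


print(unicodedata.name("\u0335"))
for char in STROKED:
    print(f"U+{ord(char):04X}", unicodedata.name(char), has_decomposition(char))
```

None of these decomposes to a base letter plus an overlay, which is why a mapping through U+0335 would not be canonically equivalent to any encoded stroked letter.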

I also don't understand the mapping of 04/05 CYRILLIC SMALL LETTER CHECHEN KA 
and 05/05 CYRILLIC CAPITAL LETTER CHECHEN KA into <U+043A CYRILLIC SMALL LETTER 
KA, U+030A COMBINING RING ABOVE> and <U+041A CYRILLIC CAPITAL LETTER KA, U+030A 
COMBINING RING ABOVE>, respectively. Is the character shown in ISO 10754 just a 
glyph variant of this combining sequence?
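
As a quick check on those two sequences, this sketch runs them through NFC to show that no precomposed character exists for either one, with Latin a plus ring above included for contrast. The helper names nfc and composes are my own.

```python
import unicodedata

KA_RING_SMALL = "\u043A\u030A"    # CYRILLIC SMALL LETTER KA + COMBINING RING ABOVE
KA_RING_CAPITAL = "\u041A\u030A"  # CYRILLIC CAPITAL LETTER KA + COMBINING RING ABOVE
A_RING = "a\u030A"                # Latin contrast case, composes to U+00E5


def nfc(text):
    """Normalize text to Normalization Form C."""
    return unicodedata.normalize("NFC", text)


def composes(sequence):
    """True if NFC folds the sequence into a single precomposed code point."""
    return len(nfc(sequence)) == 1


for sequence in (KA_RING_SMALL, KA_RING_CAPITAL, A_RING):
    before = " ".join(f"U+{ord(c):04X}" for c in sequence)
    after = " ".join(f"U+{ord(c):04X}" for c in nfc(sequence))
    print(before, "->", after, composes(sequence))
```

The Cyrillic sequences come back unchanged, so the mapping in N2463 can only ever be represented as base letter plus combining mark.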

—Ben Scarborough

