> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
> Behalf Of Peter Jacobi


> I'm looking for enlightenment on how best (or least badly) to encode
> Tamil SRI in Unicode. The glyph can be seen as codepoint 0x82 of
> TSCII 1.7 at http://www.tamil.net/tscii/charset17.gif...


> 0x82 : 0x0BB8 0x0BCD 0x0BB0 0x0BC0

That is exactly how it should be encoded.
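For anyone who wants to check the sequence character by character, here is a minimal Python sketch. It relies only on the standard unicodedata module; the names SRI and describe are mine, chosen for illustration, and are not part of any standard or of TSCII.

```python
import unicodedata

# Code points quoted above for TSCII 0x82 (Tamil SRI).
SRI = [0x0BB8, 0x0BCD, 0x0BB0, 0x0BC0]

def describe(code_points):
    """Return the string and the Unicode character names for a sequence."""
    text = "".join(chr(cp) for cp in code_points)
    names = [unicodedata.name(ch) for ch in text]
    return text, names

text, names = describe(SRI)
for cp, name in zip(SRI, names):
    # One line per code point: consonant, virama, consonant, vowel sign.
    print(f"U+{cp:04X}  {name}")
```

The listing only shows what is stored. Whether the sequence is drawn as the single SRI glyph depends on the font and rendering engine, not on the encoding.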


 
> So far, the feedback I got from Tamil experts seems to indicate that
> no satisfactory encoding exists, and they would prefer a distinct
> codepoint, which was rejected.

Of course, in order to comment, one would need to know why the above is
not satisfactory.


> For example 0x0BB8 0x0BCD 0x0BB0 0x0BC0 is the word 'laughable' in
> Tamil.
> 
> Alternatives given were
> (0BB8)(0BCD)(0BB1)(0BC0)
> (0BB6)(0BCD)(0BB1)(0BC0)  (if and when U+0BB6 becomes Unicode)
> (0B9A)(0BBF)(0BB1)(0BC0)

Alternatives to what? The first and third sequences would have distinct
appearances (see attached file), and would constitute distinct
spellings. The second cannot be evaluated without knowing what they
intend 0BB6 to be.
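The point that these are distinct spellings can also be checked at the code point level. The following is a rough sketch, assuming only Python's standard unicodedata module; the labels "given", "alt_1" and "alt_3" and the helper names are mine, for illustration. The second alternative is left out because it depends on what 0BB6 is meant to be.

```python
import unicodedata

# Sequences quoted in this thread; the labels are illustrative only.
CANDIDATES = {
    "given": [0x0BB8, 0x0BCD, 0x0BB0, 0x0BC0],
    "alt_1": [0x0BB8, 0x0BCD, 0x0BB1, 0x0BC0],
    "alt_3": [0x0B9A, 0x0BBF, 0x0BB1, 0x0BC0],
}

def to_text(code_points):
    """Build a string from a list of code points."""
    return "".join(chr(cp) for cp in code_points)

def differing_positions(a, b):
    """Indices at which two equal-length code point lists differ."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]

# Normalize to NFC so the comparison is not fooled by canonically
# equivalent forms, then check whether any two sequences coincide.
normalized = {
    label: unicodedata.normalize("NFC", to_text(cps))
    for label, cps in CANDIDATES.items()
}
all_distinct = len(set(normalized.values())) == len(normalized)

for label, cps in CANDIDATES.items():
    diff = differing_positions(CANDIDATES["given"], cps)
    print(label, [f"U+{cp:04X}" for cp in cps], "differs at", diff)
print("all distinct after NFC:", all_distinct)
```

This says nothing about how a particular font draws each sequence; it only confirms that software comparing or searching text will treat them as different strings.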


Peter
 
Peter Constable
Globalization Infrastructure and Font Technologies
Microsoft Windows Division


<<attachment: tamilsequences.PNG>>
