Dear List Members,

I'm looking for enlightment, how to best (or least bad) encode Tamil SRI
in Unicode. The glyph can be seen as codepoint 0x82 of TSCII 1.7 at 
http://www.tamil.net/tscii/charset17.gif

The transcoding tables I found, especially the GNU libc 
iconv implementation
at:

http://sources.redhat.com/cgi-bin/cvsweb.cgi/~checkout~/libc/iconvdata/TSCII.precomposed?rev=1.1&content-type=text/plain&cvsroot=glibc

list 

0x82 : 0x0BB8 0x0BCD 0x0BB0 0x0BC0

So far, feedback from Tamil experts I got, seem to indicate that no
satisfiable encoding 
exists and they would prefer a distinct codepoint, which was rejected. 
For example 0x0BB8 0x0BCD 0x0BB0 0x0BC0 is the word 'laughable' in Tamil.

Alternatives given were
(0BB8)(0BCD)(0BB1)(0BC0)
(0BB6)(0BCD)(0BB1)(0BC0)  (if and when U+0BB6 becomes Unicode)
(0B9A)(0BBF)(0BB1)(0BC0)

I'm far too clueless to re-start the distinct codepoint discussion, but
rather look
for a pragmatic solution for transcoding.

Regards,
Peter Jacobi
Hamburg, Germany

-- 
NEU FÜR ALLE - GMX MediaCenter - für Fotos, Musik, Dateien...
Fotoalbum, File Sharing, MMS, Multimedia-Gruß, GMX FotoService

Jetzt kostenlos anmelden unter http://www.gmx.net

+++ GMX - die erste Adresse für Mail, Message, More! +++


Reply via email to