Dear Elliotte,

ISO-8859-11 has been finished for almost a year :)
For reference:

ISO-8859-11
8-bit single-byte coded graphic character sets -- Part 11: Latin/Thai alphabet
http://www.iso.ch/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=28263&ICS1=35&ICS2=40&ICS3=

or see its draft at
http://www.nectec.or.th/it-standards/iso8859-11/

ISO-8859-11 = TIS-620 + { NBSP }
NBSP = Non-Breaking Space (0xA0 / U+00A0)

----

Regarding the table at
http://crl.nmsu.edu/~mleisher/csets.html
and its "REPLACEMENT CHARACTER" entries: IMHO, they should be replaced with "UNDEFINED", since the Unicode mapping for CP874 (another Thai encoding, almost the same as Windows-874; these are TIS-620 variants)
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT
also uses "UNDEFINED".

You may be interested in some comments at
http://bugzilla.mozilla.org/show_bug.cgi?id=127755
:)

regards,
Art

Elliotte Rusty Harold wrote:
> The Unicode data files at
> http://www.unicode.org/Public/MAPPINGS/ISO8859/ do not include a mapping
> for ISO-8859-11, Thai. Is there any particular reason for this? Is
> ISO-8859-11 unfinished or deprecated or unable to be mapped to Unicode
> or some such? If none of these things are true, is there a mapping chart
> for this set somewhere?
>
> I'm working on adding output support for the ISO sets to XOM
> (http://www.cafeconleche.org/XOM/) and need the mapping tables to figure
> out which characters need to be escaped in which character sets.
>
> --
> Elliotte Harold
>
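
For illustration only, the relation ISO-8859-11 = TIS-620 + { NBSP } can be sketched in Python as below. This is a rough sketch, not an official mapping table: the function names are hypothetical, it assumes the usual TIS-620 layout (bytes 0xA1-0xDA and 0xDF-0xFB map to U+0E01-U+0E3A and U+0E3F-U+0E5B), and it assumes bytes below 0xA0 map to the identical code points. The remaining positions are treated as undefined rather than as a replacement character.

```python
# Hypothetical helpers; not part of XOM or of the Unicode mapping files.

def iso8859_11_to_unicode(byte):
    """Return the code point for an ISO-8859-11 byte, or None if undefined."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a single byte value (0-255)")
    if byte < 0xA0:
        # Assumption: ASCII and the control ranges map one-to-one.
        return byte
    if byte == 0xA0:
        # The one addition over TIS-620: NBSP at 0xA0 / U+00A0.
        return 0x00A0
    if 0xA1 <= byte <= 0xDA or 0xDF <= byte <= 0xFB:
        # Thai block: constant offset from the byte value.
        return 0x0E00 + (byte - 0xA0)
    # 0xDB-0xDE and 0xFC-0xFF are left undefined.
    return None


# Set of code points that can be written directly in ISO-8859-11.
ENCODABLE = {
    cp
    for cp in (iso8859_11_to_unicode(b) for b in range(256))
    if cp is not None
}


def needs_escaping(ch):
    """True if the character has no ISO-8859-11 byte and must be escaped."""
    return ord(ch) not in ENCODABLE
```

A serializer could call needs_escaping() on each character and emit a numeric character reference whenever it returns True.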