Dear Werner,

A strange thing, I describe it as follows:

The chinese character *爛*, whose inner code is *0x721b*. I can display
it correctly with utf-8 incoding file with emacs unicode 2, the following is 
the information I got with the command *M-x describe-char*:
*************************
        character: 爛 (29211, #o71033, #x721b)
preferred charset: unicode (Unicode (ISO10646))
       code point: 0x721B
           syntax: w    which means: word
         category: C:Chinese (Han) characters of 2-byte character sets
                   c:Chinese h:Korean j:Japanese
                   |:While filling, we can break a line at this character.
      buffer code: #xE7 #x88 #x9B
        file code: #xE7 #x88 #x9B (encoded by coding system utf-8-unix)
          display: by this font (glyph code)
     sun-exta:pixelsize=15:foundry=unknown:weight=medium:slant=r:width=normal
(#x61D7)
*************************

But when I change the encoding scheme of my file include this character into 
GBK or GB18030, I can display the file correctly with vim or gedit and can't 
with emacs, the following is the my emacs's chaos code information:

************************
\240\200
************************

When I use the command *M-x describe-char*, I got the following information:

****************************
        character:\240 (4194208, #o17777640, #x3fffa0)
preferred charset: eight-bit (Raw bytes 0-255)
       code point: 0xA0
           syntax: w    which means: word
      buffer code: #xA0
        file code: not encodable by coding system in-is13194-devanagari-unix
          display: no font available
****************************

Why?

Best,
Hongsheng.

PS. The attached is my sample files incoding with GBK, GB18030 and UTF8
respectively.

Attachment: strange-character.tar.bz2
Description: Binary data

_______________________________________________
Cjk maillist  -  [email protected]
http://lists.ffii.org/mailman/listinfo/cjk

Reply via email to