Dear Werner, A strange thing, I describe it as follows:
The chinese character *爛*, whose inner code is *0x721b*. I can display
it correctly with utf-8 incoding file with emacs unicode 2, the following is
the information I got with the command *M-x describe-char*:
*************************
character: 爛 (29211, #o71033, #x721b)
preferred charset: unicode (Unicode (ISO10646))
code point: 0x721B
syntax: w which means: word
category: C:Chinese (Han) characters of 2-byte character sets
c:Chinese h:Korean j:Japanese
|:While filling, we can break a line at this character.
buffer code: #xE7 #x88 #x9B
file code: #xE7 #x88 #x9B (encoded by coding system utf-8-unix)
display: by this font (glyph code)
sun-exta:pixelsize=15:foundry=unknown:weight=medium:slant=r:width=normal
(#x61D7)
*************************
But when I change the encoding scheme of my file include this character into
GBK or GB18030, I can display the file correctly with vim or gedit and can't
with emacs, the following is the my emacs's chaos code information:
************************
\240\200
************************
When I use the command *M-x describe-char*, I got the following information:
****************************
character:\240 (4194208, #o17777640, #x3fffa0)
preferred charset: eight-bit (Raw bytes 0-255)
code point: 0xA0
syntax: w which means: word
buffer code: #xA0
file code: not encodable by coding system in-is13194-devanagari-unix
display: no font available
****************************
Why?
Best,
Hongsheng.
PS. The attached is my sample files incoding with GBK, GB18030 and UTF8
respectively.
strange-character.tar.bz2
Description: Binary data
_______________________________________________ Cjk maillist - [email protected] http://lists.ffii.org/mailman/listinfo/cjk
