> If I use the following sample file: [...]
>
> Then I can get the chinese character *丂* in my dvi out file when
> running latex on it.
>
> But, when I use the following tex source file and run latex on it:
>
> --------------------
> \documentclass{article}
> \usepackage{CJK}
> \begin{document}
> \begin{CJK}{UTF8}{song}
> ^^81^^40
> \end{CJK}
> \end{document}
> --------------------
>
> I will get a [EMAIL PROTECTED] symbol in my out dvi file, why does this 
> happen?

Because ^^81^^40 is not UTF-8 encoding!  The character 丂 has the
UTF-8 code ^^e4^^b8^^82 (this is U+4E02).

> How can I input chinese characters with unicode number under UTF8
> ecoding scheme?

A quick look into the documentation of the CJK package would answer
that: Within UTF-8 encoding, just say \Unicode{"4E}{"02}.

> Furthmore, I find that in the first sample I posted here, the *8140*
> is not the unicode number of the character *丂*, in fact the chinese
> character *腀* is corresponding to the unicode number
> *8140*, so I'm puzzled on the map mechanism betwteen chinese
> character and its unicode number.  Can you give me some hints on
> these things?

I assume that `0x8140' is the character code in GBK encoding (I
haven't tested it).  In general, you shouldn't worry about this by
yourself.  I suggest that you use programs like `iconv' to convert
between encodings.


    Werner
_______________________________________________
Cjk maillist  -  [email protected]
https://lists.ffii.org/mailman/listinfo/cjk

Reply via email to