On 3/7/07, A.J.Mechelynck <[EMAIL PROTECTED]> wrote:

IIUC, the above is for 'encoding' and 'fileencoding' being both 8-bit
encodings. cp936 is the Microsoft encoding for mainland China: some characters
(such as ASCII) are 8 bits, others are 16 bits; and UTF-8 (the 'encoding'
Zhaojun uses) can use anywhere between 1 and 4 bytes to represent the
"assigned" codepoints. For instance, the highest Unicode codepoint currently
regarded as "valid" by the Unicode Consortium, U+10FFFD, is represented in
UTF-8 as F4 8F BF BD.

No! The very goal of this code is that `encodig` is utf-8 while
`fileencoding` is different. But you are right noting that I have not
tested this `fileencoding` other than 8-bit one -- except the marginal
case when it is utf-8 too. But again -- `encoding` *is* multibyte and
it works. Why not just to test? I haven't Chinese fonts installed...

--
Cyril Slobin <[EMAIL PROTECTED]> `When I use a word,' Humpty Dumpty said,
<http://45.free.net/~slobin> `it means just what I choose it to mean'

Reply via email to