Chris Pratley wrote,
Win2000 can be made to support Ext B characters. Download the support
package
at:
http://www.microsoft.com/china/windows2000/downloads/18030.asp
Downloading the support package there and running the download
installs the SimSun-18030 font.
The SimSun-18030 font (a
Hi!
Let's say that I have two files, namely file1 file2, in any Unicode
encoding, both starting with a BOM, and I compile them into one by using
cat file1 file2 file3
in Unix or
copy file1 + file2 file3
in MS-DOS, file3 will have the following contents:
BOM
contents from file1
BOM
in MS-DOS, file3 will have the following contents:
BOM
contents from file1
BOM
contents from file2
Is this in accordance with the Unicode standard
Nope. When concatenating two files (or any streams) of which the
second one has a BOM, the second one should be deleted.
However, there's a
You can remove a per-file prefix, certainly. This would make sense.
But if you do not, what is the harm of a character that you cannot see
and which does not even have width or cause line breaking behavior?
Realistically, what would the problem be?
MichKa
- Original Message -
From:
Note on the COPY command: it seems some versions of Windows seem to
be BOM-aware; at least Windows2000, when concatenating two text
files, does remove the second's BOM.
Pim Blokland
- Original Message -
From: Pim Blokland [EMAIL PROTECTED]
To: Unicode List [EMAIL PROTECTED]
Sent: Sunday, March 23, 2003 2:43 PM
Subject: Re: Several BOMs in the same file
[ ... ]
But now you've got me wondering whether there are any rules or
guidelines for the situation where two
From: Allen Haaheim
... it seems that [s]ince GB18030 is fully ISO 10646 compatible, it
readily supports CJK Extension B and other languages. I don't have
the GB18030 font or Extension B Charset in my machine. Can I load CJK
Extensions A and B without switching to XP? I would prefer to use
Chris Jacobs schreef:
guidelines for the situation where two files are joined, and the
second one has a BOM, but the first one hasn't. Should the
resulting
file have a BOM?
In that case you should seriously consider the possibility that
the byte
order for both files is different!
Good
unsubscribe
Title: RE: CJK question
Sorry if I wasn't clear. That
support package "enables" Win2k to support ExtB - that is, after you run it
calls toGDI with surrogates now succeed with correct metrics, etc. The
included foint does nto cover ExtB as you note.
Once you have done that, you
can use
I tried what you suggested with unipad, but for some reason it went to a
location on a PUA character map, rather than CJK Unified Ideographs
Extension B, where they are in fact located. I guess it is because Unipad
doesn't support Extension B yet, or else I am doing something wrong. But
This is good news for me, thanks a lot.
--Allen
- Original Message -
From:
Chris Pratley
To: Allen Haaheim ; [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Sent: Saturday, March 22, 2003 6:11
PM
Subject: RE: CJK question
Win2000 can be made to
support ExtB
Thanks for the enightening me regarding the CHANT website fonts. Now if ICS1
(and ICS2 and ICS6) would work, I could count this as a major victory.
My correct characters were simply what I know to be the right ones by
looking at hard copies and the only place I've seen them on my computer
screen,
13 matches
Mail list logo