RE: CJK question

2003-03-23 Thread jameskass
Chris Pratley wrote, Win2000 can be made to support Ext B characters. Download the support package at: http://www.microsoft.com/china/windows2000/downloads/18030.asp Downloading the support package there and running the download installs the SimSun-18030 font. The SimSun-18030 font (a

Several BOMs in the same file

2003-03-23 Thread Stefan Persson
Hi! Let's say that I have two files, namely file1 file2, in any Unicode encoding, both starting with a BOM, and I compile them into one by using cat file1 file2 file3 in Unix or copy file1 + file2 file3 in MS-DOS, file3 will have the following contents: BOM contents from file1 BOM

Re: Several BOMs in the same file

2003-03-23 Thread Pim Blokland
in MS-DOS, file3 will have the following contents: BOM contents from file1 BOM contents from file2 Is this in accordance with the Unicode standard Nope. When concatenating two files (or any streams) of which the second one has a BOM, the second one should be deleted. However, there's a

Re: Several BOMs in the same file

2003-03-23 Thread Michael \(michka\) Kaplan
You can remove a per-file prefix, certainly. This would make sense. But if you do not, what is the harm of a character that you cannot see and which does not even have width or cause line breaking behavior? Realistically, what would the problem be? MichKa - Original Message - From:

Re: Several BOMs in the same file

2003-03-23 Thread Pim Blokland
Note on the COPY command: it seems some versions of Windows seem to be BOM-aware; at least Windows2000, when concatenating two text files, does remove the second's BOM. Pim Blokland

Re: Several BOMs in the same file

2003-03-23 Thread Chris Jacobs
- Original Message - From: Pim Blokland [EMAIL PROTECTED] To: Unicode List [EMAIL PROTECTED] Sent: Sunday, March 23, 2003 2:43 PM Subject: Re: Several BOMs in the same file [ ... ] But now you've got me wondering whether there are any rules or guidelines for the situation where two

Re: CJK question

2003-03-23 Thread Eric Rasmussen
From: Allen Haaheim ... it seems that [s]ince GB18030 is fully ISO 10646 compatible, it readily supports CJK Extension B and other languages. I don't have the GB18030 font or Extension B Charset in my machine. Can I load CJK Extensions A and B without switching to XP? I would prefer to use

Re: Several BOMs in the same file

2003-03-23 Thread Pim Blokland
Chris Jacobs schreef: guidelines for the situation where two files are joined, and the second one has a BOM, but the first one hasn't. Should the resulting file have a BOM? In that case you should seriously consider the possibility that the byte order for both files is different! Good

unsubscribe

2003-03-23 Thread brent
unsubscribe

RE: CJK question

2003-03-23 Thread Chris Pratley
Title: RE: CJK question Sorry if I wasn't clear. That support package "enables" Win2k to support ExtB - that is, after you run it calls toGDI with surrogates now succeed with correct metrics, etc. The included foint does nto cover ExtB as you note. Once you have done that, you can use

Re: CJK question

2003-03-23 Thread Allen Haaheim
I tried what you suggested with unipad, but for some reason it went to a location on a PUA character map, rather than CJK Unified Ideographs Extension B, where they are in fact located. I guess it is because Unipad doesn't support Extension B yet, or else I am doing something wrong. But

Re: CJK question

2003-03-23 Thread Allen Haaheim
This is good news for me, thanks a lot. --Allen - Original Message - From: Chris Pratley To: Allen Haaheim ; [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Saturday, March 22, 2003 6:11 PM Subject: RE: CJK question Win2000 can be made to support ExtB

Re: CJK question

2003-03-23 Thread Allen Haaheim
Thanks for the enightening me regarding the CHANT website fonts. Now if ICS1 (and ICS2 and ICS6) would work, I could count this as a major victory. My correct characters were simply what I know to be the right ones by looking at hard copies and the only place I've seen them on my computer screen,