Microsoft input method, 950, and Unicode mapping

2001-12-17 Thread Tex Texin
Unicoders, I am sure there is a simple answer, but at the moment I am confused. On Windows 2k with default locale "Traditional Chinese" and input locale "Chinese (Taiwan)" and using the "Chinese Traditional - Quick" method, users can enter Characters with the code points: 0xFA44 0xFA41 0x916F T

Re: Microsoft input method, 950, and Unicode mapping

2001-12-17 Thread Y M Chan
The answer is yes and no, depending on whether you've applied the patch. To cut a long story short, you may want to check out: http://www.microsoft.com/hk/hkscs/ On Tue, 18 Dec 2001 00:50:24 -0500 "Tex Texin" <[EMAIL PROTECTED]> wrote: texin> Unicoders, texin> I am sure there is a simple answ

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Kenneth Whistler
Tex, > > Thanks for this and the several private responses. > > For anyone interested, in addition to the Microsoft page: > http://www.microsoft.com/hk/hkscs/ > > The HK Gov't has a web page, fonts and mapping tables: > http://www.info.gov.hk/digital21/eng/hkscs/introduction.html And to add t

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Tex Texin
Thanks for this and the several private responses. For anyone interested, in addition to the Microsoft page: http://www.microsoft.com/hk/hkscs/ The HK Gov't has a web page, fonts and mapping tables: http://www.info.gov.hk/digital21/eng/hkscs/introduction.html Oracle gave a nice paper at a recen

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Tex Texin
Ken, Thanks for commiserating. Yes, I noticed the differences in mapping tables. I am glad Sybase gave different character sets different names. I am curious how you deal with Unicode and HKSCS in the private use area, sometimes For that matter I wonder what a user in HK does when their Windo

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Thomas Chan
On Tue, 18 Dec 2001, Tex Texin wrote: > I am glad Sybase gave different character sets different names. There's a "Big5-HKSCS" tag[1]--is anyone using that? [1] http://www.iana.org/assignments/character-sets (see MIBenum 2101; I don't understand why it's in the "vendor" range, though) > For t

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Thomas Chan
On Tue, 18 Dec 2001, Kenneth Whistler wrote: > And to add to the chaos and confusion, note that the HKSCS > patch for Windows Code Page 950 does not map exactly the > same as the HK Government mapping table. And that the HK And that's in addition to the confusion caused by the semi-official, sem

Re: Microsoft input method, 950, and Unicode mapping

2001-12-18 Thread Asmus Freytag
On top of that, it looks like 950 maps a bogus symbol or punctuation character to U+2574. (2574 is one of a set of 4, and only 1 is mapped for starters. Fonts covering CP950 give a way different image for that character than you'd expect from either the charts or the names... I let some people

Re: Microsoft input method, 950, and Unicode mapping

2001-12-19 Thread Kevin Bracey
In message <[EMAIL PROTECTED]> Asmus Freytag <[EMAIL PROTECTED]> wrote: > On top of that, it looks like 950 maps a bogus symbol or punctuation > character to U+2574. (2574 is one of a set of 4, and only 1 is mapped for > starters. Fonts covering CP950 give a way different image for th

Re: Microsoft input method, 950, and Unicode mapping

2001-12-19 Thread Asmus Freytag
At 10:38 AM 12/19/01 +, Kevin Bracey wrote: >In message <[EMAIL PROTECTED]> > Asmus Freytag <[EMAIL PROTECTED]> wrote: > > > On top of that, it looks like 950 maps a bogus symbol or punctuation > > character to U+2574. (2574 is one of a set of 4, and only 1 is mapped for > > starters

Re: Microsoft input method, 950, and Unicode mapping

2001-12-20 Thread Kevin Bracey
In message <[EMAIL PROTECTED]> Asmus Freytag <[EMAIL PROTECTED]> wrote: > Because of this, you get better interoperation among CJK code sets with > using CIRCLED PLUS instead of EARTH, but at the cost of having obscured > the semantics (i.e. compromised interoperation with Unicode-ba