Re: Ångstrøm symbol

2003-02-18 Thread Doug Ewell
As Stefan Persson already observed, U+212B ANGSTROM SIGN (Å) exists in Unicode alongside U+00C5 LATIN CAPITAL LETTER A WITH RING ABOVE (Å) only because both characters were present in some legacy character set with which Unicode had to maintain round-trip compatibility. -Doug Ewell Fullerton, Cal

Re: Ångstrøm symbol

2003-02-18 Thread Wm Sean Glen
Dear David, There is a letter in the Swedish alphabet (capital A with a ring above). Some Swede by the name of Ångstrøm was a scientist and worked with light and color. He came up with a convenient was to accurately measure the color of light. That measurement was named after him and given

Re: Hot Beverage font.

2003-02-18 Thread starner
>William Overtoning Sorry; as was pointed out to me in private email, that should have been William Overington.

Re: Hot Beverage font.

2003-02-18 Thread starner
>one down, 95000+ to go. > >Can we not have a detailed mail for each character describing 3 places it was >used and "it looks good to me"? I'm curious if you would have sent the same message if Michael Everson had sent a message about one character. We've had threads on this list about one charact

Re: RFC, 5-6 octets sequence in UTF8, non short form in UTF8

2003-02-18 Thread Markus Scherer
Frank, http://www.ietf.org/internet-drafts/draft-yergeau-rfc2279bis-03.txt addresses these, and version -04 of this draft will be public shortly. markus

Re: BOM's at Beginning of Web Pages?

2003-02-18 Thread Jonathan Coxhead
That's a very long-winded way of writing it! How about this: #!/usr/bin/perl -pi~ -0777 # program to remove a leading UTF-8 BOM from a file # works both STDIN -> STDOUT and on the spot (with filename as argument) s/^\xEF\xBB\xBF//s; which uses perl's -p, -i and -0 o

Re: Hot Beverage font.

2003-02-18 Thread Tex Texin
one down, 95000+ to go. Can we not have a detailed mail for each character describing 3 places it was used and "it looks good to me"? Imagine if every font designer did that. We are now aware of another site that has fonts for unicode so we all know where to look. 'nuff said. tex [EMAIL PROTECT

RFC, 5-6 octets sequence in UTF8, non short form in UTF8

2003-02-18 Thread Yung-Fong Tang
I read the RFC 2279 again ( http://www.cis.ohio-state.edu/cs/Services/rfc/rfc-text/rfc2279.txt ) 1. I cannot find any text in it mentioned about. non short form is invalid UTF8, and 2. It mentioned about 1-6 octets of UTF8 3. It mentioned about how to encode surrogate pair to UTF-8. But it does

Re: Hot Beverage font.

2003-02-18 Thread jameskass
. William Overington has graciously provided a downloadable font with the Unicode 4.0 hot beverage symbol encoded at U+2615. A suggestion would be to add some of the other interesting new glyphs from Unicode 4.0 for experimental purposes. There are many to choose from, and no fonts (to speak of)

Re: TrueType Explorer (was RE: Everson Mono)

2003-02-18 Thread Bob_Hallissy
On 17/02/2003 20:01:51 Rick Cameron wrote: >TrueType Explorer can do this and more. I gave TTE a try and it definitely has some interesting and useful features. To get some answers to a few questions, I have corresponded with the author and he has given me permission to post the following: -

Hot Beverage font.

2003-02-18 Thread William Overington
Thinking that the new to Unicode 4.0 symbol U+2615 Hot Beverage might be very useful in the preparation of meeting agendas and the like and also wishing to try to design a glyph which would look good particularly at a 12 point size in documents, I have produced a font named Hot Beverage which I hav

Re: DBCS and Unicode 3.1

2003-02-18 Thread Markus Scherer
[EMAIL PROTECTED] wrote: Does anyone know of a way to process GB 18030 data in COBOL on MVS? You could try to call ICU4C from COBOL http://oss.software.ibm.com/icu/userguide/cobol.html ICU has a GB 18030 converter. markus

Re: DBCS and Unicode 3.1

2003-02-18 Thread Markus Scherer
Jungshik Shin wrote: On Mon, 17 Feb 2003, Markus Scherer wrote: Other examples: There are EUC-JP (1/2/3 bytes per character) and EUC-CN (1/2/4 BpC) which are quite "old" (much older than GB 18030). Markus's fingers made a mistake here :-). It's EUC-TW (not EUC-CN) that encodes CNS 11643 pl

Re: DBCS and Unicode 3.1

2003-02-18 Thread Erik.Ostermueller
Thanks, all, for your responses. They helped me to better phrase my question: Does anyone know of a way to process GB 18030 data in COBOL on MVS? Thanks, --Erik Ostermueller