IBM-1047

Tony Harminc Mon, 29 Mar 2010 13:30:25 -0700

On 29 March 2010 13:56, Dale Miller <dalelmil...@comcast.net> wrote:
> Since I get only the digest late in the evening, someone else may have
> replied to this - if so, I apologize.
> UTF-8 encodes every character in the Unicode standards (so far). Code points
> from 0-x'7f'are coded as-is. Code points from x'80'-x'7ff' are encoded in
> two bytes,  code points from x'800' to x'ffff' require 3 bytes, and code
> points from x'10000' to x'1ffff' (the current standard limit) require 4
> bytes.
> So, there is no character in the Unicode repertoire missing from UTF-8.


Nor will there ever be, since UTF-8 is not a standalone encoding of
characters, but just an algorithmic transformation of UNICODE.

Tony H.

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@bama.ua.edu with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

Re: /usr/lib/nls/charmap/IBM-1047

Reply via email to