IBM-1047

Dale Miller Mon, 29 Mar 2010 11:06:30 -0700

Since I get only the digest late in the evening, someone else may havereplied to this - if so, I apologize.UTF-8 encodes every character in the Unicode standards (so far). Codepoints from 0-x'7f'are coded as-is. Code points from x'80'-x'7ff' areencoded in two bytes, code points from x'800' to x'ffff' require 3bytes, and code points from x'10000' to x'1ffff' (the current standardlimit) require 4 bytes.

So, there is no character in the Unicode repertoire missing from UTF-8.


Dale Miller
dalelmil...@comcast.net

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@bama.ua.edu with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

Re: /usr/lib/nls/charmap/IBM-1047

Reply via email to