On 29 March 2010 13:56, Dale Miller <dalelmil...@comcast.net> wrote: > Since I get only the digest late in the evening, someone else may have > replied to this - if so, I apologize. > UTF-8 encodes every character in the Unicode standards (so far). Code points > from 0-x'7f'are coded as-is. Code points from x'80'-x'7ff' are encoded in > two bytes, code points from x'800' to x'ffff' require 3 bytes, and code > points from x'10000' to x'1ffff' (the current standard limit) require 4 > bytes. > So, there is no character in the Unicode repertoire missing from UTF-8.
Nor will there ever be, since UTF-8 is not a standalone encoding of characters, but just an algorithmic transformation of UNICODE. Tony H. ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@bama.ua.edu with the message: GET IBM-MAIN INFO Search the archives at http://bama.ua.edu/archives/ibm-main.html