On Wed, Apr 12, 2017 at 2:22 PM, Paul Gilmartin < 0000000433f07816-dmarc-requ...@listserv.ua.edu> wrote:
> On Wed, 12 Apr 2017 13:50:47 -0500, John McKown wrote: > > > >> * CONVERT UNICODE TO UTF-8 > >> * CONVERT UTF-8 TO UNICODE > >> > >> What's "UNICODE"? I suppose it's in a glossary somewhere. > > > >With respect to the PoPS, "Convert UTF-8 to Unicode" is exactly the same > >instruction as "Convert UTF-8 to UTF-16" and "Convert Unicode to UTF-8" is > >the same instruction as "Convert UTF-16 to UTF-8". The results are > >documented in the instruction on the transformation results, but (of > >course) does not say exactly how it is implemented. > > > I see that: > CONVERT UTF-8 TO UTF-16 > CONVERT UTF-8 TO UNICODE > ... > 'B2A7' > ... > The one-, two-, three-, or four-byte UTF-8 characters of the second > operand are converted to two-byte Unicode characters and placed > at the first operand location. > > But Wikipedia, which is always right, tells me that UTF-16 is not a > two-byte representation but a variable-length representation. > > RCF in order? > That would be nice. UTF-16 is a "16 bit code unit" according to what I was reading on unicode.org . http://unicode.org/faq/utf_bom.html#gen0 [quote] A: No. The first version of Unicode was a 16-bit encoding, from 1991 to 1995, but starting with Unicode 2.0 (July, 1996), it has not been a 16-bit encoding. The Unicode Standard encodes characters in the range U+0000..U+10FFFF, which amounts to a 21-bit code space. Depending on the encoding form you choose (UTF-8, UTF-16, or UTF-32), each character will then be represented either as a sequence of one to four 8-bit bytes, one or two 16-bit code units, or a single 32-bit code unit. [quote] > > -- gil > > ---------------------------------------------------------------------- > For IBM-MAIN subscribe / signoff / archive access instructions, > send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN > -- "Irrigation of the land with seawater desalinated by fusion power is ancient. It's called 'rain'." -- Michael McClary, in alt.fusion Maranatha! <>< John McKown ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN