Re: [edk2] [PATCH v4 05/10] BaseTools/Tests: Verify unsupported UTF-16 are rejected

2015-06-04 Thread Laszlo Ersek
On 06/04/15 08:42, Jordan Justen wrote: > Supplementary Plane characters can exist in UTF-16 files, > but they are not valid UCS-2 characters. > > For example, this python interpreter code: import codecs codecs.encode(u'\U00010300', 'utf-16') > '\xff\xfe\x00\xd8\x00\xdf' > > Therefore t

[edk2] [PATCH v4 05/10] BaseTools/Tests: Verify unsupported UTF-16 are rejected

2015-06-03 Thread Jordan Justen
Supplementary Plane characters can exist in UTF-16 files, but they are not valid UCS-2 characters. For example, this python interpreter code: >>> import codecs >>> codecs.encode(u'\U00010300', 'utf-16') '\xff\xfe\x00\xd8\x00\xdf' Therefore the UCS-4 0x00010300 character is encoded as two 16-bit n