Re: First Impressions!

Patrick Schluter via Digitalmars-d Fri, 01 Dec 2017 04:36:00 -0800

On Friday, 1 December 2017 at 12:21:22 UTC, A Guy With a Question wrote:

On Friday, 1 December 2017 at 06:07:07 UTC, Patrick Schluter wrote:
On Thursday, 30 November 2017 at 19:37:47 UTC, Steven Schveighoffer wrote:
On 11/30/17 1:20 PM, Patrick Schluter wrote:
[...]
iopipe handles this: http://schveiguy.github.io/iopipe/iopipe/textpipe/ensureDecodeable.html
It was only to give an example. With UTF-8, people who implement the low-level code generally think about the multiple code units at the buffer boundary. With UTF-16 it's often forgotten. In UTF-16 there are also 2 other common pitfalls, which also exist in UTF-8 but are less consciously acknowledged: overlong encoding and isolated codepoints. So UTF-16 has the same issues as UTF-8, plus some more: endianness and size.
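To make the buffer-boundary point concrete, here is a minimal sketch in Python (not D, and not how iopipe does it). The helper name `decode_chunks` and the sample string are made up for illustration. It splits a byte stream in the middle of a multi-unit sequence, once for UTF-8 and once for UTF-16, and shows that a stateful decoder copes while a naive per-buffer decode does not.

```python
import codecs

def decode_chunks(chunks, encoding):
    """Decode a byte stream that arrives in arbitrary chunks.

    Hypothetical helper: the incremental decoder keeps any incomplete
    trailing sequence and finishes it when the next chunk arrives.
    """
    decoder = codecs.getincrementaldecoder(encoding)()
    parts = [decoder.decode(chunk) for chunk in chunks]
    parts.append(decoder.decode(b"", final=True))  # flush, error if truncated
    return "".join(parts)

# 'a', 'e' with acute accent, and one code point outside the BMP
text = "a\u00e9\U0001F600"
utf8 = text.encode("utf-8")        # 1 + 2 + 4 bytes
utf16 = text.encode("utf-16-le")   # 2 + 2 + 4 bytes (surrogate pair)

# Cut each stream inside the last character.
utf8_chunks = [utf8[:4], utf8[4:]]     # splits the 4-byte UTF-8 sequence
utf16_chunks = [utf16[:6], utf16[6:]]  # splits high and low surrogate

# Decoding the first UTF-16 buffer on its own fails on the lone surrogate.
naive_failed = False
try:
    utf16_chunks[0].decode("utf-16-le")
except UnicodeDecodeError:
    naive_failed = True

utf8_result = decode_chunks(utf8_chunks, "utf-8")
utf16_result = decode_chunks(utf16_chunks, "utf-16-le")
```

Both results equal the original string, so the boundary problem is the same in both encodings. It is only easier to overlook in UTF-16 because most text never needs a surrogate pair.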
Most problems with UTF16 are applicable to UTF8. The only issue that isn't is that, if you are just dealing with ASCII, it's a bit of a waste of space.

That's what I said. UTF-16 and UTF-8 have the same issues, but UTF-16 has 2 more: endianness and bloat for ASCII. All 3 encodings have their pluses and minuses; that's why D supports all 3, but with a preference for UTF-8.
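The two extra UTF-16 issues can be shown with a short Python sketch (again an illustration only; the function name `encoded_sizes` is hypothetical). It compares the byte sizes of an ASCII string in all three encodings and decodes UTF-16 with the wrong byte order.

```python
def encoded_sizes(text):
    """Return the encoded size in bytes for each of the three encodings."""
    return {
        "utf-8": len(text.encode("utf-8")),
        "utf-16": len(text.encode("utf-16-le")),  # no BOM, 2 bytes per unit
        "utf-32": len(text.encode("utf-32-le")),  # no BOM, 4 bytes per unit
    }

ascii_text = "hello"
sizes = encoded_sizes(ascii_text)

# Endianness: the same code units serialize to different byte sequences.
little = ascii_text.encode("utf-16-le")
big = ascii_text.encode("utf-16-be")

# Reading little-endian bytes as big-endian raises no error here;
# it silently produces five unrelated characters.
misread = little.decode("utf-16-be")
```

For pure ASCII, UTF-16 takes twice the space of UTF-8 and UTF-32 four times. The misread string has the same length as the original but different content, which is why UTF-16 data needs a BOM or an agreed byte order, while UTF-8 has no byte-order question at all.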
