On Thursday, 30 November 2017 at 10:19:18 UTC, Walter Bright
wrote:
On 11/27/2017 7:01 PM, A Guy With an Opinion wrote:
+- Unicode support is good. Although I think D's string type
should probably have been UTF-16 by default. Especially
considering the utf module states:
"UTF character support is restricted to '\u0000' <= character
<= '\U0010FFFF'."
It seems like the natural fit to me. Plus, for the vast
majority of use cases I'm pretty much guaranteed that one char
equals one code point. Not the biggest issue in the world, and
maybe I'm just being overly critical here.
Sooner or later your code will exhibit bugs if it assumes that
char == codepoint with UTF-16, because of surrogate pairs.
https://stackoverflow.com/questions/5903008/what-is-a-surrogate-pair-in-java
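The surrogate-pair pitfall is easy to demonstrate in Java (the language the linked question is about), since Java strings are sequences of UTF-16 code units. A minimal sketch: any character outside the Basic Multilingual Plane, such as U+1F600, takes two char units, so the unit count and the code-point count disagree.

```java
public class SurrogateDemo {
    public static void main(String[] args) {
        // U+1F600 lies outside the BMP, so UTF-16 encodes it
        // as a surrogate pair: two 16-bit char units.
        String s = new String(Character.toChars(0x1F600));

        System.out.println(s.length());                      // 2 (char units)
        System.out.println(s.codePointCount(0, s.length())); // 1 (code point)

        // The two units are a high/low surrogate, not characters on their own.
        System.out.println(Character.isHighSurrogate(s.charAt(0))); // true
        System.out.println(Character.isLowSurrogate(s.charAt(1)));  // true
    }
}
```

Any code that indexes such a string by char, or slices it at an arbitrary offset, can split the pair and produce invalid text.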
As far as I can tell, pretty much the only users of UTF-16 are
Windows programs. Everyone else uses UTF-8 or UTF-32.
I recommend using UTF-8.
Java, .NET, Qt, JavaScript, and a handful of others use UTF-16
too, some having started off with the earlier UCS-2:
https://en.m.wikipedia.org/wiki/UTF-16#Usage
Not saying either is better, since each has its flaws; just
pointing out that it's more than just Windows.