Re: Why the hell doesn't foreach decode strings

Steven Schveighoffer Wed, 26 Oct 2011 05:20:38 -0700

On Mon, 24 Oct 2011 19:49:43 -0400, Simen Kjaeraas <simen.kja...@gmail.com> wrote:

On Mon, 24 Oct 2011 21:41:57 +0200, Steven Schveighoffer <schvei...@yahoo.com> wrote:
Plus, a combining character (such as an umlaut or accent) is part of a
character, but may be a separate code point.
If this is correct (and it is), then decoding to dchar is simply not enough. You seem to advocate decoding to graphemes, which is a whole different matter.

I am advocating that. And it's a matter of perception. D can say "we only support code-point decoding" and what that means to a user is, "we don't support language as you know it." Sure it's a part of unicode, but it takes that extra piece to make it actually usable to people who require unicode.

Even in English, fiancé has an accent. To say D supports unicode, but then won't do a simple search on a file which contains a certain *valid* encoding of that word is disingenuous to say the least.
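
To make the search problem concrete, here is a minimal sketch. It is written in Python rather than D, purely as an illustration of the Unicode behaviour and not of any D library. The names `composed`, `decomposed`, `text`, and `nfc` are made up for this example. It shows only normalization, which fixes this particular search; it is not full grapheme decoding.

```python
import unicodedata

# Two valid encodings of the same word.
composed = "fianc\u00e9"      # 'é' as a single code point, U+00E9
decomposed = "fiance\u0301"   # 'e' followed by U+0301 COMBINING ACUTE ACCENT

# Code-point decoding sees a different number of "characters" in each.
print(len(composed), len(decomposed))   # 6 7

# A code-point-level search for one encoding misses the other.
text = "my " + decomposed + " called"
print(composed in text)                 # False


def nfc(s):
    """Normalize to NFC so canonically equivalent sequences compare equal."""
    return unicodedata.normalize("NFC", s)


# Normalizing both sides first makes the search succeed.
print(nfc(composed) in nfc(text))       # True
```

Note that a plain search for "fiance" (no accent) would match inside the decomposed text, splitting the accented letter in half. Avoiding that takes grapheme-aware matching, which normalization alone does not provide.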

D needs a fully unicode-aware string type. I advocate D should use it as the default string type, but it needs one whether it's the default or not in order to say it supports unicode.


-Steve
