Re: Proposal for fixing dchar ranges

Brad Anderson Mon, 10 Mar 2014 10:11:30 -0700

On Monday, 10 March 2014 at 13:35:33 UTC, Steven Schveighofferwrote:

I proposed this inside the long "major performance problem withstd.array.front," I've also proposed it before, a long time ago.
But seems to be getting no attention buried in that thread, noteven negative attention :)
An idea to fix the whole problems I see with char[] beingtreated specially by phobos: introduce an actual string type,with char[] as backing, that is a dchar range, that actuallydictates the rules we want. Then, make the compiler use thistype for literals.
e.g.:

struct string {
   immutable(char)[] representation;
   this(char[] data) { representation = data;}
   ... // dchar range primitives
}

Then, a char[] array is simply an array of char[].

points:
1. No more issues with foreach(c; "cassé"), it iterates viadchar2. No more issues with "cassé"[4], it is a static compilererror.
3. No more awkward ASCII manipulation using ubyte[].
4. No more phobos schizophrenia saying char[] is not an array.
5. No more special casing char[] array templates to fool thecompiler.6. Any other special rules we come up with can be dictated bythe library, and not ignored by the compiler.
Note, std.algorithm.copy(string1, mutablestring) will stilldecode/encode, but it's more explicit. It's EXPLICITLY a dcharrange. Use std.algorithm.copy(string1.representation,mutablestring.representation) will avoid the issues.
I imagine only code that is currently UTF ignorant will break,and that code is easily 'fixed' by adding the 'representation'qualifier.
-Steve

Generally I think it's a good idea. Going a bit further you couldalso enable Short String Optimization but you'd have toencapsulate the backing array.

It seems like this would be an even bigger breaking change thanWalter's proposal though (right or wrong, slicing strings is verycommon).

Re: Proposal for fixing dchar ranges

Reply via email to