Re: Proposal for fixing dchar ranges

Steven Schveighoffer Mon, 10 Mar 2014 11:16:56 -0700

On Mon, 10 Mar 2014 13:59:53 -0400, John Colvin<john.loughran.col...@gmail.com> wrote:

On Monday, 10 March 2014 at 13:35:33 UTC, Steven Schveighoffer wrote:
I proposed this inside the long "major performance problem withstd.array.front," I've also proposed it before, a long time ago.
But seems to be getting no attention buried in that thread, not evennegative attention :)
An idea to fix the whole problems I see with char[] being treatedspecially by phobos: introduce an actual string type, with char[] asbacking, that is a dchar range, that actually dictates the rules wewant. Then, make the compiler use this type for literals.
e.g.:

struct string {
   immutable(char)[] representation;
   this(char[] data) { representation = data;}
   ... // dchar range primitives
}

Then, a char[] array is simply an array of char[].

points:

1. No more issues with foreach(c; "cassé"), it iterates via dchar
2. No more issues with "cassé"[4], it is a static compiler error.
3. No more awkward ASCII manipulation using ubyte[].
4. No more phobos schizophrenia saying char[] is not an array.
5. No more special casing char[] array templates to fool the compiler.
6. Any other special rules we come up with can be dictated by thelibrary, and not ignored by the compiler.
Note, std.algorithm.copy(string1, mutablestring) will stilldecode/encode, but it's more explicit. It's EXPLICITLY a dchar range.Use std.algorithm.copy(string1.representation,mutablestring.representation) will avoid the issues.
I imagine only code that is currently UTF ignorant will break, and thatcode is easily 'fixed' by adding the 'representation' qualifier.
-Steve
I know warnings are disliked, but couldn't we make the slicing andindexing work as currently but issue a warning*? It's not ideal but itdoes mean we get backwards compatibility.

As I mentioned elsewhere (but repeating here for viewers), I was notplanning on disabling slicing.

Indexing is rarely a feature one needs or should use, especially withencoded strings.


-Steve

Re: Proposal for fixing dchar ranges

Reply via email to