Re: Proposal for fixing dchar ranges

Steven Schveighoffer Mon, 10 Mar 2014 13:01:37 -0700

On Mon, 10 Mar 2014 15:30:00 -0400, John Colvin <john.loughran.col...@gmail.com> wrote:

> On Monday, 10 March 2014 at 18:09:51 UTC, Steven Schveighoffer wrote:
>> Because one can slice out a multi-code-unit code point, one cannot access it via index. Strings would be horribly crippled without slicing. Without indexing, they are fine.
>> A possibility is to allow index, but actually decode the code point at that index (error on invalid index). That might actually be the correct mechanism.
> In order to be correct, both require exactly the same knowledge: the beginning of a code point, followed by the end of a code point. In the indexing case they just happen to be the same code point and happen to be one code unit from each other. I don't see how one is any more or less error-prone or fundamentally wrong than the other.

Using indexing, you simply cannot get a single code unit that represents a multi-code-unit code point. It doesn't fit in a char. It's guaranteed to fail, whereas slicing will give you access to all the data in the string.
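
A minimal D sketch of that difference, assuming a UTF-8 string that contains U+00E9 (encoded as the two code units 0xC3 0xA9). The string contents and variable names are made up for illustration:

```d
import std.range; // provides front for strings

void main()
{
    string s = "a\u00E9!";            // 'a', U+00E9 (two code units), '!'
    assert(s.length == 4);            // length counts code units, not code points

    char unit = s[1];                 // indexing yields exactly one code unit
    assert(unit == 0xC3);             // only the lead byte; U+00E9 cannot fit in a char

    string slice = s[1 .. 3];         // a slice can cover both code units
    assert(slice == "\u00E9");
    assert(slice.front == '\u00E9');  // front decodes the slice to a dchar
}
```

The index can only ever hand back half of the encoded code point, while the slice carries all of it.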

Now, with indexing actually decoding a code point, one can alias a[i] to a[i..$].front(), which means decode the first code point you come to at index i. This means indexing is slow(er), and returns a dchar. I think as a first step, that might be too much to add silently. I'd rather break it first, then add it back later.
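
A rough sketch of what such decoding indexing could look like, written as a free function. The name decodeAt is hypothetical and stands in for the proposed a[i]; it is not an existing Phobos function. It assumes that front on a string slice throws a UTFException when the slice starts in the middle of a code point, which is how std.utf.decode is documented to behave:

```d
import std.exception : assertThrown;
import std.range;               // provides front for strings
import std.utf : UTFException;

// Hypothetical helper standing in for a decoding a[i] on strings.
dchar decodeAt(string s, size_t i)
{
    // Decode the first code point found at code unit index i.
    return s[i .. $].front;
}

void main()
{
    string s = "a\u00E9!";               // U+00E9 occupies code units 1 and 2

    assert(decodeAt(s, 0) == 'a');
    assert(decodeAt(s, 1) == '\u00E9');  // a whole code point, returned as a dchar
    assert(decodeAt(s, 3) == '!');

    // Index 2 points at a continuation byte, so decoding is expected to fail.
    assertThrown!UTFException(decodeAt(s, 2));
}
```

Each call decodes rather than reading a single char, which is where the extra cost comes from.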


-Steve
