This is mostly me trying to make sense of the discussion.

So everyone hates autodecoding. But Andrei seems to hate it a good bit less than everyone else. As far as I could follow, he has one reason for that, which might not be clear to everyone:

char converts implicitly to dchar, so the compiler lets you search for a dchar in a range of chars. But that gives nonsensical results. For example, you won't find 'ö' in "ö".byChar, but you will find '¶' in there: '¶' is U+00B6, 'ö' is U+00F6, and 'ö' is encoded as 0xC3 0xB6 in UTF-8, so the second code unit of 'ö' happens to equal the code point value of '¶'.
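The encoding facts behind this can be checked in any language; here is a sketch in Python (standing in for D's byChar, which iterates UTF-8 code units, i.e. bytes):

```python
# 'ö' is U+00F6; its UTF-8 encoding is the two bytes 0xC3 0xB6.
encoded = "ö".encode("utf-8")
assert encoded == b"\xc3\xb6"

# '¶' is U+00B6 -- numerically equal to the *second byte* of 'ö'.
assert ord("¶") == 0xB6

# Searching by code unit: the code point value of 'ö' (0xF6) never
# appears as a byte of the encoding, but 0xB6 does. So a byte-level
# search "misses" 'ö' in its own string and "finds" '¶' instead.
assert 0xF6 not in encoded
assert 0xB6 in encoded
```
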

The same does not happen when searching for a grapheme in a range of code points, because you just can't do that accidentally. dchar does not implicitly convert to std.uni.Grapheme.

So autodecoding shields the user from one surprising aspect of narrow strings, and indeed this one kind of problem does not exist with code points.

So:
code units - a lot of surprises
code points - a lot of surprises minus one
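To make the remaining surprises concrete: the same user-perceived character can be more than one code point sequence, so code point searches can still miss. A hedged Python sketch, since these facts are about Unicode rather than D:

```python
import unicodedata

# The same grapheme 'ö' has two code point representations:
composed = "\u00f6"     # single code point (NFC form)
decomposed = "o\u0308"  # 'o' + COMBINING DIAERESIS (NFD form)

# At the code point level they are different sequences...
assert composed != decomposed
assert len(composed) == 1 and len(decomposed) == 2

# ...so a code point search for 'ö' fails against the decomposed form,
assert composed not in decomposed

# even though both normalize to the same grapheme.
assert unicodedata.normalize("NFC", decomposed) == composed
```
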

I don't think this makes autodecoding actually desirable, but I do think it prevents a mistake that could otherwise be common.

The issue could also be avoided by making char not convert implicitly to dchar. I would like that, but it would of course be another substantial breaking change.

At Andrei: Apologies if I'm misrepresenting your position. If you have other arguments in favor of autodecoding, they haven't gotten through to me.

At everyone: Apologies if I'm just stating the obvious here. I needed this pointed out, and it happened in the depths of the other thread. So maybe this is an aspect others haven't considered either.

Finally, this is not the only argument in favor of *keeping* autodecoding, of course. Not wanting to break user code is the big one there, I guess.
