Re: Scott Meyers' DConf 2014 keynote "The Last Thing D Needs"

via Digitalmars-d-announce Thu, 29 May 2014 02:41:36 -0700

On Thursday, 29 May 2014 at 03:29:31 UTC, Jonathan M Davis viaDigitalmars-d-announce wrote:

1. The order of the dimensions of multi-dimensional staticarrays is backwards
in comparison to what most everyone expects.

    int[4][5][6] foo;
is the same as

    int foo[6][5][4];

and has the same dimensions as

    auto bar = new int[][][](6, 5, 4);
The reasons for it stem from the fact that the compiler readstypes outwardfrom the variable name (which is very important to understandin C because ofits function pointer syntax but not so important in D).However, once we did
    const(int)* foo;

and didn't allow

    (int)const* foo;
I think that we threw that particular bit of consistency withC/C++ out thewindow, and we really should have just made static arraydimensions be readfrom left-to-right. Unfortunately, I don't think that we canfix that at thispoint, because doing so would cause silent breakage (or atminimum, would be
silent until RangeErrors were thrown at runtime).


I don't see this as an inconsistency. Just read it as follows:

    int[6][5]* foo;

- start with the type int
- make an array from it
- make an array from that
- and finally, turn it into a pointer.

    const(int)* bar;

Just read `const(int)` as one entity here (as its form suggests,some kind of "function call"):


- start with a const(int)
- make a pointer from it

3. const, immutable, and inout on the left-hand side of afunction declaration are unfortunately legal.

Agreed. At least it's possible to do it by convention (but see4.).

4. There are some cases (such as with static constructors andunittest blocks)that the attributes have to go on the left for some reason. Idon't rememberthe reasons for it, but it's an inconsistency which definitelytrips up even
seasoned D programmers from time to time.

I don't know these cases, but the reason might be is thatfunction declarations and unittests need to be followed by braces(or a semicolon in the case of functions), whereas some otherkeywords also allow non-compound statements. This could thereforelead to ambiguities as to whether the type qualifier applies tothe declaration or the following statement.

5. The fact that pure is called pure is very problematic atthis point as faras explaining things to folks goes. We should probably considerrenaming it tosomething like @noglobal, but I'm not sure that that would goover very wellgiven the amount of breakage involved. It _does_ require a lotof explaining
though.

Well, it's just a name, and it's for hysterical raisins ;-) Idon't think it's so bad, because the purity concept alreadydiffers from language to language.

6. The situation with ranges and string is kind of ugly, withthem beingtreated as ranges of code points. I don't know what the correctsolution tothis is, since treating them as ranges of code units promotesefficiency butmakes code more error-prone, whereas treating them as ranges ofgraphemeswould just cost too much. Ranges of code points is _mostly_correct but stillincorrect and _more_ efficient than graphemes but still quite abit lessefficient than code units. So, it's kind of like it's got thebest and worstof both worlds. The current situation causes inconsistencieswith everythingelse (forcing us to use isNarrowString all over the place) anddefinitelyrequires frequent explaining, but it does prevent some classesof problems.So, I don't know. I used to be in favor of the currentsituation, but at thispoint, if we could change it, I think that I'd argue in faverof just treatingthem as ranges of code units and then have wrappers for rangesof code pointsor graphemes. It seems like the current situation promoteseither usingubyte[] (if you care about efficiency) or the new graphemefacilities instd.uni if you care about correctness, whereas just usingstrings as ranges ofdchar is probably a bad idea unless you just don't want to dealwith any ofthe Unicode stuff, don't care all that much about efficiency,and are willinghave bugs in the areas where operating at the code point levelis incorrect.

My preferred solution would be to disallow iterating over barechar/wchar/dchar ranges, but require an explicit .byCodeUnit,.byCodePoint or .byGrapheme. Probably not going to happen,though...

Re: Scott Meyers' DConf 2014 keynote "The Last Thing D Needs"

Reply via email to