Re: Which D features to emphasize for academic review article

Jakob Ovrum Sat, 11 Aug 2012 07:51:47 -0700

On Friday, 10 August 2012 at 22:01:46 UTC, Walter Bright wrote:

It catches only a subset of these at compile time. I can craft any number of ways of getting it to miss diagnosing it. Consider this one:
    float z;
    if (condition1)
         z = 5;
    ... lotsa code ...
    if (condition2)
         z++;
To diagnose this correctly, the static analyzer would have to determine that condition1 produces the same result as condition2, or not. This is impossible to prove. So the static analyzer either gives up and lets it pass, or issues an incorrect diagnostic. So our intrepid programmer is forced to write:
    float z = 0;
    if (condition1)
         z = 5;
    ... lotsa code ...
    if (condition2)
         z++;
Now, as it may turn out, for your algorithm the value "0" is an out-of-range, incorrect value. Not a problem as it is a dead assignment, right?
But then the maintenance programmer comes along and changes condition1 so it is not always the same as condition2, and now the z++ sees the invalid "0" value sometimes, and a silent bug is introduced.
This bug will not remain undetected with the default NaN initialization.
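
The quoted scenario can be sketched in D roughly as follows. This is only an illustration: the function name compute and the two boolean parameters are hypothetical stand-ins for condition1 and condition2, and the elided "lotsa code" is left out.

```d
import std.math : isNaN;

// Hypothetical stand-in for the quoted snippet; the two flags play the
// roles of condition1 and condition2.
float compute(bool condition1, bool condition2)
{
    float z;            // no explicit initializer: z is float.init, i.e. NaN
    if (condition1)
        z = 5;
    if (condition2)
        z++;            // NaN + 1 is still NaN, so a missed assignment propagates
    return z;
}

void main()
{
    // Both conditions agree: z is assigned before it is incremented.
    assert(compute(true, true) == 6);
    // The conditions diverge: the result is NaN rather than a plausible number.
    assert(isNaN(compute(false, true)));
}
```

With "float z = 0;" instead, the second call would return 1, which looks like a legitimate result.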

The compiler in languages like C# doesn't try to prove that the variable is NOT set and then emits an error. It tries to prove that the variable IS set, and if it can't prove that, it's an error.

It's not an incorrect diagnostic; it does exactly what it's supposed to do, and the programmer has to be explicit when taking on the responsibility of initialization. I don't see anybody complaining about this feature in C#; most experienced C# programmers I've talked to love it (I much prefer it too).

Leaving a local variable initially uninitialized (or rather, not explicitly initialized) is a good way to portray the intention that it's going to be conditionally initialized later. In C#, if your program compiles, your variable is guaranteed to be initialized later but before use. This is a useful guarantee when reading/maintaining code.


In D, on the other hand, it's possible to write D code like:

for(size_t i; i < length; ++i)
{
    ...
}

And I've actually seen this kind of code a lot in the wild. It boggles my mind that you think that this code should be legal. I think it's lazy - the intention is not clear. Is the default initializer being intentionally relied on, or was it unintentional? I've seen both cases. The for-loop example is an extreme one for demonstrative purposes, most examples are less obvious.
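
To make the ambiguity concrete, here is a minimal sketch comparing the implicit and explicit forms of the loop. The function names countImplicit and countExplicit are made up for illustration; both behave identically, which is exactly why a reader can't tell whether the omission in the first was deliberate.

```d
// Hypothetical helper: relies on the default initializer for both n and i.
size_t countImplicit(size_t length)
{
    size_t n;
    for (size_t i; i < length; ++i)   // i silently starts at size_t.init
        ++n;
    return n;
}

// Hypothetical helper: the same loop with the starting values spelled out.
size_t countExplicit(size_t length)
{
    size_t n = 0;
    for (size_t i = 0; i < length; ++i)
        ++n;
    return n;
}

void main()
{
    static assert(size_t.init == 0);  // the default initializer for size_t is 0
    assert(countImplicit(3) == countExplicit(3));
}
```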

Saying that most programmers will explicitly initialize floating point numbers to 0 instead of NaN when taking on initialization responsibility is a cop-out - float.init and float.nan are obviously the values you should be going for. The benefit is easy for programmers to understand, especially if they already understand why float.init is NaN. You say yelling at them probably won't help - why not? I personally use float.init/double.init etc. in my own code, and I'm sure other informed programmers do too. I can understand why people don't do it in, say, C, with NaN being less defined there afaik. D promotes NaN actively and programmers should be eager to leverage NaN explicitly too.
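
As a rough sketch of what I mean by taking on initialization responsibility with NaN rather than 0 (the function scale and its parameters are invented for this example):

```d
import std.math : isNaN;

// Hypothetical function: factor is explicitly marked as "no valid value yet".
float scale(bool haveInput, float input)
{
    float factor = float.nan;   // explicit, and states the intent
    if (haveInput)
        factor = input * 2;
    return factor;              // still NaN if no branch assigned it
}

void main()
{
    assert(isNaN(float.init));          // the default initializer is NaN too
    assert(scale(true, 1.5f) == 3.0f);
    assert(isNaN(scale(false, 1.5f)));  // the missing assignment stays visible
}
```

The explicit initializer satisfies a "must be set before use" rule while keeping the NaN behaviour intact.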

It's also important to note that C# works the same as D for non-local variables - they all have a defined default initializer (the C# equivalent of T.init is default(T)). Another point is that the local-variable analysis is limited to the scope of a single function body, it does not do inter-procedural analysis.

I think this would be a great thing for D, and I believe that all code this change breaks is actually broken to begin with.
