Re: D array expansion and non-deterministic re-allocation

Steven Schveighoffer Tue, 01 Dec 2009 04:35:13 -0800

On Thu, 26 Nov 2009 17:45:30 -0500, Bartosz Milewski<bartosz-nos...@relisoft.com> wrote:

Steve, I don't know about you, but this exchange clarified some thingsfor me. The major one is that it's dangerous to define the semantics ofa language construct in terms of implementation. You were defending somepoints using implementation arguments rather than sticking to definedsemantics.

I was defending the semantics by using an example implementation. I wasnot defining the semantics in terms of implementation. The semantics aredefined by the spec, and do not indicate when an array is reallocated andwhen it is not. That detail is implementation defined. My examples usedmd's implementation to show how the assumption can break. You said theguy needs me to show him that it is broken, and all his tests pass, whycan't I use my knowledge of the implementation to come up with an example?

I could rewrite my statements as: "You should not rely on the array beingreallocated via append, because D does not guarantee such reallocation.Using the reference implementation of dmd, it is possible to come up withan example of where this fails: ..."

We have found out that one should never rely on the array beingre-allocated on expansion, even if it seems like there's no other way.The only correct statement is that the freshly expanded part of thearray is guaranteed not to be write-shared with any other array.

I agree with this (except for "even if it seems like there's no otherway," The spec says an allocation always occurs when you do a ~ b, so youcan always rewrite a ~= b as a = a ~ b). In fact, at one point to avoidstomping I went through Tango and found all places where append couldresult in stomping, and changed the code this way. There were probablyless than 5 instances. Append is not a very common operation when youdidn't create the array to begin with.

However, this discussion veered away from a more important point. Idon't believe that programmers will consciously make assumptions aboutre-allocation breaking sharing.

For the most part, this is ok -- rarely do you see someone append to anarray they didn't create *and* modify the original data.

My belief is that people will expect more that appending an array*doesn't* reallocate. If you have experience in programming, the languageyou are used to either treats arrays as value types or as referencetypes. I don't think I've ever seen a language besides D that uses thehybrid type for arrays. So you are going to come to D expecting value orreference. If you expect value, you should quickly learn that's not thecase because 99% of the time, arrays look like reference types. It isnatural then to expect appending to an array to affect all other aliasesof that array, after all it is a reference type. I just think yourexamples don't ring true in practice because there are simpler ways toguarantee allocation. You have to go out of your way to write bad codethat doesn't work correctly.

Finally, it's easy to turn an array into a reference type when passing asa parameter, just use the ref decorator. All we need is a way to turn itinto a value type, and I think Andrei's idea of Value!(arr) would be greatfor that.

The danger is that it's easy to miss accidental sharing and it's veryhard to test for it.

I think this danger is rare, and it's easy to search for (just search for~= in your code, I did it with Tango). I think it can be very welldefined in a tutorial or book chapter.


-Steve

Re: D array expansion and non-deterministic re-allocation

Reply via email to