Re: Possible bug in associative array implementation (and/or @safe checking)

Aaron D. Trout via Digitalmars-d-learn Thu, 16 Aug 2018 13:50:44 -0700

On Thursday, 16 August 2018 at 18:56:45 UTC, Steven Schveighofferwrote:

On 8/16/18 2:32 PM, Aaron D. Trout wrote:
[...]
On Thursday, 16 August 2018 at 17:20:23 UTC, StevenSchveighoffer wrote:
Yes, this is the effect I would expect.
D has traditionally simply allowed slicing stack data withoutquestion (even in @safe code), but that will change whendip1000 is fully realized. It will be allowed, but only whenassigning to scope variables.
Thanks for the quick and knowledgeable reply! I think Iunderstand what's going on, but I'm surprised it is allowed in@safe code since the compiler doesn't allow the following,even in non-@safe code:
int[] badSlice()
{
     int[2] buffer;
     return buffer[];
}
It's because it's on the same line. This is a crude "safe"feature that is easily duped.
This is allowed to compile:

int[2] buffer;
auto buf = buffer[];
return buf;

But add -dip1000 to the dmd options and that fails.
I would warn you that I think dip1000 is too crude to starttrying to apply it to your project, and may have linker errorswith Phobos.
I guess the compiler just isn't (yet!) able to catch that theassociative array is storing a slice of expired stack. I'msurprised that the built-in AA implementation *allows* usingslices as keys in @safe code without copying the underlyingdata to the heap first. This is clearly dangerous, but perhapsheap-copying slices defensively would result in anunacceptable performance hit.
I wouldn't put too much stock in having safety in the AA. TheAA is a very very old piece of the compiler, that pre-datessafety checks, and still is a bit of a kludge in terms of typeand memory safety. If you do find any obvious bugs, it's goodto report them.
This issue came up while trying to eliminate unnecessaryallocation in my code. In my case, I could set a maximum keylength at compile time and switch my key type to a structwrapping a static array buffer.
In hindsight, it was silly for me to think I could eliminateseparately allocating the keys when the key type was avariable length array, since the AA must store the keys. Thatsaid, a suitable admonition from the compiler here would havebeen very educational. I look forward to seeing the fullinclusion of DIP1000!
In this case, actually, the AA does NOT store the key data, butjust the reference to the keys. An array slice is a pointer andlength, and the data is stored elsewhere. The static version,however, does store all the key data inside the AA.
That being said, you can potentially avoid more allocation withthe keys with various tricks, such as pre-allocating all thekeys and then using the reference.
In other words, eagerly stick the data into an array of arrays:
auto sets = setA.map!(j => setB.filter!(i => i % j ==0).array).array;
and then not worry about duping them. But it all depends onyour use case.
-Steve

Thanks again for the quick reply! I have a pretty firm grasp onwhat a slice is (array + offset). What I had meant by the comment"the AA must store the keys" was that I had somehow gotten the(of course totally mistaken!) idea that the AA only ever neededto *examine* the key rather than actually storing it. If thatwere the case, a slice of about-to-be-expired stack would beperfectly fair game as a key. Am I correct that doing this*would* be an OK way to avoid unnecessary allocation if we knewthe key already existed (as a heap allocated slice) in the AA andwe simply wanted to modify the associated value? Example code:


--------------------------------------------------------------

immutable(int)[len] toImmutStaticArray(size_t len, R)(R range)
{
    import std.algorithm : copy;
    int[len] r;
    copy(range, r[]);
    return r;
}

void main() @safe
{
    int[int[]] aa;
    immutable(int)[] heapSlice = [0,1];
    aa[heapSlice] = 0;  // OK, aa stores heap allocated key

    {
        import std.range : iota;
        auto buffer = 2.iota.toImmutStaticArray!2;
        auto stackSlice = buffer[];
        aa[stackSlice] = 1; // OK yes? only accessing value
    }

    assert(aa[heapSlice] == 1);
}

--------------------------------------------------------------

Thanks also for the advice about -dip1000 and the state of thebuilt-in AA implementation. My code base has been changing toinclude more AA-heavy data structures, so I think that in thenear future I will need to do some refactoring to make changingAA implementation easier.

Also, one last question: should this issue be reported as a newbug? My understanding was that @safe code should not allowobtaining references to expired stack memory, but perhaps this isalready a known problem? I'm happy to file a new bug report ifthat would be helpful!


- Aaron Trout

Re: Possible bug in associative array implementation (and/or @safe checking)

Reply via email to