I believe the key to making D's GC acceptable lies in two factors.

1. Improve the implementation enough that you will only be impacted by the GC in extremely low-memory or real-time environments.
2. Defer allocation more and more by using ranges and algorithms, and trust that compiler optimisations will make these fast.

The big, big offender for extra allocations, I believe, is functions which return objects rather than functions which write to output ranges. The single most common occurrence of this is toString. Instead of writing this...

string toString() {
    // Allocations the user of the library has no control over.
    return foo.toString() ~ bar.toString() ~ " something else";
}

I believe you should always, always instead write this.

// I left out the part with different character types.
import std.algorithm : copy;
import std.range : isOutputRange;

void writeString(OutputRange)(OutputRange outputRange)
if (isOutputRange!(OutputRange, char)) {
    // Allocations are controlled by the user of the library;
    // this template could appear in a @nogc function.
    foo.writeString(outputRange);
    bar.writeString(outputRange);

    " something else".copy(outputRange);
}

It's perhaps strange at first, because you're pre-programmed from other languages, except maybe C++ with its output streams, to always be allocating temporary objects everywhere, even if all you are doing is writing them to some output.
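
To make the payoff concrete, here's a minimal sketch of the pattern in use. The Point type, its writeString, and the calling code are purely illustrative, not from any real library; the point is that the caller picks the destination, a GC-backed Appender here, and could just as well pick a non-allocating buffer.

import std.array : appender;
import std.conv : toChars;
import std.range : isOutputRange, put;

// Hypothetical value type using the writeString pattern from above.
struct Point {
    int x, y;

    void writeString(OutputRange)(OutputRange outputRange)
    if (isOutputRange!(OutputRange, char)) {
        put(outputRange, '(');
        put(outputRange, x.toChars);
        put(outputRange, ", ");
        put(outputRange, y.toChars);
        put(outputRange, ')');
    }
}

void main() {
    // The caller decides where the characters go: here a GC-backed Appender...
    auto buffer = appender!string();
    Point(3, 4).writeString(buffer);
    assert(buffer.data == "(3, 4)");
    // ...but the same template could target a fixed stack buffer instead.
}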

For improving the GC to an acceptable level, I believe collection only needs to execute fast enough to fit comfortably within a frame. So for something rendering at 60FPS you have 1 second / 60 frames ~= 16.6 milliseconds of computation per frame without producing a single dropped frame. That means you need to get collection down to somewhere in the 1ms to 2ms region. At that point collection time will only impact something which is really pushing the hardware, which excludes most mobile video games, which are about the complexity of Angry Birds.
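
As a rough sketch of what that budget allows, assuming a hypothetical game loop (renderFrame and keepRunning are stand-ins) and assuming collection really can be kept in that 1-2ms range, you could even schedule collections yourself inside the slack of each frame:

import core.memory : GC;
import core.time : MonoTime, msecs;

void renderFrame() { /* hypothetical per-frame update and draw */ }

void gameLoop(bool delegate() keepRunning) {
    enum frameBudget = 16.msecs;   // ~ 1 second / 60 frames
    enum collectionCost = 2.msecs; // assumed worst-case pause

    GC.disable(); // no automatic collections in the middle of a frame

    while (keepRunning()) {
        immutable frameStart = MonoTime.currTime;
        renderFrame();

        // Only collect if the assumed 1-2ms pause still fits in this frame.
        if (MonoTime.currTime - frameStart + collectionCost < frameBudget)
            GC.collect();
    }
}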

I firmly believe there's no silver bullet for automatic memory management. Reference counting solutions, including automatic reference counting, will consume less memory than a garbage collector and offer more predictable collection times, but do so at the expense of memory safety and simplicity. You need fatter pointers to manage the reference counts, and you need to carefully deal with reference cycles.
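
A tiny hand-rolled sketch (RcNode is purely illustrative) of what those two costs look like:

// Every reference-counted allocation carries extra bookkeeping, and two
// objects that reference each other never see their counts reach zero.
struct RcNode {
    size_t refCount = 1; // the extra word a plain GC pointer doesn't need
    RcNode* partner;     // a mutual reference forms a cycle that leaks
}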

In addition, you cannot easily share slices of memory with reference counting, which is an advantage of garbage collection. With GC you can allocate a string, slice a part of it, hand the slice to some other object, and know that the slice will stay around for as long as it's needed. With reference counting you either have to retain the slice and the whole underlying segment together, allowing for the possibility of hidden cycles, or disallow slicing and create copies instead. Slicing under GC matters in practice because it enables much more efficient programs, for example returning regex matches as slices of the input, which we do right now.
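
For example, something along these lines works today (the surrounding names are made up, but std.regex matches on a string are slices of that string):

import std.regex : matchFirst, regex;

string keptSlice; // hypothetical longer-lived home for the slice

void example() {
    string text = "order id: 42817, status: shipped";
    auto m = matchFirst(text, regex(`\d+`));

    // m[0] is a slice of `text`, not a copy. With the GC we can hand it to
    // anything; the underlying string stays alive as long as any slice does.
    keptSlice = m[0];
    assert(keptSlice == "42817");
}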

For the environments which cannot tolerate collection whatsoever, like Sociomantic's real-time bidding operations, control of allocation will have to be left to the user. This is where the zero-allocation idea behind ranges and algorithms comes into play: the code which doesn't allocate, which could potentially be all of std.algorithm, can still be used in those environments rather than being rendered unusable.
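
As a small sketch of that idea, a lazily composed pipeline from std.range and std.algorithm compiles under @nogc because nothing in it touches the GC heap (sumOfEvens is just a made-up example):

@nogc int sumOfEvens(int limit) {
    import std.algorithm.iteration : filter, sum;
    import std.range : iota;

    // iota, filter, and sum are lazy or streaming; no allocation occurs,
    // so this is callable even where collection cannot be tolerated at all.
    return iota(limit).filter!(n => n % 2 == 0).sum;
}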

Those are my thoughts on it, anyway. I probably rambled on too long.
