Re: RFC: moving forward with @nogc Phobos

via Digitalmars-d Wed, 01 Oct 2014 13:55:56 -0700

On Wednesday, 1 October 2014 at 15:48:39 UTC, Oren Tirosh wrote:

On Tuesday, 30 September 2014 at 19:10:19 UTC, Marc Schützwrote:One problem with actually implementing this is that usingreference counting as a memory management policy requires extraspace for the reference counter in the object, just as garbagecollection requires support for scanning and identification ofinterior object memory range. While allocation and memorymanagement may be quite independent in theory, practical highperformance implementations tend to be intimately related.
(I'll try to make a sketch on how this can be implemented inanother post.)
Do elaborate!
As a conclusion, I would say that APIs should strive for thefollowing principles, in this order:
1. Avoid allocation altogether, for example by laziness(ranges), or by accepting sinks.
2. If allocations are necessary (or desirable, to make the APImore easily usable), try hard to return a unique value (thisof course needs to be expressed in the return type).
3. If both of the above fails, only then return a GCedpointer, or alternatively provide several variants of thefunction (though this shouldn't be necessary often). Aninteresting alternative: Instead of passing a flag directlydescribing the policy, pass the function a type that it shouldwrap it's return value in.
As for the _allocation_ strategy: It indeed needs to beconfigurable, but here, the same objections against a templateparameter apply. As the allocator doesn't necessarily need tobe part of the type, a (thread) global variable can be used tospecify it. This lends itself well to idioms like
   with(MyAllocator alloc) {
       // ...
   }
Assuming there is some dependency between the allocator and thememory management policy I guess this would be initialized onthread start that cannot be modified later. All code runninginside the thread would need to either match the configuredpolicy, not handle any kind of pointers or use a limited subsetof unique pointers. Another way to ensure that code can run oneither RC or GC is to make certain objects (specifically,Exceptions) always allocate a reference counter, regardless ofthe currently configured policy.

I don't have all answers to these questions. Still, I'm convincedthis is doable.

A straight-forwarding and general way to convert a unique objectto a ref-counted one is to allocate new memory for it plus thereference count, move the original object into it, and releasethe original memory. This is safe, because there can be noexternal pointers to the object, as it is unique. Of course, thiscan be optimized if the allocator supports extending anallocation. It could then preallocate a few extra bytes at theend to make the extend operation always succeed, similar to yoursuggestion to always allocate a reference counter.

I think the most difficult part is to find an efficient anduser-friendly way for the wrapper types to get at the allocator.Maybe the allocators should all implement an interface (a realone, not duck-typing). The wrappers (Owned, RC) can then includea pointer to the allocator (or for RC, embed it next to thereference count). This would make it possible to specify a(thread) global default allocator at runtime, which all libraryfunctions use by convention (for example let's call it `alloc`,then they would call `alloc.make!MyStruct()`). At the same time,it is safe to change the default allocator at any time, and touse different allocators in parallel in the same thread.

The alternative is obviously a template parameter to the functionthat returns the unique object. But this unfortunately is thennot restricted to just the function, but "infects" the returntype, too. And from there, it needs to spread to the RC wrapper,or any containers. Thus we'd have incompatible RC types, which Iwould imagine would be very inconvenient and restrictive.Besides, it would probably be too tedious to specify theallocator everywhere.

Therfore, I think the additional cost of an allocator interfacepointer is worth it. For Owned!T (with T being a pointer orreference), it would just be two words, which we can returnefficiently. We already have slices doing that, and AFAIK there'sno significantly worse performance because of them.

Re: RFC: moving forward with @nogc Phobos

Reply via email to