Re: On heap segregation, GC optimization and @nogc relaxing

Orvid King via Digitalmars-d Tue, 11 Nov 2014 19:07:11 -0800

On Wednesday, 12 November 2014 at 02:34:55 UTC, deadalnix wrote:

Hi all,
I want to get back to the subject of ownership and lifetime and propose a solution, but first I propose to state the problem in a way that I haven't seen before (even if I have no doubt some have come to the same conclusion in the past).
The problem at hand is double: memory management and thread safety. Number one has been a hot topic for ages, and number two has become very important over the past years, due to the spread of multicore CPUs.
The problem at hand here is ownership of data. There are 3 roads you can take:
- Immutability and GC. Effectively, these 2 techniques allow you to get rid of ownership. There are advantages and drawbacks I'm going to discuss later.
- Being unsafe and relying on convention. This is the C++ road (and a possible road in D). It allows implementing almost any scheme you want, but comes at a great cost for the developer.
- Annotations. This is the Rust road. It also comes at a great cost for the developer, as some schemes may be non-trivial to express within the type system, but, contrary to the C++ road, it is safe.
These approaches all have some very nice things going for them, but also some killer scenarios.
Immutability+GC allows safety while keeping interfaces simple. That is of great value. It also comes with some nice goodies, in the sense that it is easy and safe to share data without bookkeeping, allowing one to fit more in cache and reduce the amount of garbage created. Most text processing apps fall into this category, and this is why D is that good at them. Another big goodie is that many lock-free algorithms become possible. Once you remove the need for bookkeeping of ownership, many operations can be implemented in an atomic manner. Additionally, it is possible to implement various GC optimizations on an immutable heap, which make the GC generally more efficient. But the cost is also real. For some use cases, this means having a large amount of garbage generated (Carmack wrote a piece on Haskell where he mentions the disastrous effect that having an immutable framebuffer would have: you'd have to clone it every time you draw in it, which is a no go). GC also tends to cause unpredictable runtime characteristics, which programs with real-time constraints can have a hard time dealing with.
Relying on convention has the advantage that any scheme can be implemented without constraint, while keeping interfaces simple. The obvious drawback is that it is time-consuming and error-prone. It also makes a lot of things unclear, so devs choose the better-safe-than-sorry road. That means excessive copying to make sure one owns the data, which is wasteful (in terms of work for the copy itself, garbage generation and cache pressure). While this must remain an option locally for system code, it doesn't seem like the right option at program scale, and we do it in C++ simply because we have to.
Finally, annotations are a great way to combine safety and speed, but generally come at a great cost when implementing uncommon ownership strategies, where you end up having to express complex lifetime and ownership relations.
Ideally, we want to map to what the hardware does. So what does the hardware do?
Multicore CPUs have several cores, each of them having layers of cache. Cache is organized in cache lines, and each cache line can be in various modes. Actual systems are quite complex and deal with problems we are not very interested in here (like write-back), but the general idea is that every cache line is owned in one of several modes.
Either the cache line is owned by a single core and can be written to, or the cache line is shared by several cores, each of them having a local copy of the line, but none of them can write to it. There is an internal bus where cores can exchange cache lines with each other, along with messages to acquire a cache line in read or read/write mode. That means CPUs are good at thread-local read/write, shared immutable data and transfer of ownership from one core to another. They are bad at shared writable data (as, effectively, the cache line will have to bounce back and forth between cores, and all memory accesses will need to be serialized instead of performed out of order).
In that world, D has a bizarre position where it uses a combination of annotations (immutable, shared) and GC. Ultimately, this is a good solution: use annotations for common cases, and fall back on GC/unsafe code when these annotations fall short.
Before going into why it is falling short, a digression on GC and the benefits of segregating the heap. In D, the heap is almost segregated into 3 groups: thread local, shared and immutable. These groups are very interesting for the GC:
- The thread-local heap can be collected while disturbing only one thread. It should be possible to use different strategies in different threads.
- The immutable heap can be collected 100% concurrently without any synchronization with the program.
- The shared heap is the only one that requires disturbing the whole program, but as a matter of good practice, this heap should be small anyway.
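As an illustration of the three groups, here is a minimal sketch of how they already show up in D's type system. The names `tlCounter`, `sharedCounter`, `table`, `worker` and `runWorker` are made up for this example; it only shows the qualifiers, not how any GC would actually segregate the memory.

```d
import core.atomic : atomicLoad, atomicOp;
import core.thread : Thread;

int tlCounter;                      // module-level variables are thread local by default
shared int sharedCounter;           // one instance visible to every thread
immutable int[] table = [1, 2, 3];  // immutable data, readable from any thread

// Runs in a second thread: it modifies its own copy of tlCounter
// and the single shared counter.
void worker()
{
    tlCounter += table[2];
    atomicOp!"+="(sharedCounter, 1);
}

// Starts the worker, waits for it, and reports what the calling thread sees:
// [its own tlCounter, the shared counter].
int[2] runWorker()
{
    auto t = new Thread(&worker);
    t.start();
    t.join();
    return [tlCounter, atomicLoad(sharedCounter)];
}

unittest
{
    immutable before = atomicLoad(sharedCounter);
    auto seen = runWorker();
    assert(seen[0] == 0);           // the worker's write never reached this thread's copy
    assert(seen[1] == before + 1);  // the shared write is visible
}
```

The thread-local write stays invisible to the caller, which is the property a per-thread collector would want to rely on.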
Various ML family languages (like OCaml) have adopted a segregated heap strategy and get great benefit out of it. For instance, OCaml's GC is known to outperform Java's in most scenarios.
We are sitting on a huge GC goldmine here, but 3 things prevent us from exploiting it:
- Exceptions. They can bubble from one thread to another and create implicit sharing.
- Uniqueness (as it is defined now), as it allows unique objects to be merged with any heap.
- Message passing. Ownership transfer is not possible, and so unsafe casting ensues.
* It has to be noted that delegates also allow this kind of stunt, but this is recognized as a bug by now and hopefully it is going to be fixed.
D has a type qualifier system for which we pay a big price. Getting everything const correct is difficult. We'd want to get the most bang for the buck. One of the bangs we are not far from being able to get is segregating the heap. As things stand, that means a shitty GC and unsafe code.
Let's present a concrete example using ownership:
pure Object foo() { ... }
immutable o = foo();
This is valid code. However, foo can do arbitrary manipulations to come up with the object. These include various allocations. These allocations are mutable inside foo, which makes it impossible to allocate them on the immutable heap (as a GC relying on this immutability could mess things up pretty badly). They also cannot be allocated on the TL heap, as once promoted to immutable, the data become shared as well.
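For readers who want to try the conversion described above, here is a small self-contained sketch. `Node` and `build` are hypothetical names standing in for the elided body of foo; the point is only that the object is mutated inside the pure function and then accepted as immutable by the caller.

```d
class Node
{
    int value;
    this(int v) pure { value = v; }
}

// A pure function whose only parameter is a plain int: nothing the caller
// holds can alias the result, so the result is implicitly unique.
Node build(int v) pure
{
    auto n = new Node(v);  // still mutable while we are inside build
    n.value += 1;
    return n;
}

unittest
{
    immutable Node frozen = build(41);  // implicit conversion to immutable
    assert(frozen.value == 42);

    Node local = build(1);              // the same function can also feed mutable data
    local.value = 10;
    assert(local.value == 10);
}
```

At the point of allocation inside `build`, nothing tells the runtime which of the two callers the object will end up with, which is the placement problem described above.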
On the other hand, ownership means that the compiler can know when things go out of scope and free them explicitly. That is a plus, as generating less garbage is always a way to improve garbage collection. The most efficient work there is is the work that does not need to be done.
I'd argue for the introduction of a basic ownership system: something much simpler than Rust's, that does not cover all use cases. But the good thing is that we can fall back on GC or unsafe code when the system shows its limits. That means we rely less on the GC, while being able to provide a better GC.
We already pay a cost at the interface with type qualifiers, so let's make the best of it! I'm proposing to introduce a new type qualifier for owned data.
Now it means that the throw statement expects an owned(Throwable), that pure functions that currently return an implicitly unique object will return owned(Object), and that message passing will accept passing around owned stuff.
The GC heap can be segregated into islands. We currently have 3 types of islands: thread local, shared and immutable. These are builtin islands with special characteristics in the language. The new qualifier introduces a new type of island, the owned island.
An owned island can only refer to other owned islands and to immutable data. Owned islands can be merged into any other island at any time (that is why they can't refer to TL or shared).
owned(T) can be passed around as a function parameter, returned, or stored as a field. When doing so, it is consumed. When an owned is not consumed and goes out of scope, the whole island is freed.
That means that owned(T) can implicitly decay into T, immutable(T) or shared(T) at any time. When doing so, a call to the runtime is made to merge the owned island into the corresponding island. If it is passed around as owned, then the ownership is transferred and all local references to the island are invalidated (using them is an error).
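The consumption rule can be loosely approximated in today's D with a library wrapper. This is only a sketch under heavy assumptions: `Owned`, `release` and `consumed` are invented names, nothing here talks to the GC or merges islands, and misuse is caught by a runtime assert instead of the compile-time error the proposal calls for.

```d
// Hypothetical library stand-in for the proposed owned(T) qualifier.
struct Owned(T) if (is(T == class))
{
    private T payload;

    @disable this(this);  // no copies: there is exactly one owner

    this(T p) { payload = p; }

    // True once the value has been handed over.
    bool consumed() const { return payload is null; }

    // Models the decay from owned(T) to T: hands the reference over and
    // invalidates this handle.
    T release()
    {
        assert(!consumed, "owned value already consumed");
        auto p = payload;
        payload = null;
        return p;
    }
}

class Message
{
    string text;
    this(string t) { text = t; }
}

unittest
{
    auto o = Owned!Message(new Message("hi"));
    assert(!o.consumed);

    Message m = o.release();  // ownership moves to the plain reference
    assert(o.consumed);       // the old handle can no longer be used
    assert(m.text == "hi");
}
```

What the wrapper cannot express is the island itself: the guarantee that everything reachable from the payload is also unreferenced from the outside.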
At the implementation level, a call to a pure function that returns an owned could look like this:
{
  IslandID __saved = gc_switch_new_island();
  scope(exit) gc_restore_island(__saved);

  call_pure_function();
}
This allows us to rely much less on the GC and allows for a better GC implementation.
@nogc. Remember? It was in the title. What does a @nogc function look like? A @nogc function does not produce any garbage or trigger a collection cycle. There is no reason per se to prevent @nogc code from allocating on the GC as long as you know it won't produce garbage. That means the only operations you need to ban are the ones that merge owned things into the TL, shared or immutable heap.
This solves the problem of @nogc + Exception. As Exceptions are isolated, they can be allocated, thrown and caught in @nogc code without generating garbage. They can safely bubble out of the @nogc section of the code and still be safe.
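To make the @nogc + Exception problem concrete: a @nogc function cannot write `throw new Exception(...)`, because `new` allocates on the GC heap. One workaround, sketched below with made-up names (`negativeInput`, `checked`), is to allocate the exception once up front and throw that same object every time. It works, but the exception is shared global state, which is the kind of convention-based code the proposal wants to avoid.

```d
// Allocated once at program start, outside any @nogc code.
__gshared Exception negativeInput;

shared static this()
{
    negativeInput = new Exception("negative input");
}

// @nogc holds because throwing an existing object does not allocate.
int checked(int x) @nogc
{
    if (x < 0)
        throw negativeInput;
    return x;
}

unittest
{
    assert(checked(3) == 3);

    bool caught = false;
    try
    {
        checked(-1);
    }
    catch (Exception e)
    {
        caught = e.msg == "negative input";
    }
    assert(caught);
}
```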
In the same way, it opens the door for a LOT of code that is not @nogc to become so. If the code allocates memory in an owned island and returns it, then it is up to the caller to decide whether it wants it garbage collected or kept as owned (and/or made reference counted, for instance).
The solution of passing a policy at compile time for allocation is close to what C++'s stdlib is doing, and even if the approach proposed by Andrei is better, I don't think this is a good one. The approach proposed here allows a lot of code to be marked as @nogc and lets the caller decide. That is ultimately what we want libraries to look like.

I think a combination of the C++ standard library's approach and Rust's approach would actually be the best possible. If we were to follow C++'s strategy, I think it would be important to make sure that it wouldn't require specifically adding template parameters and constraints, and instead allow the use of a concept-like system. I think that being able to default the allocator parameter to the GC, provided the current method is not @nogc, would also be a good idea. I think that if C++'s approach were taken, it would also be very beneficial to allow a syntax such as `auto obj = new MyClass() with allocator` and `delete obj with allocator`. I do think that the definition of @nogc would have to be slightly expanded though, to mean that any values that are allocated with a given allocator are also freed with the given allocator before returning. To connect back with your proposal and allow even more to be @nogc, an owned(MyClass, allocator) object would be allowed to be returned from a @nogc function. This would allow transfer of the ownership of the data and responsibility of deletion to the caller, provided that the caller is @nogc. If the caller is @nogc and fails to free the memory, DMD should produce an error. If the caller is not @nogc, then DMD would say nothing, and assume that the allocator used to allocate the object will do the cleanup.
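The `new ... with allocator` syntax above is hypothetical. As a rough point of comparison, here is a sketch of the same allocate-and-free pairing written with `make` and `dispose` from `std.experimental.allocator`, a Phobos package that was added after this thread. `Point` and `demo` are names invented for the example.

```d
import std.experimental.allocator : dispose, make;
import std.experimental.allocator.gc_allocator : GCAllocator;
import std.experimental.allocator.mallocator : Mallocator;

class Point
{
    int x, y;
    this(int x, int y) { this.x = x; this.y = y; }
}

int demo()
{
    // Roughly "new Point(1, 2) with allocator": allocated with malloc,
    // so this function is responsible for freeing it.
    Point p = Mallocator.instance.make!Point(1, 2);
    scope (exit) Mallocator.instance.dispose(p);  // "delete p with allocator"

    // Same call shape with the GC as the allocator: no explicit free,
    // the collector cleans up.
    Point q = GCAllocator.instance.make!Point(3, 4);

    return p.x + p.y + q.x + q.y;
}

unittest
{
    assert(demo() == 10);
}
```

In this library form nothing checks that `p` is freed with the allocator that produced it; that check is what the expanded @nogc definition above would add.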

This would allow far more situations to be accounted for by the allocation system without needing a GC, while still allowing programs that want to to use the GC. @nogc would simply mean that no garbage is produced; it would not make a guarantee of what allocator was used to perform the allocation. @nogc would also mean that no allocators, other than the ones passed in by the user, would be used to perform the allocations. This allows the current definition of @nogc to still be present, while opening the scope of @nogc up for use in a much larger variety of situations.
