Re: D GC theory

Sativa via Digitalmars-d Tue, 24 Feb 2015 07:26:34 -0800

On Tuesday, 24 February 2015 at 08:39:02 UTC, Kagamin wrote:

On Tuesday, 24 February 2015 at 00:30:43 UTC, ketmar wrote:
On Mon, 23 Feb 2015 21:11:22 +0000, Sativa wrote:
How hard would it be to modify D's GC to do the following twothings:
1. Scan the heap in the BG on another thread/cpu forcompactification.
needs read/write barriers added to generated code. a majorslowdown for
ALL memory access.
Only modifications of pointers, which introduce new cross-blockdependencies (so that GC knows to recheck the new dependency).Other memory access goes without slowdown.

But this type of thinking is the reason why the current GC is inthe state it is.

The compiler knows which pointers are "free" and which ones are"bound". Bound pointers are pointers that are not assigned freelyby the user. e.g., a pointer to an array who's address never isarbitrarily set by the user is bound. The compiler knows whereand how the pointer is assigned. Most pointers are this way.

Bound pointers are pointers the GC can easily clean up because itknows when and how they are used. In this way, if all pointers ofa program were bound, the GC can work in the background and neverpause the state to clean up. (essentially the compiler would needto insert special code) most pointers are bound pointers.

Free pointers are more difficult as they can, say, be randomlyinitiated and point to anywhere on the heap and have to be lookedin a locked way. (to prevent them changing in the middle of someGC operation)

But if one distinguishes bound and free pointers(Easily done witha bit in the pointers) and has the compiler keep track of whenfree pointers are used(by having a dirty bit when they arewritten to), then one can more easily scan the heap in thebackground.

In fact, one could potentially get away from all synching issuesby doing the following:

When ever free pointers are used a simple spin lock is used. Thespin lock checks a flag in the free pointers table that signalsthat a pointer is being changed by the code. When this is true,the free pointers table is in a state of flux and can't be reliedon. In the mean time, the GC can build up information about theheap for the bound pointers. It can figure out what needs to bechanged, setup buffering(which can be done using bits in thepointer), etc all in the background because the bound pointersare "stable" and deterministically change.

When the free pointers table's dirty flag is unset it means thatthe free pointers are not changing in the program and the GC canlock the table using another flag. When the flag is set the spinlock kicks in and pauses the program while the GC is working onthe free pointers table. (or to be more efficient, the programcan yield to some other background task code)

By having multiple tables of free pointers one can reduce theoverhead. The GC looks at on a piece at a time and locks on afraction of the code at any point in time. The compiler candistribute the locks vs pages in an optimized way throughprofiling.

Re: D GC theory

Reply via email to