Re: Idiomatic D using GC as a library writer

Adam D Ruppe via Digitalmars-d-learn Sun, 04 Dec 2022 15:32:55 -0800

On Sunday, 4 December 2022 at 22:46:52 UTC, Ali Çehreli wrote:

That's way beyond my pay grade. Explain please. :)

The reason that the GC stops threads right now is to ensure thatsomething doesn't change in the middle of its analysis.

Consider for example, the GC scans address 0 - 1000 and findsnothing. Then a running thread moves a reference from memoryaddress 2200 down to address 800 while the GC is scanning1000-2000.

Then the GC scans 2000-3000, where the object used to be, but itisn't there anymore... and the GC has no clue it needs to scanaddress 800 again. It, never having seen the object, thinks theobject is just dead and frees it.


Then the thread tries to use the object, leading to a crash.

The current implementation prevents this by stopping all threads.If nothing is running, nothing can move objects around while theGC is trying to find them.

But, actually stopping everything requires 1) the GC knows whichthreads are there and has a way to stop them and 2) is overkill!All it really needs to do is prevent certain operations thatmight change the GC's analysis while it is running, like whathappened in the example. It isn't important to stop numeric work,that won't change the GC. It isn't important to stop pointerreads (well not in D's gc anyway, there's some that do need tostop this) so it doesn't need to stop them either.

Since what the GC cares about are pointer locations, it ispossible to hook that specifically, which we call write barriers;they either block pointer writes or at least notify the GC aboutthem. (And btw not all pointer writes need to be blocked either,just ones that would point to a different memory block. So thingslike slice iterations can also be allowed to continue. More on mybloghttp://dpldocs.info/this-week-in-d/Blog.Posted_2022_10_31.html#thoughts-on-pointer-barriers )


So what happens then:


GC scans address 0 - 1000 and finds nothing.

Then a running thread moves a reference from memory address 2200down to address 800... which would trigger the write barrier. Thethread isn't allowed to complete this operation until the GC isdone. Notice that the GC didn't have to know about this threadahead of time, since the running thread is responsible forcommunicating its intentions to the GC as it happens.(Essentially, the GC holds a mutex and all pointer writes ingenerated D code are synchronized on it, but there's variousimplementations.)

Then the GC scans 2000-3000, and the object is still there sincethe write is paused! It doesn't free it.

The GC finishes its work and releases the barriers. The threadnow resumes and finishes the move, with the object still aliveand well. No crash.

This would be a concurrent GC, not stopping threads that aredoing self-contained work, but it would also be more compatiblewith external threads, since no matter what the thread, it'd usethat gc mutex barrier.

Re: Idiomatic D using GC as a library writer

Reply via email to