On 28/09/2018 14:21, Per Nordlöw wrote:
On Monday, 24 September 2018 at 14:31:45 UTC, Steven Schveighoffer wrote:
It's not scanning the blocks. But it is scanning the stack.
Each time you increase the space, it must search for a given
*target*. It must also *collect* any previous items at the end of the
scan. Note that a collection is going to mark every single page and
bitset contained in the item being collected (which gets
increasingly larger).
Is this because of the (potentially many) slices referencing this large
block?
I assume the GC doesn't scan the `byte` array for pointer values in this
case, but that it does for `void` arrays and class/pointer arrays, right?
Couldn't that scan be optimized by adding a bitset that indicates which
pages need to be scanned?
Is it common for GCs to treat large objects this way?
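For illustration, a page-granularity "needs scanning" bitset along the lines suggested above might look like this. This is a hypothetical C sketch; all names (`scan_bits`, `mark_scannable`, `scan_heap`) and the layout are invented for the example and are not druntime's actual data structures:

```c
#include <stddef.h>
#include <stdint.h>

#define NPAGES 1024

/* One bit per page: 1 = page may contain pointers and must be scanned. */
static uint64_t scan_bits[NPAGES / 64];

static void mark_scannable(size_t page)
{
    scan_bits[page / 64] |= 1ULL << (page % 64);
}

/* During marking, whole 64-page runs with no scannable page are skipped
   with a single word comparison instead of 64 per-page checks. */
static size_t scan_heap(void)
{
    size_t scanned = 0;
    for (size_t w = 0; w < NPAGES / 64; w++) {
        if (scan_bits[w] == 0)
            continue;                     /* skip 64 pages at once */
        for (size_t b = 0; b < 64; b++)
            if ((scan_bits[w] >> b) & 1)
                scanned++;                /* scan page w*64 + b here */
    }
    return scanned;
}
```

Leaving the bit clear for pages allocated as pointer-free data (e.g. a plain `byte[]`, which the GC already marks NO_SCAN) would let the mark phase skip them entirely.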
A profiler reveals that most of the time is spent "sweeping" the
memory, i.e. looking for allocations that are no longer referenced. The
existing implementation checks every page, so the required CPU time
grows linearly with used memory.
This version https://github.com/rainers/druntime/tree/gc_opt_sweep takes
advantage of the known size of allocations to skip unnecessary checks.
The last commit also adds support for keeping track of the size of
blocks of consecutive free pages. With this, your example has more or
less constant collection time (note that most of the program time is
spent setting the array to zero, though that isn't measured, and that
the allocation itself often triggers a collection, too).
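As a rough illustration of the idea in that branch (jumping over pages using the known allocation sizes and recorded free-run lengths) here is a hypothetical C sketch; the names and metadata layout are invented for the example, not the actual druntime code:

```c
#include <stddef.h>

enum { NPAGES = 64 };

/* Per-page metadata: an allocation spanning N pages records N on its
   first page; a run of consecutive free pages records its length on
   the run's first page. Continuation pages record 0. */
typedef struct {
    size_t alloc_pages[NPAGES]; /* >0 on the first page of an allocation */
    size_t free_run[NPAGES];    /* >0 on the first page of a free run    */
    int    marked[NPAGES];      /* mark bit of the allocation starting here */
} Pool;

/* Sweep visits only the first page of each allocation or free run,
   so its cost is O(#allocations), not O(#pages). */
static size_t sweep(Pool *p)
{
    size_t checks = 0;
    for (size_t i = 0; i < NPAGES; ) {
        checks++;
        if (p->free_run[i]) {
            i += p->free_run[i];        /* skip the whole free run */
        } else {
            size_t n = p->alloc_pages[i];
            if (!p->marked[i]) {        /* unmarked: free the allocation */
                p->free_run[i] = n;
                p->alloc_pages[i] = 0;
            }
            i += n;                     /* skip the whole allocation */
        }
    }
    return checks;
}
```

Because the loop advances by whole allocations and free runs, sweep cost scales with the number of live objects rather than the number of pages, which matches the roughly constant collection time described above.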
I also noticed a rather serious bug for huge allocations:
https://issues.dlang.org/show_bug.cgi?id=19281