Re: Feature suggestion: in-place append to array

Steven Schveighoffer Thu, 01 Apr 2010 07:35:10 -0700

On Thu, 01 Apr 2010 01:41:02 -0400, Mike S<mi...@notarealaddresslololololol.com> wrote:

Steven Schveighoffer wrote:
> What do you mean by nondeterministic? It's very deterministic, justnot
always easy to determine ;) However, given enough context, it's reallyeasy to determine.
When I say deterministic, I'm referring to determinism from the user'spoint of view, where the allocation behavior is affected solely by theparameter (the size request, e.g. 10000 objects) and not by some kind ofinternal state, hidden context, or arcane black magic. :p

Its abstracted to the GC, but the current GC is well defined. If yourequest to allocate blocks with length of a power of 2 under a page, youwill get exactly that length, all the way down to 16 bytes. If yourequest to allocate a page or greater, you get a contiguous block ofmemory that is a multiple of a page.


With that definition, is the allocator deterministic enough for your needs?

  > The amount of memory given is determined by the GC, and ultimately by
the OS. The currently supported OSes allocate in Page-sized chunks, sowhen you allocate any memory from the OS, you are allocating a page(4k). Most likely, you may not need a whole page for the data you areallocating, so the GC gives you more finely sized chunks by breaking upa page into smaller pieces. This strategy works well in some cases,and can be wasteful in others. The goal is to strike a balance that is"good enough" for everyday programming, but can be specialized when youneed it.
That's understandable, and it makes sense that the actual memory beingallocated would correspond to some chunk size. It's really just opaqueblack box behavior that poses a problem; if users are given well-definedguidelines and chunk sizes, that would work just fine. For instance, aspec like, "reserve a multiple of 512 bytes and that's exactly what youwill be given," would allow users to minimize wastefulness and knowprecisely how much memory they're allocating.

I think in the interest of allowing innovative freedom, such requirementsshould be left up to the GC implementor, not the spec or runtime. Anyonewho wants to closely control memory usage should just understand how theGC they are using works.

If you want to control memory allocation yourself, you can always dothat by allocating page-sized chunks and doing the memory management onthose chunks yourself. I do something very similar in dcollections tospeed up allocation/destruction.
 <snip>
I think D has deterministic allocation, and better ability than C++ tomake custom types that look and act like builtins. Therefore, you canmake an array type that suits your needs and is almost exactly the samesyntax as a builtin array (except for some things reserved forbuiltins, like literals). Such a thing is certainly possible, evenwith using the GC for your allocation.
That parallels what game devs do in C++: They tend to use customallocators a lot, and they're likely to follow the same basic strategyin D too, if/when it becomes a suitable replacement. I'm still justbrowsing though, and I'm not all that familiar with D. If you can'tactually use the built-in dynamic arrays for this purpose, how difficultwould it be to reimplement a contiguously stored dynamic container usingcustom allocation? I suppose you'd have to build it from the ground upusing a void pointer to a custom allocated block of memory, right? Douser-defined types in D have any/many performance disadvantages comparedto built-ins?

No, you would most likely use templates, not void pointers. D's templatesystem is far advanced past C++, and I used it to implement my customallocators. It works great.

User-defined types are as high performance as builtins as long as thecompiler inlines properly.

BTW, I made the change to the runtime renaming the function previouslyknown as setCapacity to reserve. It won't be a property, even if thatbug is fixed.
 -Steve
That's a bit of a downer, since a capacity property would have nicesymmetry with the length property. I suppose there were good reasonsthough. Considering the name change, does that mean reserve can onlyreserve new space, i.e. it can't free any that's already been allocated?

Capacity still exists as a read-only property. I did like the symmetry,but the point was well taken that the act of setting the capacity was notexact. It does mean that reserving space can only grow, not shrink. Infact, the capacity property calls the same runtime function as reserve,just passing 0 as the amount requested to get the currently reserved space.

You can't use capacity to free space because that could result in danglingpointers. Freeing space is done through the delete keyword. We do notwant to make it easy to accidentally free space.

(That makes me wonder: Out of curiosity, how does the garbagecollector know how much space is allocated to a dynamic array orespecially to a void pointer? I suppose it's registered somewhere?)

The GC can figure out what page an interior pointer belongs to, andtherefore how much memory that block uses. There is a GC function to getthe block info of an interior pointer, which returns a struct thatcontains the pointer to the block, the length of the block, and its flags(whether it contains pointers or not). This function is what the arrayappend feature uses to determine how much capacity can be used. I believethis lookup is logarithmic in complexity.


-Steve

Re: Feature suggestion: in-place append to array

Reply via email to