Re: D array expansion and non-deterministic re-allocation

Andrei Alexandrescu Tue, 24 Nov 2009 08:30:16 -0800

Steven Schveighoffer wrote:

On Tue, 24 Nov 2009 11:01:10 -0500, Andrei Alexandrescu <seewebsiteforem...@erdani.org> wrote:
Steven Schveighoffer wrote:
[snip]
Andrei has mentioned that he thinks we can store the allocated length in the GC block, which I think would also work. You also wouldn't need an MRU cache in that case, but he says it's in *addition* to the MRU cache, so I'm not sure what he means.
[snip]
Reaching the GC block is relatively expensive, so the MRU still helps. In essence it's like this. When appending:
a) look up the cache, if there, you have O(1) amortized append that's really fast
b) if not in the cache, talk to the GC, still O(1) amortized append that's not as fast
Both help provide an important performance guarantee. I was a bit worried about guaranteeing "O(1) amortized append for up to 8 arrays at a time."
Have you considered the performance impact on allocating non-array types? That is, are you intending on all allocations storing the allocated length, even class or struct allocations that will likely never append? Or will there be a "non-appendable" flag?
Also, the part without the MRU cache was my original proposal from last year, I had some ideas on how the length could be stored. For example, in a page of up to 128-byte blocks, you only need 8 bits to store the length (alas, you cannot store it with 4 bits for 16-byte blocks because you need to cover both 0 and 16). This reduces the overhead for those blocks. For 256-byte to 1-page blocks, 16 bits is acceptable; for multi-page blocks, the cost of storing a 32-bit integer is negligible.
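The arithmetic behind those field widths can be sketched as follows. This is only an illustration in C++, not code from the D runtime: the names bitsForLength and lengthFieldBits are hypothetical, and the 4096-byte page size is an assumption. A block of N bytes can hold any length from 0 to N inclusive, which is N + 1 distinct values, so a 16-byte block needs 5 bits rather than 4.

```cpp
#include <cstddef>

// Hypothetical helper: smallest number of bits that can represent every
// length in the closed range [0, blockSize], i.e. blockSize + 1 values.
constexpr unsigned bitsForLength(std::size_t blockSize) {
    unsigned bits = 0;
    // Keep adding bits until 2^bits exceeds blockSize.
    while ((std::size_t{1} << bits) <= blockSize) {
        ++bits;
    }
    return bits;
}

// Hypothetical helper: the storage width suggested in the message above.
// pageSize = 4096 is an assumption made for this sketch.
constexpr unsigned lengthFieldBits(std::size_t blockSize,
                                   std::size_t pageSize = 4096) {
    if (blockSize <= 128) return 8;        // small bins: one byte
    if (blockSize <= pageSize) return 16;  // 256 bytes up to one page
    return 32;                             // multi-page blocks
}

// 16-byte blocks must cover both 0 and 16, so 4 bits are not enough.
static_assert(bitsForLength(15) == 4, "0..15 fits in 4 bits");
static_assert(bitsForLength(16) == 5, "0..16 needs 5 bits");
// 128-byte blocks need exactly 8 bits; a full page still fits in 16.
static_assert(bitsForLength(128) == 8, "0..128 needs 8 bits");
static_assert(bitsForLength(4096) <= lengthFieldBits(4096), "page fits");
```

The checks run at compile time, so building the file is enough to confirm the numbers quoted in the message.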

Having access to the requested length is important at larger lengths, so probably we could be fine by not actually storing it for allocations up to 128 bytes.

It is true the lookup of the MRU cache will not involve dissecting the address of the block to find its container block, but you still will need a lock, or are you planning on doing something clever? I think the locking will be the bottleneck, and if you don't make it the same as the global lock, you add the cost of both locks when you actually need to append.

The cache is a thread-local map from pointers to size_t. Using it does not require any locking, I think.
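A minimal sketch of that idea, combined with the two-step lookup (cache first, then the GC), might look like the following. It is written in C++ for illustration and is not the D runtime's code: AppendCache, tlsCache and gcBlockTable are hypothetical names, gcBlockTable is a stand-in for the GC's own per-block metadata, and the 8 slots are taken from the "up to 8 arrays at a time" figure. The sketch only shows the lookup and leaves out updating the cached value after an append.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <unordered_map>

// Hypothetical stand-in for the GC's block metadata (the slow path).
// A real collector would reach this under its global lock.
std::unordered_map<const void*, std::size_t>& gcBlockTable() {
    static std::unordered_map<const void*, std::size_t> table;
    return table;
}

struct CacheEntry {
    const void* ptr;
    std::size_t capacity;
};

// Small most-recently-used cache mapping block pointers to sizes.
struct AppendCache {
    static constexpr std::size_t kSlots = 8;
    std::array<CacheEntry, kSlots> slots{};  // zero-initialized: all empty
    std::size_t hits = 0;
    std::size_t misses = 0;

    // Move the entry at index i to the front, shifting the others back.
    void moveToFront(std::size_t i) {
        CacheEntry e = slots[i];
        for (; i > 0; --i) {
            slots[i] = slots[i - 1];
        }
        slots[0] = e;
    }

    std::size_t capacityOf(const void* p) {
        if (p == nullptr) return 0;
        // a) fast path: the pointer is already in the cache
        for (std::size_t i = 0; i < kSlots; ++i) {
            if (slots[i].ptr == p) {
                ++hits;
                moveToFront(i);
                return slots[0].capacity;
            }
        }
        // b) slow path: ask the (stand-in) GC, then cache the answer,
        //    evicting the least recently used entry in the last slot
        ++misses;
        auto it = gcBlockTable().find(p);
        std::size_t cap = (it == gcBlockTable().end()) ? 0 : it->second;
        slots[kSlots - 1] = CacheEntry{p, cap};
        moveToFront(kSlots - 1);
        return cap;
    }
};

// One cache per thread: no other thread can touch it, so the fast path
// takes no lock.
thread_local AppendCache tlsCache;
```

Because tlsCache is declared thread_local, each thread gets its own instance; only the slow path, which goes to shared GC state, would need synchronization in a real implementation.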



Andrei
