Steven Schveighoffer wrote:
On Mon, 19 Oct 2009 14:51:32 -0400, Andrei Alexandrescu
<seewebsiteforem...@erdani.org> wrote:
I just wrote this to Sean and Walter and subsequently discussed it
with Walter. Walter thinks this should work. Does anyone have the time
and inclination to test this out? It would involve hacking into
druntime's implementation of ~= (I'm not sure what the function name
is). I'd really appreciate this; I'm overloaded as it is.
==================
In the wake of the recent demise of T[new], I was thinking of finding ways
of making ~= efficient for T[].
Currently ~= is slow because each call to GC.sizeOf(void*) acquires a
global lock and generally must figure out a lot of things about the
pointer before it can make a decision.
Also, ~= is dangerous because it allows slices to stomp over other
slices.
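To see the stomping hazard concretely, here is a toy simulation of the memory model in Python (illustrative names only; this is not druntime code): two slices share one allocation, and an unchecked in-place append through the shorter slice overwrites an element the longer slice still considers live.

```python
# Toy model of a GC block with spare capacity, and D-style slices into it.
# Slice and block are illustrative names, not druntime identifiers.

block = [10, 20, 30, 40, None, None]  # allocation: length 4, capacity 6

class Slice:
    """A pointer/length pair into a shared block, like a D array slice."""
    def __init__(self, block, start, length):
        self.block, self.start, self.length = block, start, length

    def append_in_place(self, value):
        # An unchecked in-place "~=": safe only if nothing lives past the end.
        self.block[self.start + self.length] = value
        self.length += 1

    def __getitem__(self, i):
        return self.block[self.start + i]

a = Slice(block, 0, 4)   # a = block[0 .. 4]
b = Slice(block, 0, 2)   # b = a[0 .. 2], shares the same memory

b.append_in_place(99)    # "b ~= 99" grows in place...
print(a[2])              # ...and stomps a[2], which was 30
```

The append through b looks harmless from b's point of view, which is exactly why ~= needs capacity information to decide between growing in place and reallocating.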
I was thinking of solving these issues by keeping an LRU (Least
Recently Used) cache inside the implementation of ~=. The LRU would
only have a few entries (4-8) and would store the parameters of the
last few ~= calls along with their cached capacities.
So whenever code calls arr ~= b, the LRU is searched first. If the
system finds "arr" (both bounds) in the LRU, that means the cached
capacity is correct and can solve the matter without an actual trip to
the GC at all! Otherwise, do the deed and cache the new slice and the
new capacity.
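As a rough sketch of that fast path (a Python simulation with hypothetical names, not the actual druntime hook): the cache maps a slice's end position to its known capacity, so a hit appends without any trip to the GC, and each append refreshes the cache with the grown slice.

```python
# Sketch of the proposed LRU fast path for "arr ~= value".
# gc_capacity() stands in for the slow, globally locked GC.sizeOf query.
from collections import OrderedDict

CACHE_SIZE = 8
lru = OrderedDict()              # (block id, end index) -> cached capacity
gc_queries = 0

def gc_capacity(arr):
    """Models the expensive GC lookup behind a global lock."""
    global gc_queries
    gc_queries += 1
    return arr["capacity"]

def make_array(n):
    return {"data": list(range(n)), "capacity": max(8, n)}

def append(arr, value):
    key = (id(arr), len(arr["data"]))      # identifies this exact slice end
    if key in lru:
        cap = lru.pop(key)                 # cache hit: no trip to the GC
    else:
        cap = gc_capacity(arr)             # cache miss: the expensive path
    if len(arr["data"]) == cap:
        cap *= 2                           # models reallocation on growth
        arr["capacity"] = cap
    arr["data"].append(value)
    lru[(id(arr), len(arr["data"]))] = cap # cache the new slice and capacity
    if len(lru) > CACHE_SIZE:
        lru.popitem(last=False)            # evict the least recently used

a = make_array(4)
append(a, 42)      # miss: one GC query
append(a, 52)      # hit: the slice's end matches the cached entry
append(a, 62)      # hit again
print(gc_queries)  # -> 1
```

A run of consecutive appends to the same array pays for one GC lookup and then stays on the fast path.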
This also solves the lack of safety: if you request growth on an
array you just grew, it's impossible for a valid slice to exist beyond
the end of that array.
This LRU would allow us to keep the slice API as it currently is, while
also achieving excellent efficiency.
What do you think?
This is a very good idea. Incidentally, you only need the upper bound
location; the beginning location is irrelevant, since you don't grow
down.
Awesome, didn't think of that. So now more cases are caught:
    auto a = new int[100];
    a ~= 42;
    a = a[50 .. $];
    a ~= 52;
That wouldn't have worked with my original suggestion, but it does work
safely with yours.
What do you do in the case where the memory was recycled? Does a
GC collection cycle clean out the cache as well?
As you saw, there was some discussion about that as well.
This is better than my two previous ideas. The only drawback I see is
that if you have many threads appending, or you are appending to more
than 8 arrays at once in round-robin fashion, you lose all the benefit
(although it shouldn't affect correctness). At that point, however,
you'd have to ask yourself why you aren't using a specialized appender
type or function.
Yah. As I suspect a lot of code naturally appends in round-robin order,
I'm considering a random eviction strategy instead; that way performance
will degrade more smoothly. A more advanced algorithm would use
introspection to choose dynamically between LRU and random eviction.
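The degradation Steven describes, and the effect of random eviction, can be sketched with a small simulation (a hypothetical Python model, not a measurement of the real runtime): round-robin appends across 10 arrays with an 8-entry cache miss on every lookup under strict LRU, while random eviction lets some entries survive.

```python
# Hit rate of the append cache under round-robin appends to n_arrays,
# comparing strict LRU eviction with random eviction.
import random

def simulate(policy, n_arrays=10, cache_size=8, rounds=1000, seed=0):
    rng = random.Random(seed)
    cache = []                   # keys, least recently used first
    hits = 0
    lengths = [0] * n_arrays
    for _ in range(rounds):
        for i in range(n_arrays):
            key = (i, lengths[i])           # array identity + current end
            if key in cache:
                hits += 1
                cache.remove(key)           # re-inserted below as most recent
            elif len(cache) >= cache_size:
                if policy == "lru":
                    cache.pop(0)            # evict least recently used
                else:
                    cache.pop(rng.randrange(len(cache)))  # evict at random
            lengths[i] += 1
            cache.append((i, lengths[i]))   # cache the grown slice
    return hits / (rounds * n_arrays)

print(simulate("lru"))                       # -> 0.0: every lookup misses
print(simulate("random") > simulate("lru"))  # -> True
```

With 10 arrays cycling through an 8-entry cache, LRU evicts each entry just before it would be reused, so the hit rate collapses to zero; random eviction keeps a fraction of lookups on the fast path, which is the smoother degradation in question.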
Andrei