On Tue, 20 Oct 2009 10:05:42 -0400, Steven Schveighoffer <schvei...@yahoo.com> wrote:

On Mon, 19 Oct 2009 22:37:26 -0400, dsimcha <dsim...@yahoo.com> wrote:

== Quote from Andrei Alexandrescu (seewebsiteforem...@erdani.org)'s article
dsimcha wrote:
> == Quote from Andrei Alexandrescu (seewebsiteforem...@erdani.org)'s article
>> dsimcha wrote:
>>> == Quote from Andrei Alexandrescu (seewebsiteforem...@erdani.org)'s article
>>>> dsimcha wrote:
>>>>> Started playing w/ the implementation a little and I see a problem.  What about
>>>>> the garbage collector?  There are two possibilities:
>>>> [snip]
>>>>> The only possible solutions I see would be to have the GC know everything about
>>>>> the LRU cache and evict stale entries (probably slows down GC a lot, a huge PITA
>>>>> to implement, couples things that shouldn't be tightly coupled), or clear the
>>>>> cache every time GC is run (probably would make appending so slow as to defeat the
>>>>> purpose of having the cache).
>>>> I think GC.collect may simply evict the entire cache. The collection
>>>> cycle costs so much, the marginal cost of losing cached information is
>>>> lost in the noise.
>>>> Andrei
>>> But then you have to copy the whole array again, likely triggering another GC if
>>> the array is large.  Then things really get ugly as, for all practical purposes,
>>> you've completely done away with the cache.
>> This happens whether or not a cache is in use.
>> Andrei
>
> But the array isn't guaranteed to get reallocated immediately after *every* GC
> run.  If you're appending to a huge array, the GC will likely run several times
> while you're appending, leading to several unnecessary reallocations.
I don't think I understand this.
1. Request for an append comes that runs out of memory
2. GC runs and clears memory
3. Array is reallocated and the capacity cached.
No?

This is entirely correct.
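
In rough code (purely a sketch; capCache below is a plain associative array standing in for the LRU, and none of these names come from the actual patch), the flow is:

size_t[void*] capCache;   // block pointer -> cached capacity, stand-in for the LRU

void appendOne(ref int[] a, int value)
{
   auto cached = cast(void*) a.ptr in capCache;
   if (cached && a.length < *cached)
   {
      // Cache hit with spare room: extend the slice in place, no GC involved.
      a = a.ptr[0 .. a.length + 1];
      a[$ - 1] = value;
   }
   else
   {
      // Steps 1 and 2: the append has run out of known room, so a bigger block
      // is allocated and the data copied; that allocation may trigger a collection.
      auto bigger = new int[](a.length ? a.length * 2 : 4);
      bigger[0 .. a.length] = a[];
      bigger[a.length] = value;
      capCache.remove(cast(void*) a.ptr);
      a = bigger[0 .. a.length + 1];
      // Step 3: the new block and its capacity are cached again.
      capCache[cast(void*) a.ptr] = bigger.length;
   }
}

With that picture, "the GC clears the cache" just means the first append to each array after a collection is forced down the reallocate-and-copy branch.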

> Each of those unnecessary reallocations will increase the memory footprint of your
> program, possibly triggering another GC run and wiping out your cache again in
> short order, until, for sufficiently large arrays,
>
> a ~= b;
>
> is almost equivalent to
>
> a = a ~ b;
I don't understand how the cache makes that all worse.
Andrei

The cache doesn't make anything *worse* than with no cache. The only point I'm trying to make is that, for large arrays, if the GC clears the cache every time it runs, things would start to get *almost as bad as* having no cache because the
copy operations become expensive and the GC may run frequently.
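
To make the degenerate case concrete (assuming, as proposed above, that a collection evicts every cache entry):

int[] big;
foreach (i; 0 .. 10_000_000)
{
   big ~= i;
   // Whenever this append's allocation triggers a collection, the cache is
   // emptied, so the next append can't prove there is spare capacity and has
   // to reallocate and copy all of big, roughly the cost of big = big ~ [i].
}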

The cache can't be "cleared" every time, or else you might as well only keep one LRU entry:

int[] twos, threes;

for(int i = 1; i < 10000; i++)
{
   twos ~= i * 2;
   threes ~= i * 3;
}

At some point, twos or threes needs an allocation triggering a collection, and that clears the cache, making the other array need an allocation, clearing the cache, etc.

I'd think you only want to clear the entries affected by the collection.

-Steve

If it were free and simple to clear only the affected entries, sure. But doing so requires (very heavy?) modification of the GC in order to track and check changes, and it also reduces collection performance. I think that if GC allocations added entries to the LRU, so that the information in the LRU is never stale, you could avoid clearing the LRU. But this requires the LRU to be part of the GC.
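
Roughly, the idea is something like the sketch below. onAlloc and onSweep are hypothetical hooks, not existing druntime APIs; entries are created at allocation time and evicted only when their block is actually swept, which also covers only clearing the affected entries:

struct Entry
{
   void*  block;      // start of the GC block the array lives in
   size_t capacity;   // usable capacity of that block, in elements
}

Entry[8] lru;         // small fixed-size cache, most recently used first
size_t count;

// Called by the GC when it hands out a block: the entry is created here, so the
// cache can never hold information about a block the GC doesn't know about.
// (A real version would also move an existing entry for block to the front.)
void onAlloc(void* block, size_t capacity)
{
   auto n = count < lru.length ? count : lru.length - 1;
   foreach_reverse (i; 0 .. n)   // shift older entries down, dropping the oldest
      lru[i + 1] = lru[i];
   lru[0] = Entry(block, capacity);
   if (count < lru.length)
      ++count;
}

// Called by the GC while sweeping: only entries whose block actually died are
// evicted; arrays that survive the collection keep their cached capacity.
void onSweep(void* freedBlock)
{
   foreach (ref e; lru[0 .. count])
      if (e.block is freedBlock)
         e = Entry.init;
}

The per-block check in onSweep is where the extra collection cost mentioned above would show up.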
