Thanks Anoop/Stack. With L1+L2, there is an overhead during getBlock when the block needs to be cached in L1. Since evictions are done in a separate thread once thresholds are reached, the overhead during evictions can be discounted. Based on a quick test, the overhead looks to be in the range of hundreds of nanoseconds to 500 microseconds. Agreed that from a performance perspective it is better to use off-heap directly for data blocks instead of going through L1. The L1/L2 terminology can probably be dropped at some point.
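To show roughly where that per-getBlock overhead comes from, here is a toy, standalone sketch of the promotion-on-read pattern being discussed. Plain maps stand in for the real LruBlockCache/BucketCache classes; the class and method names are illustrative only, not the actual HBase code.

import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class PromotionOverheadSketch {
  // Plain maps stand in for the real L1 (LruBlockCache) and L2 (BucketCache).
  private final Map<String, byte[]> l1 = new ConcurrentHashMap<>();
  private final Map<String, byte[]> l2 = new ConcurrentHashMap<>();

  byte[] getBlock(String key) {
    byte[] block = l1.get(key);
    if (block != null) {
      return block;                      // L1 hit: no extra work on the read path
    }
    block = l2.get(key);
    if (block != null) {
      long start = System.nanoTime();
      l1.put(key, block);                // promote into L1; this is the per-getBlock overhead
      long promoteNs = System.nanoTime() - start;
      System.out.println("promotion took " + promoteNs + " ns");
    }
    return block;
  }

  public static void main(String[] args) {
    PromotionOverheadSketch cache = new PromotionOverheadSketch();
    cache.l2.put("block-1", new byte[64 * 1024]); // pretend this block is already in L2
    cache.getBlock("block-1");                    // L2 hit, promoted into L1, overhead printed
    cache.getBlock("block-1");                    // subsequent reads are L1 hits
  }
}

When data blocks go straight to the off-heap cache, that promotion step simply disappears from the read path.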
From the LruBlockCache code [1], when a cacheBlock request is made and the block is already in the cache, there is logic to compare the contents. Since the off-heap cache will be used directly for data blocks, do we want to include the same logic in BucketCache [2]? (A rough sketch of the kind of check I mean is at the bottom of this mail.)

[1] https://github.com/apache/hbase/blob/branch-1.1/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/LruBlockCache.java#L330-L335
[2] https://github.com/apache/hbase/blob/branch-1.1/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/bucket/BucketCache.java#L362-L364

On Thu, Aug 17, 2017 at 9:34 PM, Anoop John <anoop.hb...@gmail.com> wrote:
> Yes I would also say the current way is better. Especially after the
> off-heap read path improvements, we are at almost the same perf from bucket
> cache compared to LRU cache. So it would be better to work with a
> small Java heap and small L1 cache size. (This is not strictly L1 vs L2,
> still I call it that way.) And we will get a large L2 off-heap cache
> size. The data blocks are not moving around between L1 and L2 but
> strictly go into and out of L2. The index and bloom blocks alone
> go into L1. I believe we should make this the default for 2.0
>
> -Anoop-
>
> On Fri, Aug 18, 2017 at 1:12 AM, Stack <st...@duboce.net> wrote:
> > Some more info, when COMBINED=false, this is what happens:
> >
> > // L1 and L2 are not 'combined'. They are connected via the LruBlockCache victimhandler
> > // mechanism. It is a little ugly but works according to the following: when the
> > // background eviction thread runs, blocks evicted from L1 will go to L2 AND when we get
> > // a block from the L1 cache, if not in L1, we will search L2.
> >
> > For me, I'd be interested in seeing a perf compare. IIRC, when NOT combined,
> > data blocks coming up into L1 and then being 'victim handled' -- evicted ==
> > copied -- out to L2 was costly.
> >
> > St.Ack
> >
> > On Thu, Aug 17, 2017 at 11:23 AM, Biju N <bijuatapa...@gmail.com> wrote:
> >
> >> Currently BUCKET_CACHE_COMBINED_KEY is set to "true" by default [1] which
> >> makes the L2 cache not strictly an L2 cache. From the usability perspective,
> >> it is better to set BUCKET_CACHE_COMBINED_KEY to "false" so that the L2
> >> cache would behave strictly as L2 and also use the L1 cache to store data
> >> blocks, improving memory use. Thoughts?
> >>
> >> Thanks,
> >> Biju
> >>
> >> [1] https://github.com/apache/hbase/blob/84d7318f86305f34102502a70d718223320590d5/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/CacheConfig.java#L112
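For reference, here is a minimal standalone sketch of the kind of duplicate-cache check I have in mind for the BucketCache question above. It uses plain Java maps and byte[] payloads; it is not the actual LruBlockCache or BucketCache code, only the shape of the check.

import java.util.Arrays;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class DuplicateCacheCheckSketch {
  private final Map<String, byte[]> backingMap = new ConcurrentHashMap<>();

  void cacheBlock(String key, byte[] block) {
    byte[] existing = backingMap.get(key);
    if (existing != null) {
      // Same key cached twice: verify the contents match instead of silently
      // overwriting; a mismatch points at a correctness bug upstream.
      if (!Arrays.equals(existing, block)) {
        throw new IllegalStateException("Cached block contents differ for key " + key);
      }
      return; // identical re-cache, nothing to do
    }
    backingMap.put(key, block);
  }

  public static void main(String[] args) {
    DuplicateCacheCheckSketch cache = new DuplicateCacheCheckSketch();
    cache.cacheBlock("block-1", new byte[] {1, 2, 3});
    cache.cacheBlock("block-1", new byte[] {1, 2, 3}); // fine: identical content
    cache.cacheBlock("block-1", new byte[] {9, 9, 9}); // throws: content mismatch
  }
}

Whether the mismatch should throw, log, or just bump a metric in BucketCache is of course open for discussion.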