[ 
https://issues.apache.org/jira/browse/HBASE-29012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wellington Chevreuil resolved HBASE-29012.
------------------------------------------
    Resolution: Fixed

Marking this as resolved since the merge of HBASE-28596 into branch-2.6.

> Performance regression in hot reads after split/merge
> -----------------------------------------------------
>
>                 Key: HBASE-29012
>                 URL: https://issues.apache.org/jira/browse/HBASE-29012
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 3.0.0-beta-1, 2.6.1
>            Reporter: Bryan Beaudreault
>            Priority: Major
>
> We noticed a significant performance regression which comes from HBASE-27474. 
> In that ticket, logic is added so that we don't cache blocks that exist 
> within a reference or a link if compactions are enabled.
> The issue we noticed is that we had a cluster which had compactions enabled, 
> but compactions were a bit delayed. During the time, there were some regions 
> which were recently split/merged and they contained references. This cluster 
> is very hot reads and relies heavily on bloom filters. I noticed through 
> profiles that we were spending a lot of time fetching BLOOM_CHUNK blocks from 
> hdfs. This is almost never the case since we continually rightside the block 
> cache to ensure all blooms are cached. In fact, we had no evictions at the 
> time. So why weren't they getting cached?
> With trace logging enabled I noticed that all of the blocks being read over 
> and over happened to come from hfiles that looked to be references. This led 
> me to the ticket in question.
> This feels like a very serious regression, as it leads to substantantial 
> impact to both hdfs and hbase in terms of request times and GC time and the 
> host becomes fully hosed. I sort of wonder if we should revert that issue, or 
> at the very least make it configurable. I'm not sure how to preserve the 
> intended behavior of the ticket while also protecting the regionserver 
> performance. In our case this happened for bloom blocks, but it could just as 
> easily happen to a hot data block.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to