[ https://issues.apache.org/jira/browse/HBASE-29012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wellington Chevreuil resolved HBASE-29012. ------------------------------------------ Resolution: Fixed Marking this as resolved since the merge of HBASE-28596 into branch-2.6. > Performance regression in hot reads after split/merge > ----------------------------------------------------- > > Key: HBASE-29012 > URL: https://issues.apache.org/jira/browse/HBASE-29012 > Project: HBase > Issue Type: Bug > Affects Versions: 3.0.0-beta-1, 2.6.1 > Reporter: Bryan Beaudreault > Priority: Major > > We noticed a significant performance regression which comes from HBASE-27474. > In that ticket, logic is added so that we don't cache blocks that exist > within a reference or a link if compactions are enabled. > The issue we noticed is that we had a cluster which had compactions enabled, > but compactions were a bit delayed. During the time, there were some regions > which were recently split/merged and they contained references. This cluster > is very hot reads and relies heavily on bloom filters. I noticed through > profiles that we were spending a lot of time fetching BLOOM_CHUNK blocks from > hdfs. This is almost never the case since we continually rightside the block > cache to ensure all blooms are cached. In fact, we had no evictions at the > time. So why weren't they getting cached? > With trace logging enabled I noticed that all of the blocks being read over > and over happened to come from hfiles that looked to be references. This led > me to the ticket in question. > This feels like a very serious regression, as it leads to substantantial > impact to both hdfs and hbase in terms of request times and GC time and the > host becomes fully hosed. I sort of wonder if we should revert that issue, or > at the very least make it configurable. I'm not sure how to preserve the > intended behavior of the ticket while also protecting the regionserver > performance. In our case this happened for bloom blocks, but it could just as > easily happen to a hot data block. -- This message was sent by Atlassian Jira (v8.20.10#820010)