[jira] [Created] (HBASE-16958) Balancer recomputes block distributions every time balanceCluster() runs

Gary Helmling (JIRA) Thu, 27 Oct 2016 13:50:10 -0700

Gary Helmling created HBASE-16958:
-------------------------------------

             Summary: Balancer recomputes block distributions every time 
balanceCluster() runs
                 Key: HBASE-16958
                 URL: https://issues.apache.org/jira/browse/HBASE-16958
             Project: HBase
          Issue Type: Bug
          Components: Balancer
            Reporter: Gary Helmling
            Assignee: Gary Helmling
             Fix For: 1.3.0



The change in HBASE-16570 modified the balancer to compute block distributions 
in parallel with a pool of 5 threads.  However, because it does this every time 
Cluster is instantiated, it effectively bypasses the cache of block locations 
added in HBASE-14473:

In the LoadBalancer.balanceCluster() implementations (in 
StochasticLoadBalancer, SimpleLoadBalancer), we create a new Cluster instance.

In Cluster.<init>, we call registerRegion() on every HRegionInfo.

In registerRegion(), we do the following:
{code}
        regionLocationFutures.set(regionIndex,
            regionFinder.asyncGetBlockDistribution(region));
{code}

Then, back in Cluster.<init> we do a get() on each ListenableFuture in a loop.

So while we are doing the calls to get block locations in parallel with 5 
threads, we're recomputing them every time balanceCluster() is called and not 
taking advantage of the cache at all.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-16958) Balancer recomputes block distributions every time balanceCluster() runs

Reply via email to