[ 
https://issues.apache.org/jira/browse/HBASE-29203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ray Mattingly resolved HBASE-29203.
-----------------------------------
    Fix Version/s: 4.0.0-alpha-1
                   2.7.0
                   3.0.0-beta-2
     Release Note: This adds a StoreFileTableSkewCostFunction, which lets 
you balance each table by data size, often a better metric than region 
count. It works well even if you are balancing by cluster (the default 
balancer setup). It comes with a default cost of 35, equivalent to 
StorefileSizeCost.
       Resolution: Fixed

> There should be a StorefileSize equivalent to the TableSkewCost
> ---------------------------------------------------------------
>
>                 Key: HBASE-29203
>                 URL: https://issues.apache.org/jira/browse/HBASE-29203
>             Project: HBase
>          Issue Type: Improvement
>          Components: Balancer
>    Affects Versions: 2.7.0
>            Reporter: Ray Mattingly
>            Assignee: Ray Mattingly
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0-alpha-1, 2.7.0, 3.0.0-beta-2
>
>
> We don't want to rely on per-table balancing, but we do want tables to be 
> balanced effectively based on their data size. Regions vary in size, but 
> 1 GB is always 1 GB, so in a well-distributed schema, data size is a better 
> measure of a region's load than counting every region identically.
> Right now the TableSkewCostFunction is the balancer's mechanism for balancing 
> each table somewhat independently within a cluster-wide plan, but it only 
> looks at region counts. We could balance more effectively if we had a 
> StoreFileTableSkewCost.
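
To illustrate the difference between counting regions and measuring data size, here is a minimal sketch. It is not the HBase implementation: the class TableSkewSketch, the method skew, and the normalization used are assumptions made up for this example. It scores how unevenly one table is spread across servers, given whatever per-server quantity the caller passes in (region counts or storefile sizes).

```java
import java.util.Arrays;

/** Hypothetical illustration only; not HBase's StoreFileTableSkewCostFunction. */
final class TableSkewSketch {
  private TableSkewSketch() {}

  /**
   * Returns a skew in [0, 1] for a single table.
   * perServer[i] is what server i holds for this table: a region count or a
   * total storefile size, depending on what the caller wants to balance.
   * 0 means every server holds the same amount; 1 means one server holds it all.
   */
  static double skew(double[] perServer) {
    int n = perServer.length;
    double total = Arrays.stream(perServer).sum();
    if (n < 2 || total == 0.0) {
      return 0.0; // nothing to balance
    }
    double mean = total / n;
    double deviation = 0.0;
    for (double v : perServer) {
      deviation += Math.abs(v - mean);
    }
    // Largest possible deviation: one server holds the whole table.
    double worst = 2.0 * total * (n - 1) / n;
    return deviation / worst;
  }
}
```

For a table with two regions on each of two servers, skew(new double[] {2, 2}) is 0, so a count-based view sees nothing to fix. If those regions hold 20 GB on one server and 2 GB on the other, skew(new double[] {20, 2}) is roughly 0.82, which is the kind of imbalance a size-based table skew cost is meant to surface.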



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
