[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2022-06-11 Thread Andrew Kyle Purtell (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Kyle Purtell updated HBASE-14215:

Resolution: Abandoned
Status: Resolved  (was: Patch Available)

> Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
> ---
>
> Key: HBASE-14215
> URL: https://issues.apache.org/jira/browse/HBASE-14215
> Project: HBase
>  Issue Type: Bug
>  Components: Balancer
>Reporter: Biju Nair
>Priority: Minor
> Attachments: 14215-v1.txt
>
>
> Current multiplier of 500 used in the stochastic balancer cost function 
> {{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total 
> primary replication skew doesn't seem to be sufficient to prevent the skews 
> (Refer HBASE-14110). We would want the default cost to be a higher value so 
> that skews in primary region replica has higher cost. The following is the 
> test result by setting the multiplier value to 1 (same as the region 
> replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
> get the balancer distribute the primaries uniformly.
> *Initial Primary replica distribution - using the current multiplier* 
>  |r1n10|  102|
>  |r1n11|  85|
>  |r1n9|88|
>  |r2n10|  120|
>  |r2n11|  120|
>  |r2n9|   124|
>  |r3n10|  135|
>  |r3n11|  124|
>  |r3n9|129|
> *After long duration of read & writes - using current multiplier* 
> | r1n10|  102|
> | r1n11|  85|
> | r1n9|88|
> | r2n10|  120|
> | r2n11|  120|
> | r2n9 |   124|
> | r3n10|  135|
> | r3n11|  124|
> | r3n9|129|
> *After manual balancing*  
> | r1n10|  102|
> | r1n11|  85|
> | r1n9|88|
> | r2n10|  120|
> | r2n11|  120|
> | r2n9 |   124|
> | r3n10|  135|
> | r3n11|  124|
> | r3n9|129|
> *Increased multiplier for primaryRegionCountSkewCost to 1*
> | r1n10|  114|
> | r1n11 | 113|
> | r1n9 |   114|
> | r2n10|  114|
> | r2n11|  114|
> | r2n9 |   113|
> | r3n10|  115|
> | r3n11|  115|
> | r3n9 |   115 |
> Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
> should help HBase general use.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-09-01 Thread Biju Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Biju Nair updated HBASE-14215:
--
Description: 
Current multiplier of 500 used in the stochastic balancer cost function 
{{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 |r1n10|  102|
 |r1n11|  85|
 |r1n9|88|
 |r2n10|  120|
 |r2n11|  120|
 |r2n9|   124|
 |r3n10|  135|
 |r3n11|  124|
 |r3n9|129|
*After long duration of read & writes - using current multiplier*   
| r1n10|  102|
| r1n11|  85|
| r1n9|88|
| r2n10|  120|
| r2n11|  120|
| r2n9 |   124|
| r3n10|  135|
| r3n11|  124|
| r3n9|129|
*After manual balancing*
| r1n10|  102|
| r1n11|  85|
| r1n9|88|
| r2n10|  120|
| r2n11|  120|
| r2n9 |   124|
| r3n10|  135|
| r3n11|  124|
| r3n9|129|
*Increased multiplier for primaryRegionCountSkewCost to 1*  
| r1n10|  114|
| r1n11 | 113|
| r1n9 |   114|
| r2n10|  114|
| r2n11|  114|
| r2n9 |   113|
| r3n10|  115|
| r3n11|  115|
| r3n9 |   115 |

Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
should help HBase general use.

  was:
Current multiplier of 500 used in the stochastic balancer cost function 
{{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 |r1n10|  102|
 |r1n11|  85|
 |r1n9|88|
 |r2n10|  120|
 |r2n11|  120|
 |r2n9|   124|
 |r3n10|  135|
 |r3n11|  124|
 |r3n9|129|
*After long duration of read & writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
should help HBase general use.


> Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
> ---
>
> Key: HBASE-14215
> URL: https://issues.apache.org/jira/browse/HBASE-14215
> Project: HBase
>  Issue Type: Bug
>  Components: Balancer
>Reporter: Biju Nair
>Priority: Minor
> Attachments: 14215-v1.txt
>
>
> Current multiplier of 500 used in the stochastic balancer cost function 
> {{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total 
> primary replication skew doesn't seem to be sufficient to prevent the skews 
> (Refer HBASE-14110). We would want the default cost to be a higher value so 
> that skews in primary region replica has higher cost. The following is the 
> test result by setting the multiplier value to 1 (same as the region 
> replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
> get the balancer distribute the primaries uniformly.
> *Initial Primary replica distribution - using the current multiplier* 
>  |r1n10|  102|
>  |r1n11|  85|
>  |r1n9|88|
>  |r2n10|  120|
>  |r2n11|  120|
>  |r2n9|   124|
>  |r3n10|  135|
>  |r3n11|  124|
>  |r3n9|129|
> *After long duration of read & writes - using current multiplier* 
> | r1n10|  102|
> | r1n11|  85|
> | r1n9|88|
> | r2n10|  120|
> | r2n11|  120|
> | r2n9 |   124|
> | r3n10|  135|
> | r3n11|  124|
> | r3n9|129|
> *After manual balancing*  
> | r1n10|  102|
> | r1n11|  85|
> | r1n9|88|
> | r2n10|  120|
> | r2n11|  120|
> | r2n9 |   124|
> | r3n10|  135|
> | r3n11|  124|
> | r3n9|129|
> *Increased multiplier for primaryRegionCountSkewCost to 1*
> | r1n10|  114|
> | r1n11 | 113|
> | r1n9 |   114|
> | r2n10|  114|
> | r2n11|  114|
> | r2n9 |   113|
> | r3n10|  115|
> | r3n11|  115|
> 

[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-09-01 Thread Biju Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Biju Nair updated HBASE-14215:
--
Description: 
Current multiplier of 500 used in the stochastic balancer cost function 
{{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 |r1n10|  102|
 |r1n11|  85|
 |r1n9|88|
 |r2n10|  120|
 |r2n11|  120|
 |r2n9|   124|
 |r3n10|  135|
 |r3n11|  124|
 |r3n9|129|
*After long duration of read & writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
should help HBase general use.

  was:
Current multiplier of 500 used in the stochastic balancer cost function 
{{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
*After long duration of read & writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
should help HBase general use.


> Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
> ---
>
> Key: HBASE-14215
> URL: https://issues.apache.org/jira/browse/HBASE-14215
> Project: HBase
>  Issue Type: Bug
>  Components: Balancer
>Reporter: Biju Nair
>Priority: Minor
> Attachments: 14215-v1.txt
>
>
> Current multiplier of 500 used in the stochastic balancer cost function 
> {{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total 
> primary replication skew doesn't seem to be sufficient to prevent the skews 
> (Refer HBASE-14110). We would want the default cost to be a higher value so 
> that skews in primary region replica has higher cost. The following is the 
> test result by setting the multiplier value to 1 (same as the region 
> replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
> get the balancer distribute the primaries uniformly.
> *Initial Primary replica distribution - using the current multiplier* 
>  |r1n10|  102|
>  |r1n11|  85|
>  |r1n9|88|
>  |r2n10|  120|
>  |r2n11|  120|
>  |r2n9|   124|
>  |r3n10|  135|
>  |r3n11|  124|
>  |r3n9|129|
> *After long duration of read & writes - using current multiplier* 
>  r1n10  102
>  r1n11  85
>  r1n988
>  r2n10  120
>  r2n11  120
>  r2n9124
>  r3n10  135
>  r3n11  124
>  r3n9129
> *After manual balancing*  
>  r1n10  102
>  r1n11  85
>  r1n988
>  r2n10  120
>  r2n11  120
>  r2n9124
>  r3n10  135
>  r3n11  124
>  r3n9129
> *Increased multiplier for primaryRegionCountSkewCost to 1*
>  r1n10  114
>  r1n11  113
>  r1n9114
>  r2n10  114
>  r2n11  114
>  r2n9113
>  r3n10  115
>  r3n11  115
>  r3n9115 
> Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
> should help HBase general use.



--
This message was sent by Atlassian JIRA

[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-08-19 Thread Biju Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Biju Nair updated HBASE-14215:
--
Description: 
Current multiplier of 500 used in the stochastic balancer cost function 
{{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
*After long duration of read  writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
should help HBase general use.

  was:
Current multiplier of 500 used in the stochastic balancer cost function 
``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
*After long duration of read  writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
should help HBase general use.


 Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
 ---

 Key: HBASE-14215
 URL: https://issues.apache.org/jira/browse/HBASE-14215
 Project: HBase
  Issue Type: Bug
  Components: Balancer
Reporter: Biju Nair
Priority: Minor
 Attachments: 14215-v1.txt


 Current multiplier of 500 used in the stochastic balancer cost function 
 {{PrimaryRegionCountSkewCostFunction}} to calculate the cost of  total 
 primary replication skew doesn't seem to be sufficient to prevent the skews 
 (Refer HBASE-14110). We would want the default cost to be a higher value so 
 that skews in primary region replica has higher cost. The following is the 
 test result by setting the multiplier value to 1 (same as the region 
 replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
 get the balancer distribute the primaries uniformly.
 *Initial Primary replica distribution - using the current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9   124
  r3n10  135
  r3n11  124
  r3n9129
 *After long duration of read  writes - using current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *After manual balancing*  
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *Increased multiplier for primaryRegionCountSkewCost to 1*
  r1n10  114
  r1n11  113
  r1n9114
  r2n10  114
  r2n11  114
  r2n9113
  r3n10  115
  r3n11  115
  r3n9115 
 Setting the {{PrimaryRegionCountSkewCostFunction}} multiplier value to 1 
 should help HBase general use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-08-11 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-14215:
---
Status: Patch Available  (was: Open)

 Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
 ---

 Key: HBASE-14215
 URL: https://issues.apache.org/jira/browse/HBASE-14215
 Project: HBase
  Issue Type: Bug
  Components: Balancer
Reporter: Biju Nair
Priority: Minor
 Attachments: 14215-v1.txt


 Current multiplier of 500 used in the stochastic balancer cost function 
 ``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total 
 primary replication skew doesn't seem to be sufficient to prevent the skews 
 (Refer HBASE-14110). We would want the default cost to be a higher value so 
 that skews in primary region replica has higher cost. The following is the 
 test result by setting the multiplier value to 1 (same as the region 
 replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
 get the balancer distribute the primaries uniformly.
 *Initial Primary replica distribution - using the current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9   124
  r3n10  135
  r3n11  124
  r3n9129
 *After long duration of read  writes - using current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *After manual balancing*  
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *Increased multiplier for primaryRegionCountSkewCost to 1*
  r1n10  114
  r1n11  113
  r1n9114
  r2n10  114
  r2n11  114
  r2n9113
  r3n10  115
  r3n11  115
  r3n9115 
 Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
 should help HBase general use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-08-11 Thread Biju Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Biju Nair updated HBASE-14215:
--
Description: 
Current multiplier of 500 used in the stochastic balancer cost function 
``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

**Initial Primary replica distribution - using the current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
**After long duration of read  writes - using current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**After manual balancing**  
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**Increased multiplier for primaryRegionCountSkewCost to 1**
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
should help HBase general use.

  was:
Current multiplier of 500 used in the stochastic balancer cost function 
`PrimaryRegionCountSkewCostFunction` to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

**Initial Primary replica distribution - using the current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
**After long duration of read  writes - using current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**After manual balancing**  
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**Increased multiplier for primaryRegionCountSkewCost to 1**
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
should help HBase general use.


 Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
 ---

 Key: HBASE-14215
 URL: https://issues.apache.org/jira/browse/HBASE-14215
 Project: HBase
  Issue Type: Bug
  Components: Balancer
Reporter: Biju Nair
Priority: Minor

 Current multiplier of 500 used in the stochastic balancer cost function 
 ``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total 
 primary replication skew doesn't seem to be sufficient to prevent the skews 
 (Refer HBASE-14110). We would want the default cost to be a higher value so 
 that skews in primary region replica has higher cost. The following is the 
 test result by setting the multiplier value to 1 (same as the region 
 replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
 get the balancer distribute the primaries uniformly.
 **Initial Primary replica distribution - using the current multiplier**   
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9   124
  r3n10  135
  r3n11  124
  r3n9129
 **After long duration of read  writes - using current multiplier**   
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 **After manual balancing**
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 **Increased multiplier for primaryRegionCountSkewCost to 1**  
  r1n10  114
  r1n11  113
  r1n9114
  r2n10  114
  r2n11  114
  r2n9113
  r3n10  115
  r3n11  115
  r3n9115 
 Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
 should help HBase general use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-08-11 Thread Biju Nair (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Biju Nair updated HBASE-14215:
--
Description: 
Current multiplier of 500 used in the stochastic balancer cost function 
``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

*Initial Primary replica distribution - using the current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
*After long duration of read  writes - using current multiplier*   
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*After manual balancing*
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
*Increased multiplier for primaryRegionCountSkewCost to 1*  
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
should help HBase general use.

  was:
Current multiplier of 500 used in the stochastic balancer cost function 
``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total primary 
replication skew doesn't seem to be sufficient to prevent the skews (Refer 
HBASE-14110). We would want the default cost to be a higher value so that skews 
in primary region replica has higher cost. The following is the test result by 
setting the multiplier value to 1 (same as the region replica rack cost 
multiplier) on a 3 Rack 9 RS node cluster which seems to get the balancer 
distribute the primaries uniformly.

**Initial Primary replica distribution - using the current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9   124
 r3n10  135
 r3n11  124
 r3n9129
**After long duration of read  writes - using current multiplier** 
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**After manual balancing**  
 r1n10  102
 r1n11  85
 r1n988
 r2n10  120
 r2n11  120
 r2n9124
 r3n10  135
 r3n11  124
 r3n9129
**Increased multiplier for primaryRegionCountSkewCost to 1**
 r1n10  114
 r1n11  113
 r1n9114
 r2n10  114
 r2n11  114
 r2n9113
 r3n10  115
 r3n11  115
 r3n9115 

Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
should help HBase general use.


 Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
 ---

 Key: HBASE-14215
 URL: https://issues.apache.org/jira/browse/HBASE-14215
 Project: HBase
  Issue Type: Bug
  Components: Balancer
Reporter: Biju Nair
Priority: Minor

 Current multiplier of 500 used in the stochastic balancer cost function 
 ``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total 
 primary replication skew doesn't seem to be sufficient to prevent the skews 
 (Refer HBASE-14110). We would want the default cost to be a higher value so 
 that skews in primary region replica has higher cost. The following is the 
 test result by setting the multiplier value to 1 (same as the region 
 replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
 get the balancer distribute the primaries uniformly.
 *Initial Primary replica distribution - using the current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9   124
  r3n10  135
  r3n11  124
  r3n9129
 *After long duration of read  writes - using current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *After manual balancing*  
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *Increased multiplier for primaryRegionCountSkewCost to 1*
  r1n10  114
  r1n11  113
  r1n9114
  r2n10  114
  r2n11  114
  r2n9113
  r3n10  115
  r3n11  115
  r3n9115 
 Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
 should help HBase general use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HBASE-14215) Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient

2015-08-11 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-14215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-14215:
---
Attachment: 14215-v1.txt

 Default cost used for PrimaryRegionCountSkewCostFunction is not sufficient 
 ---

 Key: HBASE-14215
 URL: https://issues.apache.org/jira/browse/HBASE-14215
 Project: HBase
  Issue Type: Bug
  Components: Balancer
Reporter: Biju Nair
Priority: Minor
 Attachments: 14215-v1.txt


 Current multiplier of 500 used in the stochastic balancer cost function 
 ``PrimaryRegionCountSkewCostFunction`` to calculate the cost of  total 
 primary replication skew doesn't seem to be sufficient to prevent the skews 
 (Refer HBASE-14110). We would want the default cost to be a higher value so 
 that skews in primary region replica has higher cost. The following is the 
 test result by setting the multiplier value to 1 (same as the region 
 replica rack cost multiplier) on a 3 Rack 9 RS node cluster which seems to 
 get the balancer distribute the primaries uniformly.
 *Initial Primary replica distribution - using the current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9   124
  r3n10  135
  r3n11  124
  r3n9129
 *After long duration of read  writes - using current multiplier* 
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *After manual balancing*  
  r1n10  102
  r1n11  85
  r1n988
  r2n10  120
  r2n11  120
  r2n9124
  r3n10  135
  r3n11  124
  r3n9129
 *Increased multiplier for primaryRegionCountSkewCost to 1*
  r1n10  114
  r1n11  113
  r1n9114
  r2n10  114
  r2n11  114
  r2n9113
  r3n10  115
  r3n11  115
  r3n9115 
 Setting the `PrimaryRegionCountSkewCostFunction` multiplier value to 1 
 should help HBase general use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)