[ https://issues.apache.org/jira/browse/HBASE-18215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16064378#comment-16064378 ]
chenxu commented on HBASE-18215: -------------------------------- upload a patch for #4, [~toffer] > some advises about refactoring of rsgroup > ----------------------------------------- > > Key: HBASE-18215 > URL: https://issues.apache.org/jira/browse/HBASE-18215 > Project: HBase > Issue Type: Umbrella > Components: Balancer > Reporter: chenxu > Attachments: HBASE-18215-1.2.4-v1.patch > > > recently we have Integrated rsgroup into our cluster, after Integrated, > found some refactoring points. maybe the points were not right, but i think > there is a need to share with you guys. > # when hbase.balancer.tablesOnMaster configured, RSGroupBasedLoadBalancer > should consider masterServer assignment first in balanceCluster, > roundRobinAssignment, retainAssignment and randomAssignment > do the same thing as BaseLoadBalancer > # why not use a local file as the persistence layer instead of rsgroup table. > in our implementation, we first modify the local rsgroup file, then load the > group info into memory, after that execute the balancer command, everything > is OK. > when loading do some sanity check: > (1) one server can not be owned by multi group > (2) one table can not be owned by multi group > (3) if group has table, it must also has servers > (4) default group must has servers in it > if sanity check can’t pass, give up the following process.work as this, it > can greatly reduce the complexity of rsgroup implementation, there is no need > to wait for the rsgroup table to be online, and methods like moveServers, > moveTables, addRSGroup, removeRSGroup, moveServersAndTables can be removed > from RSGroupAdminService.only a refresh method is need(modify persistence > layer first and refresh the memory) > # we should add some group informations on master web UI > to do this, RSGroupBasedLoadBalancer should move to hbase-server module, > because MasterStatusTmpl.jamon depends on it > # there may be some issues about RSGroupBasedLoadBalancer.roundRobinAssignment > if two groups both include BOGUS_SERVER_NAME, assignments.putAll will > overwrite the previous data > # there may be some issues about RSGroupBasedLoadBalancer.randomAssignment > when the return value is BOGUS_SERVER_NAME, AM can not handle this case. we > should return null value instead of BOGUS_SERVER_NAME. > # when RSGroupBasedLoadBalancer.balanceCluster execute, groups are balanced > one by one, if there are two many groups, we can do this in parallel. -- This message was sent by Atlassian JIRA (v6.4.14#64029)