[ 
https://issues.apache.org/jira/browse/HBASE-9267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13746553#comment-13746553
 ] 

Jean-Daniel Cryans commented on HBASE-9267:
-------------------------------------------

Yesterday I was playing more with v2 and v3 and although I don't see the 
sublist issue anymore, the time it takes to balance always goes up until it 
reaches 60 seconds. The more I think about it the less I like it... there's no 
way to kill the balancer while it's running (AFAIK) and it blocks a couple of 
other things like HBCK. We could discuss this on dev@ or in another JIRA though.

So I'm still +1 on the patch.
                
> StochasticLoadBalancer goes over its processing time limit
> ----------------------------------------------------------
>
>                 Key: HBASE-9267
>                 URL: https://issues.apache.org/jira/browse/HBASE-9267
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.95.2
>            Reporter: Jean-Daniel Cryans
>            Assignee: Elliott Clark
>             Fix For: 0.98.0, 0.95.3
>
>         Attachments: HBASE-9267-0.patch, HBASE-9267-1.patch, 
> HBASE-9267-2.patch, HBASE-9267-3.patch, HBASE-9267-4.patch
>
>
> I'm trying out 0.95.2, I left it running over the weekend (8 RS, average load 
> between 12 and 3 regions) and right now the balancer runs for 12 mins:
> bq. 2013-08-19 21:54:45,534 DEBUG 
> [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore] 
> org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not 
> find a better load balance plan.  Tried 0 different configurations in 
> 777309ms, and did not find anything with a computed cost less than 
> 36.32576937689094
> It seems it slowly crept up there; yesterday it was doing:
> bq. 2013-08-18 20:53:17,232 DEBUG 
> [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore] 
> org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not 
> find a better load balance plan.  Tried 0 different configurations in 
> 257374ms, and did not find anything with a computed cost less than 
> 36.3251082542424
> And originally it was doing 1 minute.
> In the jstack I see a thousand of these and jstack doesn't want to show me the 
> whole thing:
> bq.  at java.util.SubList$1.nextIndex(AbstractList.java:713)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
