Re: StochasticLoadBalancer questions

2017-01-13 Thread Timothy Brown
Regarding #1, my main concern is that if we poll the region load at a "bad" time and get back an abnormally high or low value, the balancer could overreact. For example, if your region's most recent readRequestsCount is 100 and you've been seeing 5 for the last 9 times you polled, the "average" outp
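
To put numbers on the example above, here is a minimal sketch. It assumes the weighting is applied as a repeated halving average (each new sample is averaged 50/50 with the running value), which is one reading of the weights quoted in this thread and not a copy of the HBase source. It also assumes nine polls of 5 followed by one poll of 100. The class and method names are hypothetical.

```java
final class SpikeSensitivitySketch {
    // Repeated halving average: the newest sample always carries weight 0.5.
    // Assumption: this mirrors the weighting discussed in the thread.
    static double halvingAverage(double[] oldestFirst) {
        double cost = 0.0;
        boolean first = true;
        for (double load : oldestFirst) {
            cost = first ? load : 0.5 * cost + 0.5 * load;
            first = false;
        }
        return cost;
    }

    // Plain arithmetic mean, shown only for comparison.
    static double simpleMean(double[] samples) {
        if (samples.length == 0) {
            return 0.0;
        }
        double sum = 0.0;
        for (double s : samples) {
            sum += s;
        }
        return sum / samples.length;
    }

    public static void main(String[] args) {
        // Nine quiet polls followed by one spike (values from the example above).
        double[] polls = {5, 5, 5, 5, 5, 5, 5, 5, 5, 100};
        System.out.println(halvingAverage(polls)); // prints 52.5
        System.out.println(simpleMean(polls));     // prints 14.5
    }
}
```

Under these assumptions a single outlier moves the halving average from 5 to 52.5, while a plain mean over the same ten polls moves to 14.5.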

Re: StochasticLoadBalancer questions

2017-01-13 Thread Ted Yu
For #2, you're more than welcome to attach a patch on the JIRA. For #1, last time I tried to trace which JIRA introduced the formula but ended up with one by Elliott that just moved that line of code. I can spend more time in the future on this. What downside have you observed for #1? Cheers O

Re: StochasticLoadBalancer questions

2017-01-13 Thread Timothy Brown
I tried it out on our staging cluster and saw that the total number of requests per region server was a bit more balanced with our current weights for the read and write costs. I did not attempt to calculate the exact requests per second but rather looked at a relative rate by averaging the increase in
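
One way such a relative rate could be computed is sketched below. The message is cut off before it says exactly what was averaged, so this assumes the input is a cumulative request counter sampled at each poll and that the rate is the mean increase between consecutive polls. The class name, method name, and the handling of counter resets are all hypothetical.

```java
final class RelativeRateSketch {
    // Mean increase between consecutive samples of a cumulative counter.
    // Assumption: a drop in the counter (e.g. after a region reopens)
    // is counted as zero increase rather than a negative one.
    static double averageIncrease(long[] counts) {
        if (counts.length < 2) {
            return 0.0; // not enough samples to form a delta
        }
        long total = 0L;
        for (int i = 1; i < counts.length; i++) {
            total += Math.max(0L, counts[i] - counts[i - 1]);
        }
        return (double) total / (counts.length - 1);
    }

    public static void main(String[] args) {
        long[] readRequestsCount = {100L, 130L, 150L, 190L};
        System.out.println(averageIncrease(readRequestsCount)); // prints 30.0
    }
}
```

Because the result is an increase per poll rather than per second, it is only comparable across regions that are polled on the same schedule.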

Re: StochasticLoadBalancer questions

2017-01-13 Thread Ted Yu
Logged HBASE-17462 for #2. FYI On Thu, Jan 12, 2017 at 8:49 AM, Ted Yu wrote: > For #2, I think MemstoreSizeCostFunction belongs to the same category if > we are to adopt moving average. > > Some factors to consider: > > The data structure used by StochasticLoadBalancer should be concise. The >

Re: StochasticLoadBalancer questions

2017-01-12 Thread Ted Yu
For #2, I think MemstoreSizeCostFunction belongs to the same category if we are to adopt moving average. Some factors to consider: The data structure used by StochasticLoadBalancer should be concise. The number of regions in a cluster can be expected to approach 1 million. We cannot afford to sto

Re: StochasticLoadBalancer questions

2017-01-11 Thread Ted Yu
For #2, I think it makes sense to try out using request rates for cost calculation. If the experiment result turns out to be better, we can consider using such measure. Thanks On Wed, Jan 11, 2017 at 5:34 PM, Timothy Brown wrote: > Hi, > > I have a couple of questions about the StochasticLoadB

StochasticLoadBalancer questions

2017-01-11 Thread Timothy Brown
Hi, I have a couple of questions about the StochasticLoadBalancer. 1) In CostFromRegionLoadFunction.getRegionLoadCost the cost weights later samples of the RegionLoad more than previous ones. For example, with a queue size of 4 it would be (.5 * load1 + .25*load2 + .125*load3 + .125*load4). Is
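
The weights in that example can be reproduced with a short sketch. It assumes the cost is built by repeatedly averaging the running value 50/50 with the next sample, starting from the oldest one; that assumption is inferred from the quoted weights, not taken from the HBase source. The class and method names are hypothetical, and the array is ordered oldest sample first, so load1 in the example corresponds to the last element.

```java
import java.util.Arrays;

final class SampleWeightsSketch {
    // Effective weight of each sample (oldest first) after n rounds of
    // "new cost = 0.5 * old cost + 0.5 * sample", seeded with the first sample.
    static double[] effectiveWeights(int n) {
        double[] weights = new double[n];
        for (int i = 0; i < n; i++) {
            if (i == 0) {
                weights[0] = 1.0; // the first sample seeds the running value
            } else {
                for (int j = 0; j < i; j++) {
                    weights[j] *= 0.5; // every earlier sample is halved
                }
                weights[i] = 0.5; // the newest sample takes the other half
            }
        }
        return weights;
    }

    public static void main(String[] args) {
        // Queue size of 4, as in the example above.
        System.out.println(Arrays.toString(effectiveWeights(4)));
        // prints [0.125, 0.125, 0.25, 0.5]
    }
}
```

The two oldest samples share the same weight because the first sample is used as-is rather than being averaged with zero, which is why the weights still sum to 1.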