Re: distribution of regions to servers

2011-06-07 Thread Stack
Hey Geoff: Is this a 0.90.2 (or .3) hbase? It had some fixup done on the balancer to make it a little more random. Otherwise, yes, a table should be better spread about the cluster. This has come up a few times here in the past. The case has been well-made by now that the balancer needs to take

RE: distribution of regions to servers

2011-06-07 Thread Geoff Hendrey
: user@hbase.apache.org Cc: hbase-u...@hadoop.apache.org Subject: Re: distribution of regions to servers Hey Geoff: Is this a 0.90.2 (or .3) hbase? It had some fixup done on the balancer to make it a little more random. Otherwise, yes, a table should be better spread about the cluster. This has come

Re: distribution of regions to servers

2011-06-07 Thread Stack
saint@gmail.com] On Behalf Of > Stack > Sent: Tuesday, June 07, 2011 2:16 PM > To: user@hbase.apache.org > Cc: hbase-u...@hadoop.apache.org > Subject: Re: distribution of regions to servers > > Hey Geoff: > > Is this a 0.90.2 (or .3) hbase?  It had some fixup done on t

RE: distribution of regions to servers

2011-06-07 Thread Geoff Hendrey
that right? -geoff -Original Message- From: saint@gmail.com [mailto:saint@gmail.com] On Behalf Of Stack Sent: Tuesday, June 07, 2011 3:07 PM To: user@hbase.apache.org Cc: hbase-u...@hadoop.apache.org Subject: Re: distribution of regions to servers I think you are thinking o

Re: distribution of regions to servers

2011-06-07 Thread Stack
On Tue, Jun 7, 2011 at 3:13 PM, Geoff Hendrey wrote: > You're right. I was thinking there was an Hbase balancer tool for spreading > the table's regions around. From your response, sounds like the hbase > balancer is just software that lives inside the Hbase master, as opposed to a > utility th

RE: distribution of regions to servers

2011-06-08 Thread Kleegrewe, Christian
Hi geoff, Since hbase balances not at table but at cluster basis it may happen that all the regions for one table are located at the same region server. The reason for this may be the way hbase does table splits. If a region exceeds the configured maximum size the region is split into two, but

Re: distribution of regions to servers

2011-06-08 Thread Ted Yu
In trunk this behavior has been improved. Load balancer would move the youngest region off heavily loaded region server. See http://zhihongyu.blogspot.com/2011/04/load-balancer-in-hbase-090.html I am thinking of creating a new policy for region assignment at cluster startup which assigns regions

Re: distribution of regions to servers

2011-06-08 Thread Stack
On Wed, Jun 8, 2011 at 12:50 PM, Ted Yu wrote: > I am thinking of creating a new policy for region assignment at cluster > startup which assigns regions from each table in round-robin fashion. > Don't we want to retain assignments on startup since that will ensure greatest locality of data? Roun

RE: distribution of regions to servers

2011-06-08 Thread Doug Meil
t@gmail.com [mailto:saint@gmail.com] On Behalf Of Stack Sent: Wednesday, June 08, 2011 3:56 PM To: user@hbase.apache.org Subject: Re: distribution of regions to servers On Wed, Jun 8, 2011 at 12:50 PM, Ted Yu wrote: > I am thinking of creating a new policy for region assignment at >

Re: distribution of regions to servers

2011-06-08 Thread Ted Yu
The assumption was that regions were not evenly distributed prior to restarting. If they were, user wouldn't select this policy. We can this policy effective only once - retain assignment is selected following this new policy. Of course the dynamic portion of load balancer needs to select the unde

RE: Distribution of regions to servers

2012-09-25 Thread Ramkrishna.S.Vasudevan
Hi Dan Generally if the region distribution is not done properly as per the need then always we end up in region server getting overloaded due to region hotspotting. Write thro put can go down. It is not like the coprocessor performance alone is slow. Please check if the regions are properly ba

RE: Distribution of regions to servers

2012-09-25 Thread Anoop Sam John
Hi Can u share more details pls? What work you are doing within the CPs -Anoop- From: Dan Han [dannahan2...@gmail.com] Sent: Wednesday, September 26, 2012 5:55 AM To: user@hbase.apache.org Subject: Distribution of regions to servers Hi all, I am doing

Re: Distribution of regions to servers

2012-09-26 Thread Dan Han
Thanks for your swift response, Ramkrishna and Anoop. And I will explicate what we are doing now below. We are trying to explore a systematic way to design the appropriate data schema for various applications in HBase. So we first designed several data schemas for each dataset and evaluate th

Re: Distribution of regions to servers

2012-09-26 Thread Eugeny Morozov
Dan, I have additional questions. What is the access pattern of your queries? I mean that f.e. PrefixFilters have to be applied for all KeyValue pairs in HFiles, which could be slow. Or f.e. scanner setCaching option is able to decrease number of network hops to get data from RegionServer. Additi

Re: Distribution of regions to servers

2012-09-26 Thread Dan Han
Hi, Eugeny , Thanks for your response. I answered your questions inline in Blue. And I'd like to give an example to describe my problem. Let's think about two data schemas for the same dataset. The two data schemas have different composite row keys. But there is a same part in both schemas, wh

RE: Distribution of regions to servers

2012-09-26 Thread Ramkrishna.S.Vasudevan
needs to be set accordingly. Regards Ram > -Original Message- > From: Dan Han [mailto:dannahan2...@gmail.com] > Sent: Thursday, September 27, 2012 7:00 AM > To: user@hbase.apache.org > Subject: Re: Distribution of regions to servers > > Hi, Eugeny , > >

Re: Distribution of regions to servers

2012-09-27 Thread Eugeny Morozov
Dan, see inlined. On Thu, Sep 27, 2012 at 5:30 AM, Dan Han wrote: > Hi, Eugeny , > >Thanks for your response. I answered your questions inline in Blue. > And I'd like to give an example to describe my problem. > > Let's think about two data schemas for the same dataset. > The two data schema

Re: Distribution of regions to servers

2012-09-27 Thread Dan Han
Thanks for your advice, Eugeny. Best Wishes Dan Han On Thu, Sep 27, 2012 at 2:34 AM, Eugeny Morozov wrote: > Dan, see inlined. > > On Thu, Sep 27, 2012 at 5:30 AM, Dan Han wrote: > > > Hi, Eugeny , > > > >Thanks for your response. I answered your questions inline in Blue. > > And I'd like t

Re: Distribution of regions to servers

2012-09-27 Thread Dan Han
; Ram > > -Original Message- > > From: Dan Han [mailto:dannahan2...@gmail.com] > > Sent: Thursday, September 27, 2012 7:00 AM > > To: user@hbase.apache.org > > Subject: Re: Distribution of regions to servers > > > > Hi, Eugeny , > > > >

RE: Distribution of regions to servers

2012-09-27 Thread Ramkrishna.S.Vasudevan
ay be we may need to see that the overall scan is not timeout. Regards Ram > -Original Message- > From: Dan Han [mailto:dannahan2...@gmail.com] > Sent: Friday, September 28, 2012 3:05 AM > To: user@hbase.apache.org > Subject: Re: Distribution of regions to servers

Re: Distribution of regions to servers

2012-09-30 Thread Dan Han
overall scan is not timeout. > > Regards > Ram > > > > -Original Message- > > From: Dan Han [mailto:dannahan2...@gmail.com] > > Sent: Friday, September 28, 2012 3:05 AM > > To: user@hbase.apache.org > > Subject: Re: Distribution of regions to serv