Matthew,

What did you set your max region size to be for this table?
14K files totalling 650GB means you have a lot of small files...
On average ~45MB (rough calc).
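
For reference, the rough calculation above can be reproduced as follows. This is only a sketch: the helper name is made up for illustration, and it assumes 1GB = 1024MB and that the 14000 files are of roughly similar size, neither of which is stated in the thread.

```python
def average_file_size_mb(total_gb, file_count):
    """Average size per file in MB, assuming 1 GB = 1024 MB (an assumption)."""
    return total_gb * 1024 / file_count


# Figures taken from the original post: 650GB spread over ~14000 files.
avg_mb = average_file_size_mb(650, 14000)
print(f"average file size: {avg_mb:.1f} MB")
```

Depending on whether you count in decimal or binary units, this lands in the mid-to-high 40s of MB, which is in line with the ~45MB estimate.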

How many regions? Do you have MSLAB (MemStore-Local Allocation Buffers) enabled?
(Any GC tuning?)
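
On Ted's question about monitoring open file handles: if nothing is in place yet, something along these lines might serve as a stopgap on a Linux region server. It is a minimal sketch, not an HBase tool. The function names are hypothetical, it assumes /proc is available, and you would pass the RegionServer's pid yourself (the default "self" only inspects the script's own process).

```python
import os
import resource


def count_open_fds(pid="self"):
    """Count open file descriptors by listing /proc/<pid>/fd.

    Linux only; returns None when /proc is not available or not readable.
    """
    try:
        return len(os.listdir(f"/proc/{pid}/fd"))
    except OSError:
        return None


def soft_nofile_limit():
    """Soft limit on open files for the current process (what 'ulimit -n' reports)."""
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


if __name__ == "__main__":
    print("open fds:", count_open_fds(), "soft limit:", soft_nofile_limit())
```

Run as the hbase user, the reported soft limit should also show whether the 'ulimit -n 65536' setting is actually in effect for that user.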

Sorry for jumping in on the end of this conversation. 

-Mike

> From: matt...@tovbin.com
> Date: Mon, 31 Oct 2011 12:07:53 +0200
> Subject: Re: Region has been OPENING for too long
> To: user@hbase.apache.org
> 
> Ted,  thanks for such a rapid response.
> 
> You're right, we use hbase 0.90.3 from cdh3u1.
> 
> So, I suppose I need to do the bulk loading in smaller batches then. Any other
> suggestions?
> 
> 
> Best regards,
>     Matthew Tovbin =)
> 
> >
> >
> >I assume you're using HBase 0.90.x where HBASE-4015 isn't available.
> >
> >>> 5. And so on, till some of Slaves fail with "java.net.SocketException:
> >Too many open files".
> >Do you have some monitoring setup so that you can know the number of open
> >file handles ?
> >
> >Cheers
> >
> >On Sun, Oct 30, 2011 at 7:21 AM, Matthew Tovbin <[EMAIL PROTECTED]> wrote:
> >
> >> Hi guys,
> >>
> >>   I've bulk-loaded a solid amount of data (650GB, ~14000 files) into HBase
> >> (1 master + 3 regions), and enabling the table now results in the
> >> following behavior on the cluster:
> >>
> >>   1. Master says that opening started  -
> >>    "org.apache.hadoop.hbase.master.AssignmentManager: Handling
> >>   transition=RS_ZK_REGION_OPENING, server=slave..."
> >>   2. Slaves report about opening files in progress -
> >>   "org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://...."
> >>   3. Then after ~10 mins the following error occurs on hmaster -
> >>    "org.apache.hadoop.hbase.master.AssignmentManager: Regions in transition
> >>   timed out / Region has been OPENING for too long, reassigning region=..."
> >>   4. More slaves report about opening files in progress -
> >>   "org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://...."
> >>   5. And so on, till some of Slaves fail with "java.net.SocketException:
> >>   Too many open files".
> >>
> >>
> >> What I've done already to solve the issue (which DID NOT help though):
> >>
> >>   1. Set 'ulimit -n 65536' for hbase user
> >>   2. Set hbase.hbasemaster.maxregionopen=3600000 (1 hour) in hbase-site.xml
> >>
> >>
> >>  What else can I try?!
> >>
> >>
> >> Best regards,
> >>    Matthew Tovbin =)
> >>