What do you have for your HBASE_OPTS in conf/hbase-env.sh Dmitry?

Remove this:

-XX:+CMSIncrementalMode

... if its present on your HBASE_OPTS.

Change your hbase.zookeeper.property.tickTime from 2 to 3 so that zk session
goes for longer (See the comment in the head of the 0.20 branch for
explaination).

What else is running on the machines where the regionserver times out its
session w/ zk?  Anything?  The uploader?

You've set the ulimit > 1024 and xceivers?  Right (I don't see that in your
old messages)

St.Ack



On Mon, Jan 11, 2010 at 6:21 AM, Dmitriy Lyfar <[email protected]> wrote:

> Hi,
>
> 2010/1/10 Jean-Daniel Cryans <[email protected]>
>
> > You have this line:
> >
> > 2010-01-08 21:25:24,709 WARN org.apache.hadoop.hbase.util.Sleeper: We
> > slept 66413ms, ten times longer than scheduled: 3000
> >
> > That's a garbage collector pause that lasted more than a minute which
> > is higher than the default timeout to consider a region server dead
> > (40 seconds in 0.20 unless you are using 0.20.3RC1). The master
> > replayed the write-ahead-logs and reopened the regions elsewhere.
> >
> > You want to set a higher heap space in conf/hbase-env.sh because the
> > default 1GB is way too low, give it a much as you can without
> > swapping.
> >
> > J-D
> >
> >
> I can try to add more memory to regionservers. But now I already have 5Gb
> per each node.
> (I'm using 0.20.2).
>
> --
> Thank you, Lyfar Dmitriy
>

Reply via email to