Yes, here is what happens. The region server reports its address to ZooKeeper (it has been a while, and it might be the HBase master rather than ZooKeeper, but the point stands either way). The region server gets its address by looking up its own hostname. If /etc/hosts on that server associates the hostname with the loopback address (127.0.0.1), then the IP address reported to ZooKeeper (or the HBase master) is 127.0.0.1, which means that they won't be able to connect back to the region server.
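A quick way to check this on a region server is to resolve the machine's own hostname and see whether the result is a loopback address. The snippet below is a minimal sketch using only the Python standard library. The function names are illustrative, and it only approximates the lookup the region server performs (HBase runs on the JVM and can also be affected by DNS settings).

```python
import ipaddress
import socket


def is_loopback(address):
    """Return True if the IP address string is a loopback address (127.0.0.0/8 or ::1)."""
    return ipaddress.ip_address(address).is_loopback


def resolve_own_hostname():
    """Return (hostname, ip) for this machine, using the system resolver (/etc/hosts, then DNS)."""
    hostname = socket.gethostname()
    return hostname, socket.gethostbyname(hostname)


if __name__ == "__main__":
    try:
        name, ip = resolve_own_hostname()
    except socket.gaierror as err:
        print("could not resolve own hostname:", err)
    else:
        if is_loopback(ip):
            print("WARNING: {} resolves to loopback address {}".format(name, ip))
        else:
            print("OK: {} resolves to {}".format(name, ip))
```

If this prints the warning, the hostname entry in /etc/hosts on that machine is the first thing to look at.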

Another situation that can cause the error message you mentioned is hitting the maximum number of concurrent connections on ZooKeeper. I don't remember what the default value is; I think 30 or 100. You probably want about 10 * (number of region servers). Once connections exceed that limit, ZooKeeper starts rejecting new connections and you get that error.
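As a rough illustration of that sizing rule, the sketch below computes 10 * (number of region servers) and formats it as a ZooKeeper setting. It assumes the limit in question is ZooKeeper's `maxClientCnxns` option in zoo.cfg (when HBase manages ZooKeeper, the corresponding hbase-site.xml property is `hbase.zookeeper.property.maxClientCnxns`). The helper names and the default factor of 10 are illustrative only. Note that ZooKeeper documents this limit as applying per client IP address.

```python
def suggested_max_client_cnxns(num_regionservers, per_server=10):
    """Apply the rule of thumb: about 10 connections per region server."""
    if num_regionservers < 1:
        raise ValueError("need at least one region server")
    return per_server * num_regionservers


def zoo_cfg_line(limit):
    """Format the limit as a zoo.cfg entry."""
    return "maxClientCnxns={}".format(limit)


if __name__ == "__main__":
    # Hypothetical cluster with 5 region servers.
    print(zoo_cfg_line(suggested_max_client_cnxns(5)))
```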

That being said, I don't recommend HBase for Nutch, since the latest version of HBase is not supported by Gora, and the version that is supported is not really stable under heavy load. I am experimenting with Cassandra right now, and the jury is still out on that, but it looks promising. Nevertheless, if the number of sites you want to crawl is anywhere close to the number of sites in DMOZ, then you probably want to start replacing MySQL ASAP.


On 08/13/2013 11:49 AM, Ralf R. Kotowski wrote:
You mean NOT to point to 127.0.0.1? I thought it was supposed to do that,
from what I read.

Anyway, I'm using MySQL for now. I'll work on HBase later, when this is all
functional.

-----Original Message-----
From: kaveh minooie [mailto:[email protected]]
Sent: Monday, August 12, 2013 10:52 PM
To: [email protected]
Subject: Re: Hbase is able to connect to Zookeeper but the connection closes immediately

I don't know if this is your problem, but it would cause the same error
message as well. You have to make sure that your hostnames do not
resolve to the loopback address (127.0.0.1), so your hostname should either
not be in /etc/hosts at all, or it should be mapped to a proper IP address, the
same one that your region servers are binding to.



On 08/09/2013 10:01 AM, Ralf R. Kotowski wrote:
Thanks,

You're right, I have just been able to reproduce the error via the HBase
shell without having Nutch in the loop.

-----Original Message-----
From: Lewis John Mcgibbney [mailto:[email protected]]
Sent: Friday, August 09, 2013 5:39 PM
To: [email protected]
Subject: Re: Hbase is able to connect to Zookeeper but the connection closes immediately

Hi Ralf,
AFAICS this would be much better suited to the HBase user list.
Sorry I can't help more.

On Friday, August 9, 2013, Ralf R. Kotowski <[email protected]> wrote:
Nutch 2.2.1
Hbase 0.90.4
Solr 4.4.0

Fedora Core 19

Sun Java (latest)



Error Msg: Hbase is able to connect to Zookeeper but the connection closes immediately



Scanned web/mailing lists, about to make a new install of everything. Could
this be a firewall issue?





--
Kaveh Minooie
