[ 
https://issues.apache.org/jira/browse/WHIRR-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039212#comment-13039212
 ] 

Tom White commented on WHIRR-314:
---------------------------------

Here's a stack trace from the thrift server node:

{noformat}
2011-05-25 16:40:19,672 INFO 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers: getMaster 
attempt 9 of 10 failed; no more retrying.
java.io.IOException: org.apache.zookeeper.KeeperException$NoNodeException: 
KeeperErrorCode = NoNode for /hbase/master
     at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:481)
     at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:377)
     at 
org.apache.hadoop.hbase.client.HConnectionManager$TableServers.getMaster(HConnectionManager.java:381)
     at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:78)
     at 
org.apache.hadoop.hbase.thrift.ThriftServer$HBaseHandler.<init>(ThriftServer.java:191)
     at 
org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:817)
     at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:874)
Caused by: org.apache.zookeeper.KeeperException$NoNodeException: 
KeeperErrorCode = NoNode for /hbase/master
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
     at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:921)
     at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:477)
     ... 6 more
2011-05-25 16:40:19,677 INFO org.apache.zookeeper.ZooKeeper: Session: 
0x1302806aebc0001 closed
2011-05-25 16:40:19,678 DEBUG 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper: 
<173-203-217-78.static.cloud-ips.com:2181:/hbase,org.apache.hadoop.hbase.client.HConnectionManage
r>Closed connection with ZooKeeper; /hbase/root-region-server
{noformat}

> HBase integration test can fail due to Thrift server race
> ---------------------------------------------------------
>
>                 Key: WHIRR-314
>                 URL: https://issues.apache.org/jira/browse/WHIRR-314
>             Project: Whirr
>          Issue Type: Bug
>            Reporter: Tom White
>            Assignee: Tom White
>         Attachments: WHIRR-314.patch
>
>
> There is a race condition where the Thrift server comes up faster than the 
> master, fails to connect (after trying 10 times), then shuts down for good. 
> Both Andrei and I have seen this fail on Rackspace Cloud Servers.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to