Hi all,

I have problem starting a cluster of 1 master and 3 region server. When I 
started the cluster, the HMaster and HRegionserver will automatically exit.

My abase-site.xml:
<configuration>
  <property>
    <name>hbase.master</name>
    <value>hadoopmaster:60000</value>
  </property>
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://hadoopmaster:9000/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/proj/ucare/bo/hadoop_data/zookeeper</value>
  </property>
  <property>
    <name>hbase.zookeeper.quorum</name>
    
<value>pc225.emulab.net<http://pc225.emulab.net>,pc273.emulab.net<http://pc273.emulab.net>,pc210.emulab.net<http://pc210.emulab.net></value>
    <description>The directory shared by RegionServers.
    </description>
  </property>
  <property>
    <name>hbase.zookeeper.property.clientPort</name>
    <value>2181</value>
 </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>


Log file:

2015-04-26 11:38:45,938 INFO  
[main-SendThread(pc273.emulab.net<http://pc273.emulab.net>:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
pc273.emulab.net/155.98.39.73:2181<http://pc273.emulab.net/155.98.39.73:2181>. 
Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:38:45,939 INFO  
[main-SendThread(pc273.emulab.net<http://pc273.emulab.net>:2181)] 
zookeeper.ClientCnxn: Socket connection established to 
pc273.emulab.net/155.98.39.73:2181<http://pc273.emulab.net/155.98.39.73:2181>, 
initiating session
2015-04-26 11:38:45,940 INFO  
[main-SendThread(pc273.emulab.net<http://pc273.emulab.net>:2181)] 
zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, 
likely server has closed socket, closing socket connection and attempting 
reconnect
2015-04-26 11:38:47,022 INFO  
[main-SendThread(pc225.emulab.net<http://pc225.emulab.net>:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
pc225.emulab.net/155.98.39.25:2181<http://pc225.emulab.net/155.98.39.25:2181>. 
Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:38:47,023 INFO  
[main-SendThread(pc225.emulab.net<http://pc225.emulab.net>:2181)] 
zookeeper.ClientCnxn: Socket connection established to 
pc225.emulab.net/155.98.39.25:2181<http://pc225.emulab.net/155.98.39.25:2181>, 
initiating session
2015-04-26 11:38:47,025 INFO  
[main-SendThread(pc225.emulab.net<http://pc225.emulab.net>:2181)] 
zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x0, 
likely server has closed socket, closing socket connection and attempting 
reconnect
2015-04-26 11:38:47,994 INFO  
[main-SendThread(pc332.emulab.net<http://pc332.emulab.net>:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
pc332.emulab.net/155.98.39.132:2181<http://pc332.emulab.net/155.98.39.132:2181>.
 Will not attempt to authenticate using SASL (unknown error)
2015-04-26 11:39:17,150 INFO  
[main-SendThread(pc332.emulab.net<http://pc332.emulab.net>:2181)] 
zookeeper.ClientCnxn: Client session timed out, have not heard from server in 
30025ms for sessionid 0x0, closing socket connection and attempting reconnect
2015-04-26 11:39:17,251 WARN  [main] zookeeper.RecoverableZooKeeper: Possibly 
transient ZooKeeper, 
quorum=pc273.emulab.net<http://pc273.emulab.net>:2181,pc225.emulab.net<http://pc225.emulab.net>:2181,pc332.emulab.net<http://pc332.emulab.net>:2181,
 exception=org.apache.zookeeper.KeeperException$ConnectionLossException: 
KeeperErrorCode = ConnectionLoss for /hbase
2015-04-26 11:39:17,251 ERROR [main] zookeeper.RecoverableZooKeeper: ZooKeeper 
create failed after 4 attempts
2015-04-26 11:39:17,255 ERROR [main] master.HMasterCommandLine: Master exiting
java.lang.RuntimeException: Failed construction of Master: class 
org.apache.hadoop.hbase.master.HMaster
        at 
org.apache.hadoop.hbase.master.HMaster.constructMaster(HMaster.java:1982)
        at 
org.apache.hadoop.hbase.master.HMasterCommandLine.startMaster(HMasterCommandLine.java:198)
        at 
org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:139)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at 
org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:126)
        at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:1996)
Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
KeeperErrorCode = ConnectionLoss for /hbase
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
        at 
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.createNonSequential(RecoverableZooKeeper.java:512)
        at 
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.create(RecoverableZooKeeper.java:491)
        at 
org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1252)
        at 
org.apache.hadoop.hbase.zookeeper.ZKUtil.createWithParents(ZKUtil.java:1230)
        at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.createBaseZNodes(ZooKeeperWatcher.java:174)
        at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:167)
        at 
org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:531)
        at org.apache.hadoop.hbase.master.HMaster.<init>(HMaster.java:333)
        at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
        at 
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
        at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)


Can someone helps me figure out the problem? Is that because the number of 
region server and zookeeper nodes are incorrect?

Thanks a lot!

Bo


Reply via email to