here 's what I get http://pastebin.com/m60c1864b
Ananth T Sarathy On Wed, Jan 13, 2010 at 2:57 PM, Jean-Daniel Cryans <[email protected]>wrote: > Do "jps" then "jstack pid" with the master's pid given by jps. > > J-D > > On Wed, Jan 13, 2010 at 11:41 AM, Ananth T. Sarathy > <[email protected]> wrote: > > well when i do a ps -ef|grep hbase i have 3 processes running. I have > killed > > them all, reinstalled hbase, formated my name node, and still the > master.log > > is the same when I restart. What could be causing it hang? > > > > > > Ananth T Sarathy > > > > > > On Wed, Jan 13, 2010 at 2:26 PM, Jean-Daniel Cryans <[email protected] > >wrote: > > > >> Well it's just weird that your master would just "refuse" to start. Is > >> the process still there? If you jstack it, is there any thread > >> running? > >> > >> You could also clean up everything and retry, but that's just the easy > >> way out :P > >> > >> J-D > >> > >> On Wed, Jan 13, 2010 at 11:23 AM, Ananth T. Sarathy > >> <[email protected]> wrote: > >> > master. out is empty.... could something have cludged up from the > >> previous > >> > issues? Are there files I should delete/ reformat my namenode? > >> > > >> > I don't have any data yet in these, so I can afford to blow things > away, > >> but > >> > I cleaned out the tmp dir already so I am not sure what else i need to > >> do. > >> > Ananth T Sarathy > >> > > >> > > >> > On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans < > [email protected] > >> >wrote: > >> > > >> >> If that's everything from your master log, then I would suggest you > >> >> take a look at the .out file (instead of .log) since it might be a > >> >> problem on startup. > >> >> > >> >> J-D > >> >> > >> >> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy > >> >> <[email protected]> wrote: > >> >> > Master log > >> >> > > >> >> > http://pastebin.com/m469d1b39 > >> >> > > >> >> > Zookeeper log > >> >> > http://pastebin.com/m47f0503 > >> >> > > >> >> > region server > >> >> > > >> >> > http://pastebin.com/m305fab14 > >> >> > > >> >> > Ananth T Sarathy > >> >> > > >> >> > > >> >> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans < > >> [email protected] > >> >> >wrote: > >> >> > > >> >> >> Looks like your master didn't register itself in zookeeper, you > >> should > >> >> >> look in its log. > >> >> >> > >> >> >> J-D > >> >> >> > >> >> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy > >> >> >> <[email protected]> wrote: > >> >> >> > ok, we got that to work and zookeeper is coming up, but now I am > >> >> getting > >> >> >> > something else... the regionserver are connecting cause of > >> >> >> > > >> >> >> > 2010-01-13 13:57:56,029 WARN > >> >> >> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to > read > >> >> master > >> >> >> > address from ZooKeeper. Retrying. Error was: > >> >> >> > java.io.IOException: > >> >> >> org.apache.zookeeper.KeeperException$NoNodeException: > >> >> >> > KeeperErrorCode = NoNode for /hbase/master > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332) > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240) > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339) > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371) > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427) > >> >> >> > at java.lang.Thread.run(Thread.java:636) > >> >> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException: > >> >> >> > KeeperErrorCode = NoNode for /hbase/master > >> >> >> > at > >> >> >> > > >> org.apache.zookeeper.KeeperException.create(KeeperException.java:102) > >> >> >> > at > >> >> >> > > >> org.apache.zookeeper.KeeperException.create(KeeperException.java:42) > >> >> >> > at > >> org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892) > >> >> >> > at > >> >> >> > > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328) > >> >> >> > ... 5 more > >> >> >> > > >> >> >> > > >> >> >> > any ideas? > >> >> >> > Ananth T Sarathy > >> >> >> > > >> >> >> > > >> >> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans < > >> >> >> [email protected]>wrote: > >> >> >> > > >> >> >> >> HBase 0.20.2 and previous only checked one address against the > >> list > >> >> >> >> that is provided, the one returned was the default Java knew > of. > >> It > >> >> >> >> seems that in your case your /etc/hosts makes it that this > >> machines > >> >> >> >> resolves itself only as localhost. You can: > >> >> >> >> > >> >> >> >> 1) Try to fix your network configuration to have your machine > >> always > >> >> >> >> resolve by its hostname first, or > >> >> >> >> > >> >> >> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder > to > >> >> match > >> >> >> >> the address. You can get it here: > >> >> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/<http://people.apache.org/%7Ejdcryans/hbase-0.20.3-candidate-1/> > <http://people.apache.org/%7Ejdcryans/hbase-0.20.3-candidate-1/> > >> <http://people.apache.org/%7Ejdcryans/hbase-0.20.3-candidate-1/> > >> >> <http://people.apache.org/%7Ejdcryans/hbase-0.20.3-candidate-1/> > >> >> >> <http://people.apache.org/%7Ejdcryans/hbase-0.20.3-candidate-1/> > >> >> >> >> > >> >> >> >> Sorry for that, > >> >> >> >> > >> >> >> >> J-D > >> >> >> >> > >> >> >> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy > >> >> >> >> <[email protected]> wrote: > >> >> >> >> > I have Hbase.env set to manage Zookeeper. When I try to start > >> >> hbase, > >> >> >> the > >> >> >> >> > zookeeper out says > >> >> >> >> > > >> >> >> >> > java.io.IOException: Could not find my address: localhost in > >> list > >> >> of > >> >> >> >> > ZooKeeper quorum servers > >> >> >> >> > at > >> >> >> >> > > >> >> >> >> > >> >> >> > >> >> > >> > org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128) > >> >> >> >> > at > >> >> >> >> > > >> >> >> > >> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67) > >> >> >> >> > ~ > >> >> >> >> > > >> >> >> >> > in my hbase-site.xml > >> >> >> >> > > >> >> >> >> > <property> > >> >> >> >> > <name>hbase.zookeeper.quorum</name> > >> >> >> >> > <value>gs2,gs3,gs4</value> > >> >> >> >> > <description>Comma separated list of servers in the > ZooKeeper > >> >> >> Quorum. > >> >> >> >> > For example, "host1.mydomain.com,host2.mydomain.com, > >> >> >> host3.mydomain.com > >> >> >> >> ". > >> >> >> >> > By default this is set to localhost for local and > >> >> pseudo-distributed > >> >> >> >> > modes > >> >> >> >> > of operation. For a fully-distributed setup, this should be > >> set > >> >> to a > >> >> >> >> full > >> >> >> >> > list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is > set > >> in > >> >> >> >> > hbase-env.sh > >> >> >> >> > this is the list of servers which we will start/stop > ZooKeeper > >> >> on. > >> >> >> >> > </description> > >> >> >> >> > </property> > >> >> >> >> > > >> >> >> >> > in my /etc/hosts > >> >> >> >> > > >> >> >> >> > # hostname gs2 added to /etc/hosts by anaconda > >> >> >> >> > 127.0.0.1 localhost localhost.localdomain localhost4 > >> >> >> >> > localhost4.localdomain4 gs2 > >> >> >> >> > ::1 localhost localhost.localdomain localhost6 > >> >> >> >> > localhost6.localdomain6 gs2 > >> >> >> >> > > >> >> >> >> > 192.168.20.101 gs1 > >> >> >> >> > 192.168.20.102 gs2 > >> >> >> >> > 192.168.20.103 gs3 > >> >> >> >> > 192.168.20.104 gs4 > >> >> >> >> > 192.168.20.105 gs5 > >> >> >> >> > 192.168.20.106 gs6 > >> >> >> >> > 192.168.20.107 gs7 > >> >> >> >> > 192.168.20.108 gs8 > >> >> >> >> > 192.168.20.110 gs10 > >> >> >> >> > 192.168.20.111 gs11 > >> >> >> >> > 192.168.20.112 gs12 > >> >> >> >> > 192.168.20.113 gs13 > >> >> >> >> > 192.168.20.114 gs14 > >> >> >> >> > 192.168.20.115 gs15 > >> >> >> >> > 192.168.20.116 gs16 > >> >> >> >> > 192.168.20.117 gs17 > >> >> >> >> > > >> >> >> >> > am I missing something here? Why does it insist on localhost > in > >> the > >> >> >> >> quorum > >> >> >> >> > list? What do i need to do to unconfuse it? > >> >> >> >> > > >> >> >> >> > > >> >> >> >> > Ananth T Sarathy > >> >> >> >> > > >> >> >> >> > >> >> >> > > >> >> >> > >> >> > > >> >> > >> > > >> > > >
