Hi Xenia, 

I think the problem is with ZooKeeper. Can you make sure that the ZooKeeper
server is running? If it is, is it listening on port 22181? (Your Giraph job
is trying to connect on that port.) If ZooKeeper is running on a different
port, try running your Giraph job with
-Dgiraph.zkList=<zookeeper server ip>:<zookeeper port>
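
One quick way to check whether anything is listening on that port is a plain TCP connection test run from one of the worker nodes. This is only a minimal sketch: the function name is_port_open is made up for illustration, and a successful connection shows only that some process accepts connections on the port, not that it is a healthy ZooKeeper server.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; it does not speak the ZooKeeper
    protocol, it only checks that the port accepts connections.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "Connection refused", timeouts, and name-resolution failures.
        return False


# Example call, using the host and port from the log in this thread:
#   is_port_open("DataNode2", 22181)
```

If this returns False for DataNode2 on port 22181, that matches the "Connection refused" in your log, and it would mean that nothing is listening there.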

I'm not sure whether you have to start a ZooKeeper instance separately or
whether Giraph will start one for you. I have a separate instance running on
my cluster, and I specify the server and port via the -Dgiraph.zkList option.
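
To illustrate the shape of that option, here is a small sketch that formats it from host and port pairs. The helper name zk_list_option is hypothetical, port 2181 in the example is just ZooKeeper's usual default client port rather than anything from your setup, and I am assuming the option takes a comma-separated list when there is more than one server.

```python
def zk_list_option(servers):
    """Format (host, port) pairs as a -Dgiraph.zkList=... argument.

    Hypothetical helper for illustration only. Assumes several servers
    are given as a comma-separated host:port list.
    """
    if not servers:
        raise ValueError("at least one ZooKeeper server is required")
    pairs = ",".join(f"{host}:{port}" for host, port in servers)
    return "-Dgiraph.zkList=" + pairs


# Example with an assumed external ZooKeeper on its default client port:
#   zk_list_option([("10.190.12.33", 2181)])
```

The resulting string would go on the command line of the Giraph job along with the other -D options.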

I hope that works. 

Vivek

________________________________________
From: xeniad20 <xenia...@gmail.com>
Sent: Thursday, August 7, 2014 3:46 PM
To: user@giraph.apache.org
Subject: giraph 1.1.0 Execution Error

Hi experts,

I am trying to run Giraph 1.1.0 on a small cluster, but I get the
following errors:

2014-08-07 23:35:46,141 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:46,142 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
     at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
     at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
     at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
     at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:46,243 WARN org.apache.giraph.zk.ZooKeeperExt: deleteExt: Connection loss on attempt 2, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /_hadoopBsp/job_201408072332_0003/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/datanode1_1
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
     at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:873)
     at org.apache.giraph.zk.ZooKeeperExt.deleteExt(ZooKeeperExt.java:302)
     at org.apache.giraph.worker.BspServiceWorker.unregisterHealth(BspServiceWorker.java:768)
     at org.apache.giraph.worker.BspServiceWorker.failureCleanup(BspServiceWorker.java:782)
     at org.apache.giraph.graph.GraphTaskManager.workerFailureCleanup(GraphTaskManager.java:900)
     at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:100)
     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
     at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
     at java.security.AccessController.doPrivileged(Native Method)
     at javax.security.auth.Subject.doAs(Subject.java:415)
     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190)
     at org.apache.hadoop.mapred.Child.main(Child.java:249)
2014-08-07 23:35:48,126 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:48,127 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
     at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
     at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
     at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
     at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:49,368 FATAL org.apache.giraph.graph.GraphMapper: uncaughtException: OverrideExceptionHandler on thread Thread-12, msg = createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!, exiting...
java.lang.IllegalStateException: createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!
     at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:182)
     at org.apache.giraph.zk.ZooKeeperExt.createOrSetExt(ZooKeeperExt.java:247)
     at org.apache.giraph.worker.WorkerProgress.writeToZnode(WorkerProgress.java:110)
     at org.apache.giraph.worker.WorkerProgressWriter$1.run(WorkerProgressWriter.java:59)
     at java.lang.Thread.run(Thread.java:724)

However, Giraph 1.0.0 runs without any problems.
What might be the solution for the above errors?

Any help is appreciated.

Thanks
Xenia
