Hi Xenia,

I think the problem is with ZooKeeper. Can you make sure that the ZooKeeper server is running? If it is, is it listening on port 22181? Your Giraph job is trying to connect on that port. If ZooKeeper is running on a different port, try running your Giraph job with -Dgiraph.zkList=<zookeeper server ip>:<zookeeper port>
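
One quick way to check whether anything is accepting connections on that port is a small script run from one of the worker nodes. This is only a minimal sketch: the function name zk_port_open and the timeout value are my own assumptions, not part of Giraph or ZooKeeper, and a successful connection only shows that something is listening, not that it is actually ZooKeeper.

```python
import socket


def zk_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds, else False.

    Hypothetical helper (not part of Giraph or ZooKeeper). It only checks
    that some process accepts connections on the port.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False
```

For example, zk_port_open("10.190.12.33", 22181) uses the address and port from the log in this thread. If it returns False, either start ZooKeeper on that port or point the job at the right server with -Dgiraph.zkList.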

I'm not sure whether you have to start a ZooKeeper instance separately or whether Giraph will start one for you. I have a separate instance running on my cluster, and I specify the server and port via the -Dgiraph.zkList option. I hope that works.

Vivek

________________________________________
From: xeniad20 <xenia...@gmail.com>
Sent: Thursday, August 7, 2014 3:46 PM
To: user@giraph.apache.org
Subject: giraph 1.1.0 Execution Error

Hi experts,

I am trying to run Giraph 1.1.0 on a small cluster, but I get the following errors:

2014-08-07 23:35:46,141 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:46,142 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:46,243 WARN org.apache.giraph.zk.ZooKeeperExt: deleteExt: Connection loss on attempt 2, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /_hadoopBsp/job_201408072332_0003/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/datanode1_1
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:873)
    at org.apache.giraph.zk.ZooKeeperExt.deleteExt(ZooKeeperExt.java:302)
    at org.apache.giraph.worker.BspServiceWorker.unregisterHealth(BspServiceWorker.java:768)
    at org.apache.giraph.worker.BspServiceWorker.failureCleanup(BspServiceWorker.java:782)
    at org.apache.giraph.graph.GraphTaskManager.workerFailureCleanup(GraphTaskManager.java:900)
    at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:100)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
2014-08-07 23:35:48,126 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:48,127 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:49,368 FATAL org.apache.giraph.graph.GraphMapper: uncaughtException: OverrideExceptionHandler on thread Thread-12, msg = createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!, exiting...
java.lang.IllegalStateException: createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!
    at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:182)
    at org.apache.giraph.zk.ZooKeeperExt.createOrSetExt(ZooKeeperExt.java:247)
    at org.apache.giraph.worker.WorkerProgress.writeToZnode(WorkerProgress.java:110)
    at org.apache.giraph.worker.WorkerProgressWriter$1.run(WorkerProgressWriter.java:59)
    at java.lang.Thread.run(Thread.java:724)

However, Giraph 1.0.0 runs without any problems. What might be the solution for the above errors? Any help is appreciated.

Thanks,
Xenia