Re: zookeeper connection issue while running for second time

2013-10-01 Thread Jyotirmoy Sundi
Thanks a lot Avery for your response, I increased the timeout to 10 minutes *changed:* -Dgiraph.zkSessionMsecTimeout=60 and -Dgiraph.useInputSplitLocality=false , It is working for consecutive runs now without any errors. Thanks Sundi On Tue, Oct 1, 2013 at 10:18 PM, Avery Ching wrote: >

Re: zookeeper connection issue while running for second time

2013-10-01 Thread Avery Ching
We did have this error a few times. This can happen due to GC pauses, so I would check the worker for long GC issues. Also, you can increase the ZooKeeper timeouts, see /** ZooKeeper session millisecond timeout */ IntConfOption ZOOKEEPER_SESSION_TIMEOUT = new IntConfOption("giraph.zk

Re: zookeeper connection issue while running for second time

2013-10-01 Thread Jyotirmoy Sundi
Hi , I am able to run apache giraph successfully with around 500M pairs to find Connected components. It works great but not always, the issue seems to be with the time out zookeeper time out. Some of the client(around 5-10 ) out of 100, produces this error and the master fails due to this.Do you h

zookeeper connection issue while running for second time

2013-09-26 Thread Jyotirmoy Sundi
Hi , I got the connected component working for 1B nodes, but when I run the job again, it fails with the below error. Aprt form this in zookeeper the data is not cleared in the data directory. For successful jobs the data in zookeper from giraph is cleared. The following errors seems to be com