Thanks a lot Avery for your response, I increased the timeout to 10 minutes
*changed:*
-Dgiraph.zkSessionMsecTimeout=60 and
-Dgiraph.useInputSplitLocality=false ,
It is working for consecutive runs now without any errors.
Thanks
Sundi
On Tue, Oct 1, 2013 at 10:18 PM, Avery Ching wrote:
>
We did have this error a few times. This can happen due to GC pauses, so
I would check the worker for long GC issues. Also, you can increase the
ZooKeeper timeouts, see
/** ZooKeeper session millisecond timeout */
IntConfOption ZOOKEEPER_SESSION_TIMEOUT =
new IntConfOption("giraph.zk
Hi ,
I am able to run apache giraph successfully with around 500M pairs to
find Connected components. It works great but not always, the issue seems
to be with the time out zookeeper time out. Some of the client(around 5-10
) out of 100, produces this error and the master fails due to this.Do you
h
Hi ,
I got the connected component working for 1B nodes, but when I run
the job again, it fails with the below error. Aprt form this in
zookeeper the data is not cleared in the data directory. For
successful jobs the data in zookeper from giraph is cleared.
The following errors seems to be com