Errors while running large graph

2014-05-27 Thread Praveen kumar s.k
Hi All,
I am getting several errors consistently while processing large graph.
The code works when the size of the graph is in terms of GB's.
we have implemented compression and removing the dead end nodes in de
Bruijn graph
My cluster settings are

Cores WorkersRAM/Core  GraphsizeAggregateRAM
252 250  10.5 GB  2.3 TB2.6 TB

Below are the type of errors I am getting.

1.  I believe that this error occurred because of zookeeper session
expired. To address this I changed the parameter minSessionTimeout in
configuration to large value. However some workers still throw this
error.

2014-05-27 00:19:55,187 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
Caused by: java.lang.IllegalStateException: Failed to create job state
path due to KeeperException
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
at org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
... 2 more

2. I dont know why this below error is thrown. My guess is that,
master worker is failing for some reason

2014-05-27 00:19:55,184 ERROR org.apache.giraph.master.MasterThread:
masterThread: Master algorithm failed with IllegalStateException
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
at org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
... 2 more

3. Below is one more type of error
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405261249_0008/_masterJobState
at org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
... 2 more
2014-05-26 18:19:54,269 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
Caused by: java.lang.IllegalStateException: Failed to create job state
path due to KeeperException
at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405261249_0008/_masterJobState
at org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
at 

Re: Errors while running large graph

2014-05-27 Thread Avery Ching

You might also want to check the zookeeper memory options.

Some of our production jobs use parameters such as

-Xmx5g -XX:ParallelGCThreads=4 -XX:+UseConcMarkSweepGC 
-XX:CMSInitiatingOccupancyFraction=70 -XX:MaxGCPauseMillis=100


Since the master doesn't use much memory letting zk have more is reasonable.

On 5/27/14, 9:25 AM, Praveen kumar s.k wrote:

Hi All,
I am getting several errors consistently while processing large graph.
The code works when the size of the graph is in terms of GB's.
we have implemented compression and removing the dead end nodes in de
Bruijn graph
My cluster settings are

Cores WorkersRAM/Core  GraphsizeAggregateRAM
252 250  10.5 GB  2.3 TB2.6 TB

Below are the type of errors I am getting.

1.  I believe that this error occurred because of zookeeper session
expired. To address this I changed the parameter minSessionTimeout in
configuration to large value. However some workers still throw this
error.

2014-05-27 00:19:55,187 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
Caused by: java.lang.IllegalStateException: Failed to create job state
path due to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
 at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

2. I dont know why this below error is thrown. My guess is that,
master worker is failing for some reason

2014-05-27 00:19:55,184 ERROR org.apache.giraph.master.MasterThread:
masterThread: Master algorithm failed with IllegalStateException
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
 at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

3. Below is one more type of error
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at 
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405261249_0008/_masterJobState
 at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more
2014-05-26 18:19:54,269 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
Caused by: java.lang.IllegalStateException: Failed to create job state
path due to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at 

Re: Errors while running large graph

2014-05-27 Thread Praveen kumar s.k
Do need to put this in the zookeeper configuration file or giraph job
configuration?

On Tue, May 27, 2014 at 12:14 PM, Avery Ching ach...@apache.org wrote:
 You might also want to check the zookeeper memory options.

 Some of our production jobs use parameters such as

 -Xmx5g -XX:ParallelGCThreads=4 -XX:+UseConcMarkSweepGC
 -XX:CMSInitiatingOccupancyFraction=70 -XX:MaxGCPauseMillis=100

 Since the master doesn't use much memory letting zk have more is reasonable.


 On 5/27/14, 9:25 AM, Praveen kumar s.k wrote:

 Hi All,
 I am getting several errors consistently while processing large graph.
 The code works when the size of the graph is in terms of GB's.
 we have implemented compression and removing the dead end nodes in de
 Bruijn graph
 My cluster settings are

 Cores WorkersRAM/Core  GraphsizeAggregateRAM
 252 250  10.5 GB  2.3 TB2.6 TB

 Below are the type of errors I am getting.

 1.  I believe that this error occurred because of zookeeper session
 expired. To address this I changed the parameter minSessionTimeout in
 configuration to large value. However some workers still throw this
 error.

 2014-05-27 00:19:55,187 FATAL org.apache.giraph.graph.GraphMapper:
 uncaughtException: OverrideExceptionHandler on thread
 org.apache.giraph.master.MasterThread, msg = java.lang.Il$
 java.lang.IllegalStateException: java.lang.IllegalStateException:
 Failed to create job state path due to KeeperException
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
 Caused by: java.lang.IllegalStateException: Failed to create job state
 path due to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
 org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
 Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
 KeeperErrorCode = Session expired for
 /_hadoopBsp/job_201405262302_0003/_masterJobState
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
 org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

 2. I dont know why this below error is thrown. My guess is that,
 master worker is failing for some reason

 2014-05-27 00:19:55,184 ERROR org.apache.giraph.master.MasterThread:
 masterThread: Master algorithm failed with IllegalStateException
 java.lang.IllegalStateException: Failed to create job state path due
 to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
 org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
 Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
 KeeperErrorCode = Session expired for
 /_hadoopBsp/job_201405262302_0003/_masterJobState
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
 org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

 3. Below is one more type of error
 java.lang.IllegalStateException: Failed to create job state path due
 to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
 org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
 Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
 KeeperErrorCode = Session expired for
 /_hadoopBsp/job_201405261249_0008/_masterJobState
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
 org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
 org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more
 2014-05-26 18:19:54,269 FATAL org.apache.giraph.graph.GraphMapper:
 uncaughtException: OverrideExceptionHandler on thread
 org.apache.giraph.master.MasterThread, msg = java.lang.Il$
 java.lang.IllegalStateException: java.lang.IllegalStateException:
 Failed to create job state path due to KeeperException
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
 Caused 

Re: Errors while running large graph

2014-05-27 Thread Avery Ching

*giraph.zkJavaOpts*

On 5/27/14, 10:27 AM, Praveen kumar s.k wrote:

Do need to put this in the zookeeper configuration file or giraph job
configuration?

On Tue, May 27, 2014 at 12:14 PM, Avery Chingach...@apache.org  wrote:

You might also want to check the zookeeper memory options.

Some of our production jobs use parameters such as

-Xmx5g -XX:ParallelGCThreads=4 -XX:+UseConcMarkSweepGC
-XX:CMSInitiatingOccupancyFraction=70 -XX:MaxGCPauseMillis=100

Since the master doesn't use much memory letting zk have more is reasonable.


On 5/27/14, 9:25 AM, Praveen kumar s.k wrote:

Hi All,
I am getting several errors consistently while processing large graph.
The code works when the size of the graph is in terms of GB's.
we have implemented compression and removing the dead end nodes in de
Bruijn graph
My cluster settings are

Cores WorkersRAM/Core  GraphsizeAggregateRAM
252 250  10.5 GB  2.3 TB2.6 TB

Below are the type of errors I am getting.

1.  I believe that this error occurred because of zookeeper session
expired. To address this I changed the parameter minSessionTimeout in
configuration to large value. However some workers still throw this
error.

2014-05-27 00:19:55,187 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:185)
Caused by: java.lang.IllegalStateException: Failed to create job state
path due to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

2. I dont know why this below error is thrown. My guess is that,
master worker is failing for some reason

2014-05-27 00:19:55,184 ERROR org.apache.giraph.master.MasterThread:
masterThread: Master algorithm failed with IllegalStateException
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405262302_0003/_masterJobState
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more

3. Below is one more type of error
java.lang.IllegalStateException: Failed to create job state path due
to KeeperException
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:679)
 at
org.apache.giraph.master.BspServiceMaster.becomeMaster(BspServiceMaster.java:843)
 at org.apache.giraph.master.MasterThread.run(MasterThread.java:98)
Caused by: org.apache.zookeeper.KeeperException$SessionExpiredException:
KeeperErrorCode = Session expired for
/_hadoopBsp/job_201405261249_0008/_masterJobState
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:118)
 at
org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
 at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:637)
 at
org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:152)
 at org.apache.giraph.bsp.BspService.getJobState(BspService.java:670)
 ... 2 more
2014-05-26 18:19:54,269 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread
org.apache.giraph.master.MasterThread, msg = java.lang.Il$
java.lang.IllegalStateException: java.lang.IllegalStateException:
Failed to create job state path due to KeeperException
 at