Re: Problem occured when running job with 1 worker.

2014-01-20 Thread Sertuğ Kaya

Hi Jyoti;
I assume this is the log of master vertex. It seems like master can not 
reach a worker for some reason. Did you also check the worker vertex's 
log? Maybe you can share it too.

Sertug

On 20-01-2014 09:22, Jyoti Yadav wrote:
*h.master.MasterThread: masterThread: Master algorithm failed with 
ArrayIndexOutOfBoundsException

java.lang.ArrayIndexOutOfBoundsException: -1*




Re: Null Pointer Exception when running a job

2013-11-09 Thread Sertuğ Kaya

Hello Artie;
I also had such problems when working on large datasets. It probably 
needs a better hadoop/mapreduce configuration. For example did you try 
increasing the memory of task tracker child processes in mapred-site.xml 
such as:

property
namemapred.child.java.opts/name
value-Xmx3000m/value
  /property

Regards;
Sertug

On 08-11-2013 01:26, Artie Pesh-Imam wrote:

Hey

It does work against a smaller data set and Im running 1.0.0 for hadoop 2.


--
Artie Pesh-Imam
Software Engineer @ Tapad
201-253-9647

From: Mohammad Islam Mohammad Islam mailto:misla...@yahoo.com
Reply: user@giraph.apache.org user@giraph.apache.org 
mailto:user@giraph.apache.org, Mohammad Islam misla...@yahoo.com 
mailto:misla...@yahoo.com

Date: November 7, 2013 at 6:20:53 PM
To: user@giraph.apache.org user@giraph.apache.org 
mailto:user@giraph.apache.org

Subject: Re: Null Pointer Exception when running a job

Hi Artie,
Hopefully someone would give you a concrete answer.

My rambling comments:
Did you try with a smaller number of vertices? I want to know if the 
problem is due to increased load.

What version of Giraph are you using?

Regards,
Mohammad


On Thursday, November 7, 2013 2:10 PM, Artie Pesh-Imam 
artie.pesh-i...@tapad.com wrote:
Please let me know if any additional context would be useful. With no 
detail in the error message, I have no idea what the root issue could be.


--
Artie Pesh-Imam
Software Engineer @ Tapad
201-253-9647

From: Artie Pesh-Imam Artie Pesh-Imam mailto:artie.pesh-i...@tapad.com
Reply: Artie Pesh-Imam artie.pesh-i...@tapad.com 
mailto:artie.pesh-i...@tapad.com

Date: November 7, 2013 at 4:25:30 PM
To: user@giraph.apache.org user@giraph.apache.org 
mailto:user@giraph.apache.org

Subject: Null Pointer Exception when running a job
Hi all,

Im running a simple connected components algorithm against a graph of 
almost 200 million vertices.


I have pretty much no guess on what this issue could be:
java.lang.IllegalStateException: run: Caught an unrecoverable exception null
 at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:102)
 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:672)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:330)
 at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
 at java.security.AccessController.doPrivileged(Native Method)
 at javax.security.auth.Subject.doAs(Subject.java:415)
 at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
 at org.apache.hadoop.mapred.Child.main(Child.java:262)
Caused by: java.lang.NullPointerException
 at 
org.apache.giraph.comm.SendCache.removeWorkerData(SendCache.java:157)
 at org.apache.giraph.comm.SendCache.removeAllData(SendCache.java:171)
 at 
org.apache.giraph.comm.SendMessageCache.removeAllMessages(SendMessageCache.java:92)
 at 
org.apache.giraph.comm.netty.NettyWorkerClientRequestProcessor.flush(NettyWorkerClientRequestProcessor.java:410)
 at 
org.apache.giraph.worker.BspServiceWorker.sendWorkerPartitions(BspServiceWorker.java:1316)
 at 
org.apache.giraph.worker.BspServiceWorker.exchangeVertexPartitions(BspServiceWorker.java:1361)
 at 
org.apache.giraph.graph.GraphTaskManager.execute(GraphTaskManager.java:255)
 at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:92)


--
Artie Pesh-Imam
Software Engineer @ Tapad
201-253-9647






Some of the workers hanging while others finish

2013-11-08 Thread Sertuğ Kaya

Hi all;

I have a 10 machine cluster; 8 Tasktrackers, 1 namenode and 1 
Jobtracker. All of them have 8GB memory.
When I run a giraph job with 16 workers on a hbase table with 15 
millions of rows (12GB of data) my job does not finish due to 2 
(sometimes 3) workers stuck at superstep -1. Below you can see the logs 
of these workers. Any helps/ideas?

Thanks
Sertug


*_syslog logs_*
2013-11-08 19:00:06,576 INFO org.apache.hadoop.util.NativeCodeLoader: Loaded 
the native-hadoop library
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:zookeeper.version=3.3.3-1073969, built on 02/23/2011 22:27 GMT
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:host.name=host-10-6-149-104
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:java.version=1.7.0_21
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:java.vendor=Oracle Corporation
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:java.home=/usr/java/jdk1.7.0_21/jre
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:java.class.path=/hadoop/mapred/taskTracker/root/jobcache/job_201311081157_0003/jars/classes:/hadoop/mapred/taskTracker/root/jobcache/job_201311081157_0003/jars:/hadoop/mapred/taskTracker/root/jobcache/job_201311081157_0003/attempt_201311081157_0003_m_09_0/work:/usr/local/hadoop-1.2.1/libexec/../conf:/usr/java/default/lib/tools.jar:/usr/local/hadoop-1.2.1/libexec/..:/usr/local/hadoop-1.2.1/libexec/../hadoop-core-1.2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/asm-3.2.jar:/usr/local/hadoop-1.2.1/libexec/../lib/aspectjrt-1.6.11.jar:/usr/local/hadoop-1.2.1/libexec/../lib/aspectjtools-1.6.11.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-beanutils-1.7.0.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-beanutils-core-1.8.0.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-cli-1.2.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-codec-1.4.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-collections-3.2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-configuration-1.6.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-daemon-1.0.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-digester-1.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-el-1.0.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-httpclient-3.0.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-io-2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-lang-2.4.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-logging-1.1.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-logging-api-1.0.4.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-math-2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/commons-net-3.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/core-3.1.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/giraph-nutch-1.1.0-SNAPSHOT-for-hadoop-0.20.203.0-jar-with-dependencies.jar:/usr/local/hadoop-1.2.1/libexec/../lib/hadoop-capacity-scheduler-1.2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/hadoop-fairscheduler-1.2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/hadoop-thriftfs-1.2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/hsqldb-1.8.0.10.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jackson-core-asl-1.8.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jackson-mapper-asl-1.8.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jasper-compiler-5.5.12.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jasper-runtime-5.5.12.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jdeb-0.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jersey-core-1.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jersey-json-1.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jersey-server-1.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jets3t-0.6.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jetty-6.1.26.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jetty-util-6.1.26.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jsch-0.1.42.jar:/usr/local/hadoop-1.2.1/libexec/../lib/junit-4.5.jar:/usr/local/hadoop-1.2.1/libexec/../lib/kfs-0.2.2.jar:/usr/local/hadoop-1.2.1/libexec/../lib/log4j-1.2.15.jar:/usr/local/hadoop-1.2.1/libexec/../lib/mockito-all-1.8.5.jar:/usr/local/hadoop-1.2.1/libexec/../lib/oro-2.0.8.jar:/usr/local/hadoop-1.2.1/libexec/../lib/servlet-api-2.5-20081211.jar:/usr/local/hadoop-1.2.1/libexec/../lib/slf4j-api-1.4.3.jar:/usr/local/hadoop-1.2.1/libexec/../lib/slf4j-log4j12-1.4.3.jar:/usr/local/hadoop-1.2.1/libexec/../lib/xmlenc-0.52.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jsp-2.1/jsp-2.1.jar:/usr/local/hadoop-1.2.1/libexec/../lib/jsp-2.1/jsp-api-2.1.jar
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client 
environment:java.library.path=/usr/local/hadoop-1.2.1/libexec/../lib/native/Linux-amd64-64:/hadoop/mapred/taskTracker/root/jobcache/job_201311081157_0003/attempt_201311081157_0003_m_09_0/work
2013-11-08 19:00:07,449 INFO org.apache.zookeeper.ZooKeeper: Client