how to build a stable version of giraph

2014-05-16 Thread yeshwanth kumar
hi i am using giraph on

Hortonworks Data platform Hadoop 2.2

while building on branch Trunk i am unable to run simple shortestpath
example,

my issue is similar to exactly same as this one,

http://mail-archives.apache.org/mod_mbox/giraph-user/201401.mbox/browser

i am thinking there might be some borken things on trunk,
so i looked at other branches and tags where they are 1 year old.
i want to try reverting to a stable giraph.
can someone specify a commit id to revert to a stable version of giraph.

thanks,
yeshwanth


Re: Superstep duration increases

2014-05-16 Thread Claudio Martella
I'd start by taking HBase out of the equation.


On Thu, May 8, 2014 at 1:46 PM, Pascal Jäger  wrote:

> Hi all,
>
> I have implemented a label propagation algorithm to find clusters in a
> graph.
> I just realized that the time the algorithm takes for one superstep is
> increasing and I don’t know why.
>
> The graph is static and the number of messages is the same throughout all
> supersteps.
> During every superstep each node sends its label to its neighbors which
> then calculate their label based on the received messages and then again
> send their label.
> At the end of each superstep each node writes a nodeID - label pair to an
> HBase table.
>
> Do you have any general hints where I can look at?
>
> I absolutely have no clue where to start
>
> Thanks for your help!
>
> Regards
>
> Pascal
>
>


-- 
   Claudio Martella


Re: issues while running giraph on Hadoop 2.2

2014-05-16 Thread Kiru Pakkirisamy
I have the same issue running giraph built with -Phadoop_yarn 
-Dhadoop.version=2.2.0 on Hortonworks.
I get classnotfoundexceptions, looks like the GiraphRunner may not be able to 
load the classes correctly while running on Yarn.
So if anybody has had success please share your experience, wondering what I 
have been doing wrong.

Exception in thread "main" java.lang.ClassNotFoundException: 
org.apache.giraph.examples.SimpleShortestPathsComputation

 
Regards,
- kiru



 From: yeshwanth kumar 
To: "user@giraph.apache.org"  
Sent: Thursday, May 8, 2014 4:47 AM
Subject: issues while running giraph on Hadoop 2.2
 


hi i am running Hortonworks hadoop 2.2
trying to run giraph shortest path example.
i am getting this exception



$ hadoop jar 
giraph-examples/target/giraph-examples-1.1.0-SNAPSHOT-for-hadoop-2.2.0-jar-with-dependencies.jar
 org.apache.giraph.GiraphRunner 
org.apache.giraph.examples.SimpleShortestPathsComputation -vif 
org.apache.giraph.io.formats.JsonLongDoubleFloatDoubleVertexInputFormat -vip 
/user/hdfs/input/tiny.txt -vof 
org.apache.giraph.io.formats.IdWithValueTextOutputFormat -op 
/user/hdfs/output/shortestpaths -w 1
14/05/07 23:45:42 INFO utils.ConfigurationUtils: No edge input format 
specified. Ensure your InputFormat does not require one.
14/05/07 23:45:42 INFO utils.ConfigurationUtils: No edge output format 
specified. Ensure your OutputFormat does not require one.
14/05/07 23:45:42 INFO yarn.GiraphYarnClient: Final output path is: 
hdfs://server02.infnet:8020/user/hdfs/output/shortestpaths
14/05/07 23:45:42 INFO yarn.GiraphYarnClient: Running Client
14/05/07 23:45:42 INFO client.RMProxy: Connecting to ResourceManager at 
server02.infnet/192.168.1.85:8050
14/05/07 23:45:42 INFO yarn.GiraphYarnClient: Defaulting per-task heap size to 
1024MB.
Exception in thread "main" java.lang.IllegalStateException: Giraph's estimated 
cluster heap 2048MB ask is greater than the current available cluster heap of 
0MB. Aborting Job.
at 
org.apache.giraph.yarn.GiraphYarnClient.checkPerNodeResourcesAvailable(GiraphYarnClient.java:230)
at org.apache.giraph.yarn.GiraphYarnClient.run(GiraphYarnClient.java:124)
at org.apache.giraph.GiraphRunner.run(GiraphRunner.java:96)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
at org.apache.giraph.GiraphRunner.main(GiraphRunner.java:126)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:212)



i am missing some configuration, can someone point me in the right direction.

thanks,
Yeshwanth

Re: Error while executing large graph

2014-05-16 Thread Arun Kumar
Hi
Thanks for the replay .

I am running this example in a cluster of 5 machines each machine is having
16 GB of ram.The java  heap size is set as 2000mb and java.child.options is
set with 2000mb and each machine has 4 cores and total number of map
instance is set as 3.
So for each slave machine 10 gb will be used.

My input data is of 1gb size
In this scenario how can out of memory error occur  .Please clarify

Regards
Arun




On Thu, May 15, 2014 at 12:43 AM, Avery Ching  wrote:

>  I think this is the key message.
>
>
> 0 out of 196 partitions computed; min free memory on worker 6 - 0.81MB,
> average 11.56MB
>
> Having less than 1 MB free won't work.  Your workers are likely OOM,
> killing the job.  Can you get more memory for your job?
>
>
> On 5/14/14, 3:13 AM, Arun Kumar wrote:
>
>  Hi when i run giraph job against a data of 1 gb i am getting the below
> exception after some times can somebody tell me what is the issue?
> 14/05/14 01:54:01 INFO job.JobProgressTracker: Data from 14 workers -
> Compute superstep 2: 0 out of 4847571 vertices computed; 0 out of 196
> partitions computed; min free memory on worker 6 - 0.81MB, average 11.56MB
> 14/05/14 01:54:03 INFO zookeeper.ClientCnxn: Unable to read additional
> data from server sessionid 0x145f9cff031000f, likely server has closed
> socket, closing socket connection and attempting reconnect
> 14/05/14 01:54:04 INFO zookeeper.ClientCnxn: Opening socket connection to
> server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to
> authenticate using SASL (unknown error)
> 14/05/14 01:54:04 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for
> server null, unexpected error, closing socket connection and attempting
> reconnect
> java.net.ConnectException: Connection refused
> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> at
> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
> at
> org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
> at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 14/05/14 01:54:06 INFO zookeeper.ClientCnxn: Opening socket connection to
> server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to
> authenticate using SASL (unknown error)
> 14/05/14 01:54:06 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for
> server null, unexpected error, closing socket connection and attempting
> reconnect
> java.net.ConnectException: Connection refused
> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> at
> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
> at
> org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
> at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 14/05/14 01:54:06 WARN zk.ZooKeeperExt: exists: Connection loss on attempt
> 0, waiting 5000 msecs before retrying.
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for
> /_hadoopBsp/job_201405140108_0003/_workerProgresses
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
> at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041)
> at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1069)
> at org.apache.giraph.zk.ZooKeeperExt.exists(ZooKeeperExt.java:360)
> at
> org.apache.giraph.job.JobProgressTracker$2.run(JobProgressTracker.java:87)
> at java.lang.Thread.run(Thread.java:745)
> 14/05/14 01:54:08 INFO zookeeper.ClientCnxn: Opening socket connection to
> server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to
> authenticate using SASL (unknown error)
> 14/05/14 01:54:08 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for
> server null, unexpected error, closing socket connection and attempting
> reconnect
> java.net.ConnectException: Connection refused
> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> at
> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
> at
> org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
> at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 14/05/14 01:54:09 INFO mapred.JobClient:  map 93% reduce 0%
> 14/05/14 01:54:10 INFO zookeeper.ClientCnxn: Opening socket connection to
> server mercado-12.hpl.hp.com/15.25.119.147:22181. Will not attempt to
> authenticate using SASL (unknown error)
> 14/05/14 01:54:10 WARN zookeeper.ClientCnxn: Session 0x145f9cff031000f for
> server null, unexpected error, closing socket connection and attempting
> reconnect
> java.net.ConnectException: Connection refused
> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> at
> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
> at
> org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO

Re: resolveMutations error

2014-05-16 Thread Jyotirmoy Sundi
Hi,
   Not throwing the exception in resolveMutations worked for me. Is it
possible to get some understanding of what effects it would have if any
because the results are working as expected.

for (I vertexId : destinations) {
  if (partition.getVertex(vertexId) == null) {
if (!resolveVertexIndices.put(partitionId, vertexId)) {
  //throw new IllegalStateException(
  //"resolveMutations: Already has missing vertex on this "
+
  //"worker for " + vertexId);
}
  }
}


On Wed, May 14, 2014 at 7:23 AM, Jyotirmoy Sundi  wrote:

> Hi Folks,
>
> I am seeing this error recently in jobs, can you please throw some 
> light on what resolveMutations in NettyWorkerServer.java tries to achieve ?
>
> *Trace:*
>
> Caused by: java.lang.IllegalStateException: resolveMutations: Already has 
> missing vertex on this worker for 8345381246748292335
>   at 
> org.apache.giraph.comm.netty.NettyWorkerServer.resolveMutations(NettyWorkerServer.java:184)
>   at 
> org.apache.giraph.comm.netty.NettyWorkerServer.prepareSuperstep(NettyWorkerServer.java:152)
>   at 
> org.apache.giraph.worker.BspServiceWorker.startSuperstep(BspServiceWorker.java:677)
>   at 
> org.apache.giraph.graph.GraphTaskManager.execute(GraphTaskManager.java:249)
>   at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:92)
>
>
> --
> Best Regards,
> Jyotirmoy Sundi
>
>


-- 
Best Regards,
Jyotirmoy Sundi