Re: Input format problems running Giraph 1.1.0 on Twitter dataset

2015-05-04 Thread Steven Harenberg
(even if >> the input dataset has no "double" vertex values). An example (for Giraph >> 1.0.0) can be found here: >> https://github.com/xvz/graph-processing/blob/master/giraph-1.0.0/giraph-examples/src/main/java/org/apache/giraph/examples/SimplePageRankInputFormat.

Re: Optimal configuration for Giraph on YARN

2015-04-29 Thread Steven Harenberg
defaults to 1) or > yarn.nodemanager.resource.cpu-vcores (defaults to 8), while we set > giraph.numComputeThreads (say to 16). I expect every worker will run 16 > threads on 1 core, but wanted to see if you have the same understanding. > > Thanks, > Arjun. > > On Wed, Apr 29, 2015 at 8:50 AM,

Re: Optimal configuration for Giraph on YARN

2015-04-29 Thread Steven Harenberg
re any > other parameters that must be set in order to make sure we are *really* > using the cores, not just multi-threading on a single core? > > > On Wed, Mar 18, 2015 at 11:48 AM, Steven Harenberg > wrote: > >> Hi all, >> >> Previously with MapReduceV1, t

Re: Input format problems running Giraph 1.1.0 on Twitter dataset

2015-04-29 Thread Steven Harenberg
Hey Kenrick, First, your commands above are wrong since you are specifying adjacency list format with the -vif argument and since I believe *LongLongNullTextInputFormat *refers to adjacency list format. However, even with the right commands there will be issues and more things you need to do. I d

Re: Giraph 1.1.0 not running on full cluster with Hadoop 2.6.0

2015-03-26 Thread Steven Harenberg
Thanks, > Kenrick > > On Tue, Mar 24, 2015 at 3:06 PM, Steven Harenberg > wrote: > >> Hey Kenrick, >> >> For the issue with GiraphApplicationMaster, I followed what Phillip did >> here: >> http://mail-archives.apache.org/mod_mbox/giraph-user/201503

Re: Giraph 1.1.0 not running on full cluster with Hadoop 2.6.0

2015-03-24 Thread Steven Harenberg
Hey Kenrick, For the issue with GiraphApplicationMaster, I followed what Phillip did here: http://mail-archives.apache.org/mod_mbox/giraph-user/201503.mbox/%3CCAO3ErG_obGV8mELzX1j%2Be%3DaL6C%3D6%3DtdiSOVRBia2gh0H9tYLZA%40mail.gmail.com%3E Basically you need the jar for giraph-examples to be in th

Re: Giraph 1.1.0 not running on full cluster with Hadoop 2.6.0

2015-03-23 Thread Steven Harenberg
Try adding this option: -ca giraph.SplitMasterWorker=false --Steve On Sun, Mar 22, 2015 at 4:28 PM, Kenrick Fernandes wrote: > Hi, > > I am working with Giraph 1.1.0 and a YARN cluster with Hadoop 2.6.0. > I build Giraph with > mvn -Phadoop_2 -Dhadoop.version=2.6.0 clean package -DskipTest

Optimal configuration for Giraph on YARN

2015-03-18 Thread Steven Harenberg
Hi all, Previously with MapReduceV1, the suggestion was to have a 1:1 correspondence between workers and compute nodes (machines) and set the number of the threads to be the number of cores per machines. To achieve this configuration, we would set "mapred.tasktracker.map.tasks.maximum=1". Since wo

Re: [SOLVED] Re: Giraph job never ends

2015-03-15 Thread Steven Harenberg
is-running-beyond-virtual-memory-limits Everything seems to be working for me now. On Fri, Mar 13, 2015 at 10:24 PM, Steven Harenberg wrote: > Thanks Phil, I appreciate the help. Your posts over the past couple days > have already been quite helpful. > > There were a few things I was goin

Re: [SOLVED] Re: Giraph job never ends

2015-03-13 Thread Steven Harenberg
the area (I see you have an @ncsu.edu address), and you can come by, I > might be able to help you then. Part of my presentation will be > walking through how to setup a Giraph / YARN cluster, based on my > experiences over the past few days... > > > Phil > > This message op

Re: [SOLVED] Re: Giraph job never ends

2015-03-13 Thread Steven Harenberg
Hey Phil, I have been having the exact same problems as you (I am also setting up Giraph on EC2), but this solution did not work for me. Do you recall what error you saw in resourcemanager logs? I am also looking at these logs, but nothing is standing out to me. In fact, it almost seems like the

Re: How to format Giraph input dataset

2015-03-13 Thread Steven Harenberg
Hi Ralph, I also wanted to use edge-list input format as well since I am running examples from SNAP. I ran into a lot of issues and at this point if I could go back in time I would probably just make a script to convert the graphs into giraphs standard format. To deal with the type of errors you

Vertex without outgoing edges is not active in superstep 0

2015-02-20 Thread Steven Harenberg
Hi all, I am wondering if there is a way to make all vertices active in superstep 0. By default, it seems that Giraph only makes vertices active in superstep 0 iff the vertex has an outgoing edge. This creates an issue with SSSP which sets the initial values to Double.MAX_VALUE at superstep 0. Th

Re: Performance metrics / calling external scripts

2015-01-16 Thread Steven Harenberg
etrics after each superstep to each worker's system.out log. > > Ursprüngliche Nachricht ---- > Von: Steven Harenberg > Datum:14.01.2015 19:55 (GMT+01:00) > An: user > Betreff: Performance metrics / calling external scripts > > Hi all, > > I am attempting

Performance metrics / calling external scripts

2015-01-14 Thread Steven Harenberg
Hi all, I am attempting to measure some performance metrics (such as runtime, memory usage, network communication, etc.) using an external bash script that grabs some machine stats. I am having difficulty figuring out where to externally call this script in Giraph. Particularly, I would like to c