Pipes of computation

2013-07-16 Thread Ahmet Emre Aladağ
Hi, Is there a way to pipe the Computation classes? Example: Run PageRankComputation and then run NormalizationComputation on the same graph without activating Output/Input formats between the two. This way we can re-use some computations like log-scale CDF normalization and don't mess up th

Re: Problem occurs when I increase the number of workers

2013-07-16 Thread Ahmet Emre Aladağ
It should be related to Hadoop. In what mode you're running Hadoop? You may need distributed mode to achieve multiple workers. I don't remember the reason but I had the same problem and have been testing my codes with 1 worker. But it should run on a cluster with multiple workers. On 07/09/2

Re: Problem occurs when I increase the number of workers

2013-07-16 Thread Wonbae Kim
How can I check what you asked ? I'm too newbie sorry... 2013/7/9 Jan van der Lugt > Could be something with memory. Are the workers running on the same > machine? How much memory has been allocated to each mapper slot? How much > physical memory is in the machine(s)? > > - Jan > > > On Tue, Ju

How can I check when the job starts and ends.

2013-07-16 Thread Wonbae Kim
Hello, I'm working on multiple nodes system. I want to check whether jobs are distributed evenly when I submit multiple jobs. In my opinion, one possible way to do it is to print when the job starts and when it ends. So, how can I print these stuffs? or Is there any better way to achieve my goal

Re: Multiple Data Sources

2013-07-16 Thread Tom M
Thanks a lot for your answer. I will try it and get back in case I have another issue. On Mon, Jul 15, 2013 at 11:02 PM, Maja Kabiljo wrote: > Hi Tom, > > We recently added something like this, please take a look > at MultiVertexInputFormat. That one can basically wrap any number of vertex >

RE: SequenceFileVertexInputFormatClass

2013-07-16 Thread david
Actually I'm guessing that I'm just not setting the input path correctly - does this work: GiraphConfiguration conf = job.getConfiguration(); ... GiraphFileInputFormat.addVertexInputPath(conf, new Path(args[0])); conf.setVertexInputFormatClass(SequenceFileVertexInputFormat.class); (in 1.

SequenceFileVertexInputFormatClass

2013-07-16 Thread david
Does anyone have a simple example of how to use it? Do I need to extend it for my classes or can I just use it like I would SequenceFileInputFormat in hadoop. I'm getting the following error; java.lang.IllegalStateException: generateVertexInputSplits: Got IOException at org.apache.gi

Re: GiraphApplicationMaster not found (and other newbie questions)

2013-07-16 Thread Nicholas Karkoulias
Thank you very much for your response. I had built Giraph with the following command: mvn -e -Phadoop_yarn -Dhadoop.version=2.0.5-alpha -DskipTests clean install The patch for issue 688 had been already applied manually, since it wasn't part of the Git repository yet. A few days later I git-fe

Re: Failed to compile Giraph for Hadoop YARN

2013-07-16 Thread Chui-Hui Chiu
Hello, The "clean" argument solved my current problem. However, I see more when I go on. On "http://giraph.apache.org/build.html";, (1) There're commands for secure and non-secure Hadoop. What are the commands for YARN? (2) What do the compile, package, and test do respectively? Do I have to r

Serialization Error

2013-07-16 Thread Lukas Nalezenec
Hi, I have got problem i cannot solve. When one node loads given data and send vertexes belonging to another node, the anorther mode throw this exception. It looks like it expects number of edges but gets random bytes. Did somebody solved something similar ? Thanks Lukas 2013-07-16 14:34:54,329