Giraph job doesn't work on Oozie workflow

2014-10-23 Thread 이성일
Hi, dears Now, I'm trying to run giraph job on Apache Oozie workflow. But, it's not finish and running eternally. Is there any one like me? I'm attaching my oozie files, hadoop log message and s/w version Thanks, Seong-il - Hadoop : 2.2.0 - Giraph : release-1.1.0-RC0 (compiled as mvn

Excessive Memory Usage Compared to Graph Size

2014-10-23 Thread Matthew Saltz
Hi everyone, I'm working on a community detection algorithm for giraph and I'm trying to execute the algorithm on the Friendster graph, which has about 65M nodes and about 1.8 billion edges. Running on 16 machines, before doing ANY processing, it's taking about 50G of RAM. That's 800G total for

Giraph job can not finish last superstep

2014-10-23 Thread RainShine79
Hello all, i have a giraph job which seems to executed successfully: in the logs and on the hadoop webinterface i can see that all supersteps are executed successfully. The only problem i got is that the output seems to not get written to hdfs.  As far as i know from personal research

Resource Allocation Model Of Apache Giraph

2014-10-23 Thread Charith Wickramarachchi
Hi Folks, I'm wondering what is the resource allocation model for Apache Giraph. As I understand each worker is one to one Mapped with a Mapper and a worker can process multiple partitions with a user defined number of threads. Is it possible to make sure that one worker, only process a single

Re: Resource Allocation Model Of Apache Giraph

2014-10-23 Thread Claudio Martella
the way mappers (or containers) and hence workers are assigned to machines is not under the control of giraph, but of the underlying hadoop environment (with different responsibilities that depend on the hadoop version, e.g. YARN). You'll have to tweak your hadoop configuration to control the

Re: Resource Allocation Model Of Apache Giraph

2014-10-23 Thread Charith Wickramarachchi
Thanks Claudio and Avery, I find a way way to configure hadoop to have desired number of mappers per machine as Claudio mentioned. Avery, Could you please tell me how I can configure giraph to make each worker handle only a single partition? Thanks, Charith On Thu, Oct 23, 2014 at 2:26 PM,