Resources or advice on minimising memory usage in Giraph/Hadoop code ?

2012-06-05 Thread Benjamin Heitmann
Hello, can somebody recommend a web page, article or book on minimising the memory usage of Giraph/Hadoop code ? I am looking for non-obvious advice on what *not* to do, and for best practices on what to do inside of Hadoop... E.g. is it preferable to use Java Strings or Hadoop Text Writables

Re: Multiple jobs on same graph, aggregator use and LocalRunner issue

2012-06-05 Thread Benjamin Heitmann
Hi Clive, On 5 Jun 2012, at 22:21, Clive Cox wrote: > > I recently started playing with Giraph and I have a few questions. > > 1. I'm writing a simple spreading activation algorithm I am also working on a spreading activation algorithm. My original data is in the form of an RDF graph, which

Multiple jobs on same graph, aggregator use and LocalRunner issue

2012-06-05 Thread Clive Cox
Hi, I recently started playing with Giraph and I have a few questions. 1. I'm writing a simple spreading activation algorithm which would be run many times over the same graph with different initial vertices activated. Doing this as separate jobs in which a potentially large graph is loaded each