Hi,
I implemented a benchmark that allows me to generate an arbitrarily
large graph (depending on the number of iterations). Now I would like to
configure Giraph so that I can make the best use of my hardware for this
benchmark. Given the number of nodes in my cluster, their amount of
main memory, and their number of cores, how do I determine the optimal
Giraph / Hadoop parameters, specifically:
- the number of used mappers
- the HEAP_SIZE environment variable
- the memory specified in the mapred.map.child.java.opts property
(any other relevant parameters?)
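For context, here is a sketch of the kind of launch command I have in
mind (the benchmark class name, heap size, memory values, and worker
count are placeholders, not a working setup):

```shell
# Placeholder values -- these are exactly the knobs I am asking about.
export HADOOP_HEAPSIZE=1024   # client-side JVM heap, in MB

hadoop jar giraph-with-dependencies.jar org.apache.giraph.GiraphRunner \
  my.benchmark.GraphGenBenchmark \
  -D mapred.map.child.java.opts="-Xmx2g" \
  -w 4   # number of workers, i.e. mappers minus the master task
```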
Also, I was wondering how well Giraph handles computations that start
with a very small graph and mutate it into a very large one. For
example, if I understand correctly, the number of mappers is not
adjusted dynamically during a computation.
Any hints (or links to documentation) are highly appreciated.
Cheers,
Christian