Hi all,

Previously with MapReduceV1, the suggestion was to have a 1:1
correspondence between workers and compute nodes (machines) and set the
number of the threads to be the number of cores per machines. To achieve
this configuration, we would set "mapred.tasktracker.map.tasks.maximum=1".
Since workers correspond to mappers this would ensure there was one worker
per machine.

Now I am reading that with Yarn this property longer exists as there aren't
tasktrackers. Instead, we have the global properties
"yarn.nodemanager.resource.cpu-vcores", which specifies the cores _per
node_, and the property "mapreduce.map.cpu.vcores", which specifies the
cores _per map task_.

If we want to have one mapper per node that is fully utilizing the machine,
I assume we should just set mapreduce.map.cpu.vcores =
yarn.nodemanager.resource.cpu-vcores = the # of cores per node. Is this
correct?

Do I still need to set giraph.numComputeThreads to be the number of cores
per node?

Thanks,
Steve

Reply via email to