What is the downside of increasing both
mapred.tasktracker.map.tasks.maximum
and mapred.tasktracker.reduce.tasks.maximum to the same value?
I read at this link,
http://developer.yahoo.com/hadoop/tutorial/module7.html, that:
mapred.tasktracker.map.tasks.maximum 1/2 * (cores/node) to 2 *
Hi,
Our Hadoop cluster is running 0.20.203. The cluster currently has a 'Map Task
Capacity' of 8900+ and a 'Reduce Task Capacity' of 3300+, resulting in a ratio of
2.7. We run a wide variety of jobs and we want to increase
throughput.
My manual observation was that we hit the Mapper
Hi Himanshu,
Changing the ratio is definitely a reasonable thing to do. The capacities
come from the mapred.tasktracker.map.tasks.maximum
and mapred.tasktracker.reduce.tasks.maximum tasktracker configurations.
You can tweak these on your nodes to get your desired ratio.
-Sandy
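
For readers following the arithmetic in this thread, here is a minimal sketch of how the per-node maximums relate to the cluster-wide capacities and their ratio. It assumes every tasktracker uses the same two settings and that cluster capacity is simply the sum of per-node slots. The node count and slot values in the second example are hypothetical and do not come from the thread.

```python
from typing import Tuple


def slot_ratio(map_slots: int, reduce_slots: int) -> float:
    """Return the map-to-reduce slot ratio."""
    if reduce_slots <= 0:
        raise ValueError("reduce_slots must be positive")
    return map_slots / reduce_slots


def cluster_capacity(num_nodes: int, map_max: int, reduce_max: int) -> Tuple[int, int]:
    """Cluster-wide (map, reduce) capacity, assuming identical settings on every node.

    map_max and reduce_max stand in for mapred.tasktracker.map.tasks.maximum
    and mapred.tasktracker.reduce.tasks.maximum.
    """
    return num_nodes * map_max, num_nodes * reduce_max


# Capacities quoted in the thread (both were given as "N+", so these are lower bounds).
print(round(slot_ratio(8900, 3300), 1))

# Hypothetical cluster: 100 nodes with 8 map slots and 4 reduce slots each.
maps, reduces = cluster_capacity(100, 8, 4)
print(maps, reduces, slot_ratio(maps, reduces))
```

Under these assumptions, changing the two per-node maximums is what shifts the ratio. Raising both to the same value would move it toward 1.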
On Mon, Sep