Re: Cluster config: Mapper:Reducer Task Capapcity

2013-10-01 Thread Himanshu Vijay
What is the down side of increasing both mapred.tasktracker.map.tasks.maximum and mapred.tasktracker.reduce.tasks.maximum to same value ? I read on this linkhttp://developer.yahoo.com/hadoop/tutorial/module7.htmlthat: mapred.tasktracker.map.tasks.maximum 1/2 * (cores/node) to 2 *

Cluster config: Mapper:Reducer Task Capapcity

2013-09-30 Thread Himanshu Vijay
Hi, Our Hadoop cluster is running 0.20.203. The cluster currently has 'Map Task Capacity' of 8900+ 'Reduce Task Capacity' of 3300+ resulting in a ratio of 2.7. We have a lot of variety of jobs running and we want to increase the throughput. My manual observation was that we hit the Mapper

Re: Cluster config: Mapper:Reducer Task Capapcity

2013-09-30 Thread Sandy Ryza
Hi Himanshu, Changing the ratio is definitely a reasonable thing to do. The capacities come from the mapred.tasktracker.map.tasks.maximum and mapred.tasktracker.reduce.tasks.maximum tasktracker configurations. You can tweak these on your nodes to get your desired ratio. -Sandy On Mon, Sep