One thing to keep in mind is that overcommitting the number of cores can lead
to map timeouts unless the map task submits progress updates to the jobtracker. I
found that out the hard way with a few computationally expensive maps.
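
For reference, a minimal sketch of the timeout side of this. It assumes the
Hadoop 0.20-era property mapred.task.timeout, which is the number of
milliseconds a task may go without reporting progress before it is killed. The
value below is only an illustration, not a recommendation. Calling
Reporter.progress() periodically from inside a long-running map is usually the
cleaner fix.

```xml
<!-- mapred-site.xml: illustrative value only (20 minutes, in milliseconds) -->
<property>
  <name>mapred.task.timeout</name>
  <value>1200000</value>
</property>
```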

Nick Jones

-----Original Message-----
From: Vitaliy Semochkin [] 
Sent: Thursday, July 08, 2010 5:15 AM
Subject: Re: How to control the number of map tasks for each nodes?


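In mapred-site.xml you should place:

   <property>
     <name>mapred.tasktracker.map.tasks.maximum</name>
     <value>8</value>
     <description>the number of available cores on the tasktracker machines
     for map tasks</description>
   </property>
   <property>
     <name>mapred.tasktracker.reduce.tasks.maximum</name>
     <value>8</value>
     <description>the number of available cores on the tasktracker machines
     for reduce tasks</description>
   </property>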

where 8 is the number of your CORES, not CPUs; if you have 8 dual-core
processors, put 16 there.
I found that setting the number of map tasks a bit higher than the number of
cores works better, because sometimes Hadoop waits for I/O operations and task do
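
For the mixed cluster in the question, one possible approach is to give each
slave its own mapred-site.xml, on the assumption that every tasktracker reads
the slot limit from its local configuration when it starts. The sketch below
uses the counts from the question (8 on the i7 slaves, 2 on the dual-core
slaves). The tasktracker would probably need a restart to pick up the change.

```xml
<!-- mapred-site.xml on the i7 slaves -->
<property>
  <name>mapred.tasktracker.map.tasks.maximum</name>
  <value>8</value>
</property>

<!-- mapred-site.xml on the dual-core slaves -->
<property>
  <name>mapred.tasktracker.map.tasks.maximum</name>
  <value>2</value>
</property>
```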

Vitaliy S

On Thu, Jul 8, 2010 at 1:07 PM, edward choi <> wrote:

> Hi,
> I have a cluster consisting of 11 slaves and a single master.
> The thing is that 3 of my slaves have i7 cpu which means that they can have
> up to 8 simultaneous processes.
> But other slaves only have dual core cpus.
> So I was wondering if I can specify the number of map tasks for each of my
> slaves.
> For example, I want to give 8 map tasks to the slaves that have i7 cpus and
> only two map tasks to the others.
> Is there a way to do this?
