Re: Fairscheduler won't run more than 3 map tasks (from 48 available)

2011-08-18 Thread Arun C Murthy
0.20.203 made the TaskTracker (TT) more aggressive about heartbeats, though not overly so; that should help a lot.

Arun

On Aug 18, 2011, at 2:25 PM, Mck wrote:
>> How long are your tasks, and which version of Hadoop are you using?
>
> Hadoop-0.20.1 (eventually we're looking to upgrade to Brisk).

Re: Fairscheduler won't run more than 3 map tasks (from 48 available)

2011-08-18 Thread Matei Zaharia
Okay, great! Unfortunately, task launching is still slow even with JVM reuse set to -1, because of heartbeats (the slave node only updates its state with the master every ~5 seconds).

Matei

On Aug 18, 2011, at 5:25 PM, Mck wrote:
>> How long are your tasks, and which version of Ha

Re: Fairscheduler won't run more than 3 map tasks (from 48 available)

2011-08-18 Thread Mck
> How long are your tasks, and which version of Hadoop are you using?

Hadoop-0.20.1 (eventually we're looking to upgrade to Brisk).
Tasks take 5-30 seconds.

> In older versions (0.20.*), the fair
> scheduler doesn't launch multiple tasks per heartbeat, so it performs poorly
> when your tasks a

Re: Fairscheduler won't run more than 3 map tasks (from 48 available)

2011-08-18 Thread Matei Zaharia
How long are your tasks, and which version of Hadoop are you using? In older versions (0.20.*), the fair scheduler doesn't launch multiple tasks per heartbeat, so it performs poorly when your tasks are small (less than 5-10 seconds). You may be able to improve it a bit by setting mapred.fairsch
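
The effect described here can be roughly estimated with a little arithmetic. The sketch below is a simplified model, not Hadoop code: it assumes each node launches at most one task per heartbeat and heartbeats about every 5 seconds (the figure mentioned elsewhere in this thread), and it ignores any extra heartbeats a node may send. The function name and parameters are hypothetical, chosen only for illustration.

```python
def max_concurrent_tasks(nodes, slots_per_node, task_seconds,
                         heartbeat_seconds=5.0, tasks_per_heartbeat=1):
    """Estimate steady-state concurrent tasks across the cluster.

    Simplified model (an assumption, not Hadoop's actual logic): a node
    starts at most `tasks_per_heartbeat` tasks every `heartbeat_seconds`,
    so it keeps roughly (launch rate * task duration) tasks running,
    capped by the number of slots configured on that node.
    """
    launch_rate = tasks_per_heartbeat / heartbeat_seconds  # tasks per second
    per_node = min(slots_per_node, launch_rate * task_seconds)
    return nodes * per_node


# 3 nodes with 16 map slots each, as in the original question.
print(max_concurrent_tasks(3, 16, task_seconds=5))    # short tasks
print(max_concurrent_tasks(3, 16, task_seconds=30))   # longer tasks
print(max_concurrent_tasks(3, 16, task_seconds=300))  # long enough to fill all slots
```

Under these assumptions, 5-second tasks keep only about one map running per node, which would be consistent with seeing roughly 3 of the 48 slots in use.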

Fairscheduler won't run more than 3 map tasks (from 48 available)

2011-08-18 Thread Mick Semb Wever
I'm new to setting up Hadoop's scheduler and I'm trying to set up Fairscheduler on a 3-node cluster. The initial setup is fine but throughput is abysmal. Each node is configured with 16 map task capacity and 8 reduce task capacity. Most jobs being run are reading data from Cassandra installed on t