> Is the configured amount of tasks for reuse a suggestion or will it actually
> use it?  For example, if I’ve configured it to use a JVM for 4 tasks, will a
> TaskTracker that has 8 tasks to process use 2 JVMs?  Or does it decide if it
> actually wants to reuse one up to the maximum configured number?

It will launch mapred.tasktracker.map.tasks.maximum +
mapred.tasktracker.reduce.tasks.maximum JVMs. If a job has more than
that many tasks for that task tracker, it will then reuse those.

> Will a TaskTracker use the same JVM for tasks from different jobs?  Related
> to this, will a TaskTracker idle a JVM until it has enough tasks to fit the
> maximum JVM reuse variable?

It will not reuse JVMs across jobs. When the job finishes, all task
JVMs will be shutdown.


-Joey

-- 
Joseph Echeverria
Cloudera, Inc.
443.305.9434

Reply via email to