Hi,
I am using Nutch with a 10-node cluster.
I want to configure nutch-site.xml.
What is the difference between mapred.map.tasks and
mapred.tasktracker.map.tasks.maximum,
or between
mapred.reduce.tasks and mapred.tasktracker.reduce.tasks.maximum?
Thanks
-Pravin
From: Pravin Karne
Sent: Thursday, July 02, 2009
<property>
  <name>mapred.map.tasks</name>
  <value>2</value>
  <description>The default number of map tasks per job. Typically set
  to a prime several times greater than number of available hosts.
  Ignored when mapred.job.tracker is local.
  </description>
</property>
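For reference, here is a minimal sketch of how the four properties from the original question might sit together in nutch-site.xml on a 10-node cluster. All values are illustrative assumptions, not recommendations. mapred.map.tasks and mapred.reduce.tasks are per-job defaults (hints for how many tasks a job is split into across the whole cluster), while the two mapred.tasktracker.*.tasks.maximum properties cap how many tasks a single TaskTracker runs simultaneously on its node.

```xml
<?xml version="1.0"?>
<configuration>

  <!-- Per-job hint: total number of map tasks for a job.
       41 is an assumed value: a prime a few times the 10 hosts. -->
  <property>
    <name>mapred.map.tasks</name>
    <value>41</value>
  </property>

  <!-- Per-job default: total number of reduce tasks for a job.
       11 is an assumed value: a prime close to the 10 hosts. -->
  <property>
    <name>mapred.reduce.tasks</name>
    <value>11</value>
  </property>

  <!-- Per-node limit: map tasks one TaskTracker runs at the same time.
       2 is an assumed value; size it to the cores and memory per node. -->
  <property>
    <name>mapred.tasktracker.map.tasks.maximum</name>
    <value>2</value>
  </property>

  <!-- Per-node limit: reduce tasks one TaskTracker runs at the same time.
       2 is an assumed value. -->
  <property>
    <name>mapred.tasktracker.reduce.tasks.maximum</name>
    <value>2</value>
  </property>

</configuration>
```

With these assumed values, the cluster could run at most 10 x 2 = 20 map tasks at once, so a job with 41 map tasks would be worked through in several waves.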
We have a question on this
fetching, the total number of hosts you're fetching can also be a
factor, since fetch tasks are hostwise-disjoint. If you're only
fetching a few hosts, then a large value for mapred.map.tasks will cause
there to be a few big fetch tasks and a bunch of empty ones. This could
be a problem if the big
[EMAIL PROTECTED] wrote:
Why do we need the mapred.map.tasks parameter to be greater than the number
of available hosts? If we set it equal to the number of hosts, we get
negative progress percentages.
Can you please post a simple example that demonstrates the negative
progress problem? E.g., the minimal
, November 22, 2005 2:10 AM
To: nutch-dev@lucene.apache.org
Subject: Re: mapred.map.tasks