what is diff between mapred.map.tasks and mapred.tasktracker.map.tasks.maximum

2009-07-02 Thread Pravin Karne
Hi, I am using Nutch on a 10-node cluster and want to configure nutch-site.xml. What is the difference between mapred.map.tasks and mapred.tasktracker.map.tasks.maximum, or between mapred.reduce.tasks and mapred.tasktracker.reduce.tasks.maximum? Thanks, Pravin

mapred.map.tasks

2006-04-20 Thread Anton Potehin
<property>
  <name>mapred.map.tasks</name>
  <value>2</value>
  <description>The default number of map tasks per job. Typically set to a prime several times greater than number of available hosts. Ignored when mapred.job.tracker is local.</description>
</property>

We have a question on this
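To make the "prime several times greater than number of available hosts" rule of thumb concrete, here is a minimal sketch. The helper names and the multiplier of 3 are assumptions chosen for illustration (the description only says "several"); nothing here is part of Nutch or Hadoop.

```python
def is_prime(n):
    """Trial-division primality check; fine for small task counts."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def suggest_map_tasks(num_hosts, multiplier=3):
    """Return the smallest prime >= num_hosts * multiplier.

    `multiplier` is a hypothetical knob standing in for "several times".
    """
    candidate = max(2, num_hosts * multiplier)
    while not is_prime(candidate):
        candidate += 1
    return candidate


if __name__ == "__main__":
    # A 10-node cluster with the assumed multiplier of 3.
    print(suggest_map_tasks(10))
```

The result would then be used as the value of mapred.map.tasks; it is only a starting point and is still subject to the other factors discussed in this thread.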

Re: mapred.map.tasks

2006-04-20 Thread Doug Cutting
fetching, the total number of hosts you're fetching can also be a factor, since fetch tasks are hostwise-disjoint. If you're only fetching a few hosts, then a large value for mapred.map.tasks will cause there to be a few big fetch tasks and a bunch of empty ones. This could be a problem if the big
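The effect described above can be illustrated with a small simulation. This is only a sketch: `assign_fetch_tasks` is a hypothetical helper that buckets URLs by a hash of their host, which is one simple way to keep tasks hostwise-disjoint, and is not claimed to be Nutch's actual partitioning code. The host names are made up.

```python
import zlib
from urllib.parse import urlparse


def assign_fetch_tasks(urls, num_tasks):
    """Bucket URLs so that every URL of a given host lands in the same task."""
    buckets = [[] for _ in range(num_tasks)]
    for url in urls:
        host = urlparse(url).hostname or ""
        # crc32 gives a hash that is stable across runs, unlike hash().
        index = zlib.crc32(host.encode("utf-8")) % num_tasks
        buckets[index].append(url)
    return buckets


if __name__ == "__main__":
    # Three hosts with 100 pages each, split across 23 map tasks.
    urls = [
        "http://host%d.example/page%d" % (h, p)
        for h in range(3)
        for p in range(100)
    ]
    buckets = assign_fetch_tasks(urls, 23)
    non_empty = [len(b) for b in buckets if b]
    print("non-empty tasks:", len(non_empty), "of", len(buckets))
    print("sizes of non-empty tasks:", sorted(non_empty, reverse=True))
```

With only three hosts, at most three of the 23 tasks can receive any URLs, so the remaining tasks are empty regardless of how the hash distributes the hosts.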

Re: mapred.map.tasks

2005-11-21 Thread Doug Cutting
[EMAIL PROTECTED] wrote:
Why do we need the mapred.map.tasks parameter to be greater than the number of available hosts? If we set it equal to the number of hosts, we get the negative progress percentages problem.

Can you please post a simple example that demonstrates the negative progress problem? E.g., the minimal

RE: mapred.map.tasks

2005-11-21 Thread anton
[EMAIL PROTECTED] wrote:
Why we need parameter mapred.map.tasks greater than number of available host? If we set it equal to number of host, we got negative progress percentages problem.

Can you please post