Re: How the number of mapper tasks is calculated

2011-07-25 Thread Chiku Singh
re than one dfs block, you lose the data locality scheduling benefits." (https://issues.apache.org/jira/browse/HADOOP-2560) On Tue, Jul 26, 2011 at 12:53 AM, Anfernee Xu wrote: > I have a generic question about how the number of mapper tasks is > calculated, as far as I know, the number

How the number of mapper tasks is calculated

2011-07-25 Thread Anfernee Xu
I have a generic question about how the number of mapper tasks is calculated, as far as I know, the number is primarily based on the number of splits, say if I have 5 splits and I have 10 tasktracker running in the cluster, I will have 5 mapper tasks running in my MR job, right? But what I found