re than one dfs block, you lose the data locality
scheduling benefits." (https://issues.apache.org/jira/browse/HADOOP-2560)
On Tue, Jul 26, 2011 at 12:53 AM, Anfernee Xu wrote:
> I have a generic question about how the number of mapper tasks is
> calculated, as far as I know, the number
I have a generic question about how the number of mapper tasks is
calculated, as far as I know, the number is primarily based on the number of
splits, say if I have 5 splits and I have 10 tasktracker running in the
cluster, I will have 5 mapper tasks running in my MR job, right?
But what I found