Hello, I am trying to run a biagram count on a 12-node cluster setup. For an input file of 135 splits (around 7.5 GB), the job fails for some of the runs.
The error that I get on the jobtracker that out of 135 mappers, 1 of the mapper fails because of "Too many fetch-failures Too many fetch-failures Too many fetch-failures Too many fetch-failures " As a result of this mapper failure, whole job fails -- the reducers which are making progress also stalls. Could anyone please help in regard of solving this error. thanks in advance.
