Hello All, I am confused over how MapReduce tasks select data blocks for processing user requests ?
As data block replication replicates single data block over multiple datanodes, during job processing how uniquely data blocks are selected for processing user requests ? How does it guarantees that no same block gets chosen twice or thrice for different mapper task. Thank you -Mehal
