All splits sent for processing in a Job carry a list of locations
where their blocks reside -- this plus the network hierarchy details
held by the JT is used to determine the locality level.
Have a look at JobInProgress.getLocalityLevel(), which takes a given
TaskInProgress object, and a TaskTrack
Hi,
I've hadoop installed in a cluster and I would like that JT could
guess in the network topology what are the input files in HDFS that
are closer to him, and further.
So, how can a JT know if an input file is located on local-level, on
rack-level, or on the other level?
Thanks,
--
Pedro