Re: know a location of an input file in a network

2011-01-13 Thread Harsh J
All splits sent for processing in a Job carry a list of the locations where their blocks reside. This list, together with the network hierarchy details held by the JobTracker (JT), is used to determine the locality level. Have a look at JobInProgress.getLocalityLevel(), which takes a given TaskInProgress object, and a TaskTrack
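To make the idea concrete, here is a minimal sketch of how a locality level could be derived from a split's locations and a tracker's position in the topology. It is not the actual JobInProgress code: the class LocalitySketch, the methods matchingLevel and localityLevel, and the "/rack/host" path strings are all hypothetical names chosen for illustration, and it assumes every node sits at the same depth in the topology tree.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative only: hypothetical names, not Hadoop's JobInProgress code.
class LocalitySketch {

    // Number of levels to climb from two same-depth topology paths
    // (assumed form "/rack/host") before they share a common ancestor.
    static int matchingLevel(String pathA, String pathB) {
        String[] a = pathA.split("/");
        String[] b = pathB.split("/");
        if (a.length != b.length) {
            throw new IllegalArgumentException("paths must have the same depth");
        }
        int common = 0;
        while (common < a.length && a[common].equals(b[common])) {
            common++;
        }
        return a.length - common;
    }

    // Smallest matching level over all locations that hold the split's blocks.
    // With "/rack/host" paths: 0 = same host, 1 = same rack, 2 = other rack.
    static int localityLevel(String trackerPath, List<String> splitLocations) {
        // With no known locations, fall back to the farthest level.
        int best = trackerPath.split("/").length - 1;
        for (String location : splitLocations) {
            best = Math.min(best, matchingLevel(trackerPath, location));
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> replicas = Arrays.asList("/rack1/host1", "/rack2/host5");
        System.out.println(localityLevel("/rack1/host1", replicas)); // same host
        System.out.println(localityLevel("/rack1/host2", replicas)); // same rack
        System.out.println(localityLevel("/rack3/host9", replicas)); // other rack
    }
}
```

The sketch takes the minimum over all replica locations because a split counts as local to a tracker if any one of its block locations is close to it.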

know a location of an input file in a network

2011-01-13 Thread Pedro Costa
Hi, I have Hadoop installed in a cluster, and I would like the JT to work out from the network topology which input files in HDFS are closer to it and which are farther away. So, how can a JT know whether an input file is located at the local level, at the rack level, or at some other level? Thanks, -- Pedro