Raj, There is no overlap. Data read from HDFS FileSystem instances go to HDFS_BYTES_READ, and data read from Local FileSystem instances go to FILE_BYTES_READ. These are two different FileSystems, and have no overlap at all.
On Thu, Jul 28, 2011 at 5:56 AM, R V <cattiv...@yahoo.com> wrote: > Hello > > I don't know if the question has been answered. I am trying to understand > the overlap between FILE_BYTES_READ and HDFS_BYTES_READ. What are the various > components that provide value to this counter? For example when I see > FILE_BYTES_READ for a specific task ( Map or Reduce ) , is it purely due to > the spill during sort phase? If a HDFS read happens on a non local node, does > the counter increase on the node where the data block resides? What happens > when the data is local? does the counter increase for both HDFS_BYTES_READ > and FILE_BYTES_READ? From the values I am seeing, this looks to be the case > but I am not sure. > > I am not very fluent in Java , and hence I don't fully understand the source > . :-( > > Raj -- Harsh J