Raj,

There is no overlap. Bytes read through an HDFS FileSystem instance are
counted in HDFS_BYTES_READ, and bytes read through a local FileSystem
instance are counted in FILE_BYTES_READ. These are two separate
FileSystems, so a single read is never counted in both.
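
Since you mention Java is not your strong suit, here is a toy sketch in
Python of the accounting as I understand it. It is not Hadoop's code: the
TaskCounters class, the paths, and the byte counts are all made up, and
it assumes the counter name is simply derived from the scheme of the
FileSystem that served the read.

```python
from collections import Counter


class TaskCounters:
    """Toy model of per-task byte counters (hypothetical, not a Hadoop class)."""

    def __init__(self):
        self.values = Counter()

    def record_read(self, uri, num_bytes):
        # Assumption: the counter is chosen by the FileSystem (URI scheme)
        # that served the read, not by where the bytes physically live.
        scheme = uri.split("://", 1)[0].upper()
        self.values[scheme + "_BYTES_READ"] += num_bytes


counters = TaskCounters()

# Input split read through HDFS. Even if the block happens to be on the
# same node as the task, it is still read through the HDFS FileSystem.
counters.record_read("hdfs://namenode/user/raj/input/part-00000", 64)

# Intermediate spill file read back from the task's local disk.
counters.record_read("file:///tmp/mapred/local/spill0.out", 16)

print(dict(counters.values))
```

Each read lands in exactly one counter, which is why the two values can
both be non-zero for the same task without either one including the other.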

On Thu, Jul 28, 2011 at 5:56 AM, R V <cattiv...@yahoo.com> wrote:
> Hello
>
> I don't know if this question has been answered. I am trying to understand
> the overlap between FILE_BYTES_READ and HDFS_BYTES_READ. What are the various
> components that contribute to these counters? For example, when I see
> FILE_BYTES_READ for a specific task (Map or Reduce), is it purely due to
> the spill during the sort phase? If an HDFS read happens on a non-local node,
> does the counter increase on the node where the data block resides? What
> happens when the data is local? Does the counter increase for both
> HDFS_BYTES_READ and FILE_BYTES_READ? From the values I am seeing, this looks
> to be the case, but I am not sure.
>
> I am not very fluent in Java, and hence I don't fully understand the
> source. :-(
>
> Raj



-- 
Harsh J
