[ 
https://issues.apache.org/jira/browse/HADOOP-6859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888032#action_12888032
 ] 

Suresh Srinivas commented on HADOOP-6859:
-----------------------------------------

I am planning to introduce the following additional statistics, that is 
accumulated at the client per file system as it is done now.
# read operations - number of read operations such as listStatus, 
getFileBlockLocations, open etc.
# write operations - number of write operations such as create, append, 
setPermission etc.
# large read operations - on file system, most of the operations are small 
except listFiles for a large directory. Iterative listFiles was introduced in 
HDFS to break down a single large operation into smaller steps. This counter is 
incremented is incremented for every iteration of listFiles, when listing files 
under a large directory.

These statistics are collected in job history for analysis of how HDFS is 
loaded by map reduce tasks. This is useful in the interim to identify jobs that 
heavily load HDFS. In future this could also be used to throttle the load at 
the map reduce framework.

> Introduce additional statistics to FileSystem
> ---------------------------------------------
>
>                 Key: HADOOP-6859
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6859
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 0.22.0
>            Reporter: Suresh Srinivas
>            Assignee: Suresh Srinivas
>             Fix For: 0.22.0
>
>
> Currently FileSystem#statistics tracks bytesRead and bytesWritten. Additional 
> statistics that gives summary of operations performed will be useful for 
> tracking file system use.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to