[ 
https://issues.apache.org/jira/browse/SPARK-9004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15572414#comment-15572414
 ] 

Steve Loughran commented on SPARK-9004:
---------------------------------------

HADOOP-13605 added a whole new set of counters for HDFS, S3 and hopefully soon 
Azure; there's an API call on the FS {{getStorageStatistics()}} to query these.

One problem though: this isn't shipping in Hadoop branch-2 yet, so you can't 
write code that uses it, not unless there's some introspection/plugin 
mechanism. 

All the stats are just {{name: String -> value: Long}}, so a something to 
collect a {{Map[String, Long]}} would work. 

> Add s3 bytes read/written metrics
> ---------------------------------
>
>                 Key: SPARK-9004
>                 URL: https://issues.apache.org/jira/browse/SPARK-9004
>             Project: Spark
>          Issue Type: Improvement
>          Components: Input/Output
>            Reporter: Abhishek Modi
>            Priority: Minor
>
> s3 read/write metrics can be pretty useful in finding the total aggregate 
> data processed



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to