[ 
https://issues.apache.org/jira/browse/SPARK-5647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313020#comment-14313020
 ] 

Patrick Wendell commented on SPARK-5647:
----------------------------------------

Isn't it possible to just get the file path when a file output format is used, 
and then read the size of that file? The main challenge I see is how quickly 
that size becomes visible to the HDFS client. In general I think it's worth 
doing because a lot of people still use older versions of the HDFS client, 
for instance people on AWS who primarily read from S3 and don't keep up to 
date with the newest Hadoop APIs.
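
A minimal sketch of the "read the size of the file" idea, using the local 
filesystem through java.nio as a stand-in for HDFS. The class and method names 
(OutputSizeProbe, visibleSize, bytesWritten, demo) are hypothetical and not part 
of Spark. On HDFS the analogous lookup would be 
FileSystem.getFileStatus(path).getLen(), and the length reported for a file 
that is still open for writing may lag behind what has been written, which is 
the visibility concern raised above.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

class OutputSizeProbe {

    // Size of the file as currently visible to this client (0 if it does not exist).
    static long visibleSize(Path file) {
        try {
            return Files.exists(file) ? Files.size(file) : 0L;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Writes the payload, closes the stream, then reads the size back from
    // the filesystem instead of counting bytes inside the writer.
    static long bytesWritten(Path file, byte[] payload) {
        try (OutputStream out = Files.newOutputStream(file)) {
            out.write(payload);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // The stream is closed here, so the full length should be visible.
        return visibleSize(file);
    }

    // Convenience wrapper: writes n bytes to a temporary file and reports its size.
    static long demo(int n) {
        try {
            Path tmp = Files.createTempFile("output-size-probe", ".bin");
            try {
                return bytesWritten(tmp, new byte[n]);
            } finally {
                Files.deleteIfExists(tmp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("bytes written: " + demo(1234));
    }
}
```

Reading the size only after the stream is closed sidesteps the visibility 
question, at the cost of not being able to report progress while a task is 
still writing.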

> Output metrics do not show up for older hadoop versions (< 2.5)
> ---------------------------------------------------------------
>
>                 Key: SPARK-5647
>                 URL: https://issues.apache.org/jira/browse/SPARK-5647
>             Project: Spark
>          Issue Type: New Feature
>          Components: Spark Core
>            Reporter: Kostas Sakellis
>
> Need to add output metrics for Hadoop < 2.5.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

