[ 
https://issues.apache.org/jira/browse/SPARK-3577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15563038#comment-15563038
 ] 

Gaoxiang Liu edited comment on SPARK-3577 at 10/11/16 4:49 AM:
---------------------------------------------------------------

[~rxin] I found that the spill size metric has already been added in 
https://github.com/apache/spark/commit/bb8098f203e61111faddf2e1a04b03d62037e6c7#diff-1bd3dc38f6306e0a822f93d62c32b1d0,
 and I have confirmed it in the UI (please refer to the attachment on this JIRA: 
https://issues.apache.org/jira/secure/attachment/12832515/spill_size.jpg).

Also, we noticed something odd: the spill size is reported in the mapper 
but not in the reducer.

Back to the previous question about spill time: if it is still relevant to 
add, I plan to work on it if there are no objections.


was (Author: dreamworks007):
I found that the spill size metric has already been added in 
https://github.com/apache/spark/commit/bb8098f203e61111faddf2e1a04b03d62037e6c7#diff-1bd3dc38f6306e0a822f93d62c32b1d0,
 and I have confirmed it in the UI (please refer to the attachment on this JIRA: 
https://issues.apache.org/jira/secure/attachment/12832515/spill_size.jpg).

Also, we noticed something odd: the spill size is reported in the mapper 
but not in the reducer.

Back to the previous question about spill time: if it is still relevant to 
add, I plan to work on it if there are no objections.

> Add task metric to report spill time
> ------------------------------------
>
>                 Key: SPARK-3577
>                 URL: https://issues.apache.org/jira/browse/SPARK-3577
>             Project: Spark
>          Issue Type: Bug
>          Components: Shuffle, Spark Core
>    Affects Versions: 1.1.0
>            Reporter: Kay Ousterhout
>            Priority: Minor
>         Attachments: spill_size.jpg
>
>
> The {{ExternalSorter}} passes its own {{ShuffleWriteMetrics}} into 
> {{ExternalSorter}}.  The write time recorded in those metrics is never used.  
> We should probably add task metrics to report this spill time, since for 
> shuffles, this would have previously been reported as part of shuffle write 
> time (with the original hash-based sorter).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

