[ 
https://issues.apache.org/jira/browse/SPARK-44209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17744500#comment-17744500
 ] 

Deependra Patel commented on SPARK-44209:
-----------------------------------------

Created PR - [https://github.com/apache/spark/pull/42071]

> Expose amount of shuffle data available on the node
> ---------------------------------------------------
>
>                 Key: SPARK-44209
>                 URL: https://issues.apache.org/jira/browse/SPARK-44209
>             Project: Spark
>          Issue Type: New Feature
>          Components: Shuffle
>    Affects Versions: 3.4.1
>            Reporter: Deependra Patel
>            Priority: Trivial
>
> [ShuffleMetrics|https://github.com/apache/spark/blob/43f7a86a05ad8c7ec7060607e43d9ca4d0fe4166/common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/ExternalBlockHandler.java#L318]
>  does not expose metrics such as
> "totalShuffleDataBytes" and "numAppsWithShuffleData". These are per-node
> metrics published by the External Shuffle Service.
>  
> Adding these metrics would help with:
> 1. Deciding whether a node can be decommissioned because no shuffle data is
> present on it
> 2. Better live monitoring of customer workloads, to see whether skewed
> shuffle data is present on the node
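
To make the two proposed metrics and the decommissioning check concrete, here is a minimal sketch in Python. It is an illustration only, not the code in the linked PR or in ExternalBlockHandler: the class `NodeShuffleStats` and its methods are hypothetical names, and it assumes the shuffle service can attribute stored bytes to an application id.

```python
from collections import defaultdict


class NodeShuffleStats:
    """Hypothetical per-node bookkeeping; not Spark's actual implementation."""

    def __init__(self):
        # application id -> bytes of shuffle data currently stored on this node
        self._bytes_by_app = defaultdict(int)

    def add_shuffle_data(self, app_id, num_bytes):
        # Record shuffle bytes written on this node for the given application.
        self._bytes_by_app[app_id] += num_bytes

    def remove_app(self, app_id):
        # Assumed to be called once an application's shuffle files are cleaned up.
        self._bytes_by_app.pop(app_id, None)

    @property
    def total_shuffle_data_bytes(self):
        # Corresponds to the proposed "totalShuffleDataBytes" metric.
        return sum(self._bytes_by_app.values())

    @property
    def num_apps_with_shuffle_data(self):
        # Corresponds to the proposed "numAppsWithShuffleData" metric.
        return sum(1 for b in self._bytes_by_app.values() if b > 0)

    def safe_to_decommission(self):
        # Use case 1: the node holds no shuffle data, so removing it loses nothing.
        return self.total_shuffle_data_bytes == 0
```

For use case 2, a monitoring system could compare `total_shuffle_data_bytes` across nodes to spot one that holds a disproportionate share of the shuffle data.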



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

