attilapiros opened a new pull request #31450:
URL: https://github.com/apache/spark/pull/31450


   
   ### What changes were proposed in this pull request?
   
   This PR adds the following metrics to track why executors are removed during dynamic allocation:
   - `numberExecutorsGracefullyDecommissioned`: number of executors that reached the finished decommissioning state and shut themselves down cleanly
   - `numberExecutorsDecommissionUnfinished`: number of executors that were requested to decommission but stopped before reaching the finished decommissioning state
   - `numberExecutorsKilledByDriver`: number of executors killed by the driver (the driver requested them to stop)
   - `numberExecutorsExitedUnexpectedly`: number of executors that exited without a request from the driver
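
To illustrate how the four counters partition executor removals, here is a minimal Python sketch. It is not the code from this PR: the `ExecutorExit` record, its three flags, and the `classify` and `tally` helpers are hypothetical names invented for illustration, and the precedence of the decommission check over the driver-kill check is an assumption. Only the metric names come from the description above.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class ExecutorExit:
    """Hypothetical record of how one executor went away (not a Spark class)."""
    decommission_requested: bool = False
    decommission_finished: bool = False
    killed_by_driver: bool = False


def classify(exit_info: ExecutorExit) -> str:
    """Map one executor exit to exactly one of the four metric names."""
    # Assumption: a decommission request takes precedence over a driver kill.
    if exit_info.decommission_requested:
        if exit_info.decommission_finished:
            return "numberExecutorsGracefullyDecommissioned"
        return "numberExecutorsDecommissionUnfinished"
    if exit_info.killed_by_driver:
        return "numberExecutorsKilledByDriver"
    # Nobody asked this executor to stop.
    return "numberExecutorsExitedUnexpectedly"


def tally(exits) -> Counter:
    """Count exits per metric name."""
    return Counter(classify(e) for e in exits)
```

Under these assumptions every removed executor increments exactly one counter, so the four values sum to the total number of removals observed.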
   
   ### Why are the changes needed?
   
   These metrics make dynamic allocation easier to monitor by showing why executors are removed.
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes. The new metrics will be available for monitoring.
   
   ### How was this patch tested?
   
   With unit tests and integration tests.




