holdenk opened a new pull request #30155:
URL: https://github.com/apache/spark/pull/30155


   
   ### What changes were proposed in this pull request?
   
   Make pod allocation executor timeouts configurable. Keep all known pods in 
mind when allocating executors to avoid over allocating if the pending time is 
much higher than the allocation interval.
   
   ### Why are the changes needed?
   The current executor timeouts do not match that of all real world clusters 
especially under load. While this can be worked around by increasing the 
allocation batch delay, that will decrease the speed at which the total number 
of executors will be able to be requested.
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes new configuration property
   
   
   ### How was this patch tested?
   
   Updated existing test to use the timeout from the new configuration 
property. Verified test failed without the update.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to