[GitHub] [spark] prakharjain09 opened a new pull request #28619: [WIP][SPARK-21040][CORE] Speculate tasks which are running on decommission executors

GitBox Fri, 22 May 2020 21:49:46 -0700


prakharjain09 opened a new pull request #28619:
URL: https://github.com/apache/spark/pull/28619



   ### What changes were proposed in this pull request?
   This PR adds functionality to consider the running tasks on decommission 
executors based on some config.
   In spark-on-cloud , we sometimes already know that an executor won't be 
alive for more than fix amount of time. Ex- In AWS Spot nodes, once we get the 
notification, we know that a node will be gone in 120 seconds.
   So if the running tasks on the decommissioning executors may run beyond 
currentTime+120 seconds, then they are candidate for speculation.
   
   ### Why are the changes needed?
   Currently when an executor is decommission, we stop scheduling new tasks on 
those executors but the already running tasks keeps on running on them. Based 
on the cloud, we might know beforehand that an executor won't be alive for more 
than a preconfigured time. Different cloud providers gives different timeouts 
before they take away the nodes. For Ex- In case of AWS spot nodes, an executor 
won't be alive for more than 120 seconds. We can utilize this information in 
cloud environments and take better decisions about speculating the already 
running tasks on decommission executors.
   
   ### Does this PR introduce _any_ user-facing change?
   Yes. This PR adds a new config "spark.executor.decommission.killInterval" 
which they can explicitly set based on the cloud environment where they are 
running.
   
   
   ### How was this patch tested?
   Added UT.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] prakharjain09 opened a new pull request #28619: [WIP][SPARK-21040][CORE] Speculate tasks which are running on decommission executors

Reply via email to