holdenk opened a new pull request #26440: [WIP][SPARK-20628][CORE] Start to 
improve Spark decommissioning & preemption support
URL: https://github.com/apache/spark/pull/26440
 
 
   This PR is based on an existing/previou PR - 
https://github.com/apache/spark/pull/19045
   
   ### What changes were proposed in this pull request?
   
   This changes adds a decommissioning state that we can enter when the cloud 
provider/scheduler lets us know we aren't going to be removed immediately but 
instead will be removed soon. This concept fits nicely in K8s and also with 
spot-instances on AWS / preemptible instances all of which we can get a notice 
that our host is going away. For now we simply stop scheduling jobs, in the 
future we could perform some kind of migration of data during scale-down, or at 
least stop accepting new blocks to cache.
   
   There is a design document at 
https://docs.google.com/document/d/1xVO1b6KAwdUhjEJBolVPl9C6sLj7oOveErwDSYdT-pE/edit?usp=sharing
   
   ### Why are the changes needed?
   
   With more move to preemptible multi-tenancy, serverless environments, and 
spot-instances better handling of node scale down is required.
   
   ### Does this PR introduce any user-facing change?
   
   There is no API change, however an additional configuration flag is added to 
enable/disable this behaviour.
   
   ### How was this patch tested?
   
   New integration tests in the Spark K8s integration testing. Extension of the 
AppClientSuite to test decommissioning seperate from the K8s.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to