prakharjain09 opened a new pull request #27864: [SPARK-20732][CORE] 
Decommission cache blocks to other executors when an executor is decommissioned
URL: https://github.com/apache/spark/pull/27864
 
 
   ### What changes were proposed in this pull request?
   After the changes in SPARK-20628, CoarseGrainedSchedulerBackend can 
decommission an executor and stop assigning new tasks to it. We should also 
decommission the corresponding block managers in the same way, i.e. move the 
cached RDD blocks from those executors to other active executors.
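
   The block migration described above could be sketched roughly as follows. 
This is a simplified, self-contained model and not the code in this PR: 
`BlockId`, `DecommissionSketch`, `planMigration`, and the round-robin placement 
policy are all hypothetical names and choices made for illustration, and they 
do not correspond to Spark's actual BlockManager classes.

```scala
// Hypothetical, simplified model for illustration only.
// These are NOT Spark's BlockManager / BlockId classes.
final case class BlockId(name: String)

object DecommissionSketch {

  /** Plans a target executor for every cached block held by an executor
    * that is being decommissioned.
    *
    * Blocks are spread round-robin over the active peers (an assumed
    * policy). If there are no active peers, the plan is empty and the
    * blocks would simply be lost when the executor goes away.
    */
  def planMigration(
      blocksOnLeavingExecutor: Seq[BlockId],
      activePeers: Seq[String]): Map[BlockId, String] = {
    if (activePeers.isEmpty) {
      Map.empty
    } else {
      blocksOnLeavingExecutor.zipWithIndex.map { case (block, i) =>
        block -> activePeers(i % activePeers.size)
      }.toMap
    }
  }
}
```

   For example, three cached blocks and two active peers would yield a plan 
that sends the first and third block to the first peer and the second block to 
the second peer. A real implementation would also have to handle failed 
transfers and peers that run out of storage memory.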
   
   ### Why are the changes needed?
   We need to gracefully decommission the block managers so that the underlying 
RDD cache blocks are not lost if the executors are forcefully taken away after 
some timeout (for example, because of spot-instance loss or preemptible VMs). 
It's good to save as much cached data as possible. 
   
   Also, in the future, once the decommissioning signal comes from the cluster 
manager (say YARN, Mesos, etc.), dynamic allocation plus this change gives us 
the opportunity to scale down executors faster by freeing them of cached data.
   
   ### Does this PR introduce any user-facing change?
   No.
   
   ### How was this patch tested?
   Added unit tests.
   
   
   
