zhouyejoe opened a new pull request #33078:
URL: https://github.com/apache/spark/pull/33078


   ### What changes were proposed in this pull request?
   This is one of the patches needed for push-based shuffle, tracked under 
SPIP SPARK-30602.
   
   ### Summary of the change:
   When an executor registers with the shuffle service, it encodes the merged 
shuffle dir it created and the application attemptId as JSON in the 
ShuffleManagerMeta. The shuffle service then decodes the JSON string to obtain 
the correct merged shuffle dir and attemptId. If the registration comes from a 
newer attempt, the merged shuffle information is updated to store the 
information from the newer attempt.
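
For readers who want a concrete picture of the registration flow, here is a minimal Python sketch. It is not the code in this PR (the shuffle service itself is not written in Python). The key names `mergeDir` and `attemptId`, the function names, and the dict-based registry are all assumptions made for illustration. The sketch also assumes that a registration from the same attempt leaves the stored entry unchanged.

```python
import json
from typing import Dict, Tuple

# Hypothetical key names, chosen for illustration only.
MERGE_DIR_KEY = "mergeDir"
ATTEMPT_ID_KEY = "attemptId"


def encode_shuffle_meta(merge_dir: str, attempt_id: int) -> str:
    """Executor side: pack the merged shuffle dir and attempt id into JSON."""
    return json.dumps({MERGE_DIR_KEY: merge_dir, ATTEMPT_ID_KEY: attempt_id})


def decode_shuffle_meta(meta_json: str) -> Tuple[str, int]:
    """Shuffle-service side: recover the merged shuffle dir and attempt id."""
    meta = json.loads(meta_json)
    return meta[MERGE_DIR_KEY], int(meta[ATTEMPT_ID_KEY])


def register_executor(
    registry: Dict[str, Tuple[str, int]], app_id: str, meta_json: str
) -> bool:
    """Store the decoded info only if it comes from a newer attempt.

    Returns True when the registry entry was created or replaced.
    """
    merge_dir, attempt_id = decode_shuffle_meta(meta_json)
    current = registry.get(app_id)
    if current is not None and current[1] >= attempt_id:
        # Same or older attempt: keep what is already stored.
        return False
    registry[app_id] = (merge_dir, attempt_id)
    return True
```

In this sketch, a registration carrying attempt 2 replaces the entry stored for attempt 1, while a late registration from attempt 1 is ignored.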
   
   This PR also refactored the management of the merged shuffle information to 
avoid concurrency issues.
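
The description does not say how the refactoring avoids concurrency issues. One common approach, sketched below in Python under that caveat, is to make the "compare attempt ids, then replace" step atomic, so that two concurrent registrations cannot interleave between the check and the update. The class `MergedShuffleRegistry` and its methods are hypothetical names, not identifiers from this PR.

```python
import threading
from typing import Dict, Optional, Tuple


class MergedShuffleRegistry:
    """Hypothetical registry mapping app id -> (merge_dir, attempt_id)."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._apps: Dict[str, Tuple[str, int]] = {}

    def register(self, app_id: str, merge_dir: str, attempt_id: int) -> bool:
        # The check and the update run under one lock, so a registration
        # from an older attempt can never overwrite a newer one.
        with self._lock:
            current = self._apps.get(app_id)
            if current is not None and current[1] >= attempt_id:
                return False
            self._apps[app_id] = (merge_dir, attempt_id)
            return True

    def lookup(self, app_id: str) -> Optional[Tuple[str, int]]:
        with self._lock:
            return self._apps.get(app_id)
```

Whatever order concurrent registrations arrive in, the entry left in the registry is the one with the highest attempt id.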
   
   ### Why are the changes needed?
   Refer to the SPIP in SPARK-30602.
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   ### How was this patch tested?
   Added unit tests.
   The reference PR with the consolidated changes covering the complete 
implementation is also provided in SPARK-30602.
   We have already verified the functionality and the improved performance as 
documented in the SPIP doc.
   




