Victsm opened a new pull request #29855:
URL: https://github.com/apache/spark/pull/29855


   ### What changes were proposed in this pull request?
   
   This is the first patch for SPIP SPARK-30602 for push-based shuffle.
   Summary of changes:
   * Introduce new API in ExternalBlockStoreClient to push blocks to a remote 
shuffle service.
   * Extend SPARK-6237 to allow responding the metadata header back to the 
client.
   * Leveraging the streaming upload functionality in SPARK-6237, it also 
enables the ExternalBlockHandler to delegate the handling of block push 
requests to MergedShuffleFileManager.
   * Propose the API for MergedShuffleFileManager, where the core logic on the 
shuffle service side to handle block push requests is defined. The actual 
implementation of this API is deferred into a later RB to restrict the size of 
this PR.
   * Introduce OneForOneBlockPusher to enable pushing blocks to remote shuffle 
services in shuffle RPC layer.
   * New protocols in shuffle RPC layer to support the functionalities.
   
   ### Why are the changes needed?
   
   Refer to the SPIP in SPARK-30602
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   ### How was this patch tested?
   Added unit tests.
   The reference PR with the consolidated changes covering the complete 
implementation is also provided in SPARK-30602.
   We have already verified the functionality and the improved performance as 
documented in the SPIP doc.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to