jiangxb1987 opened a new pull request, #40690:
URL: https://github.com/apache/spark/pull/40690

   ### What changes were proposed in this pull request?
   
   This PR changes MapOutputTracker.updateMapOutput() to look up the MapStatus 
through a mapping from mapId to mapIndex. Previously it performed a linear 
search, which becomes a performance bottleneck when a large proportion of the 
blocks in the map are migrated.
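
For illustration, here is a minimal sketch of the two lookup strategies. It is not the actual Spark code: `SimpleMapStatus`, `SimpleShuffleStatus`, `findLinear`, and `findIndexed` are hypothetical stand-ins, and the real `MapStatus` and `ShuffleStatus` classes carry more state and locking than is shown here.

```scala
import scala.collection.mutable

// Hypothetical, simplified stand-in for Spark's MapStatus.
final case class SimpleMapStatus(mapId: Long, var location: String)

// Hypothetical stand-in for the per-shuffle state held by the tracker.
final class SimpleShuffleStatus(numPartitions: Int) {
  // Statuses are stored by mapIndex, as in the array being searched.
  private val mapStatuses = new Array[SimpleMapStatus](numPartitions)
  // Assumed auxiliary index: mapId -> mapIndex, maintained on registration.
  private val mapIdToMapIndex = mutable.HashMap.empty[Long, Int]

  def addMapOutput(mapIndex: Int, status: SimpleMapStatus): Unit = {
    // Drop the index entry of any status being overwritten at this slot.
    val previous = mapStatuses(mapIndex)
    if (previous != null) mapIdToMapIndex.remove(previous.mapId)
    mapStatuses(mapIndex) = status
    mapIdToMapIndex(status.mapId) = mapIndex
  }

  // Old approach: scan every slot, O(n) per update.
  def findLinear(mapId: Long): Option[SimpleMapStatus] =
    mapStatuses.find(s => s != null && s.mapId == mapId)

  // New approach: one hash lookup plus one array access, O(1) expected.
  def findIndexed(mapId: Long): Option[SimpleMapStatus] =
    mapIdToMapIndex.get(mapId).map(mapStatuses(_))

  // Returns true if the status was found and its location updated.
  def updateMapOutput(mapId: Long, newLocation: String): Boolean =
    findIndexed(mapId) match {
      case Some(status) =>
        status.location = newLocation
        true
      case None =>
        false
    }
}
```

With n map outputs and m migrated blocks, the linear scan costs O(n * m) in total, while the indexed lookup costs roughly O(m), which is why the difference shows up mainly when many blocks migrate at once.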
   
   ### Why are the changes needed?
   
   To avoid a performance bottleneck when block decommissioning is enabled and 
many blocks are migrated within a short time window.
   
   ### Does this PR introduce _any_ user-facing change?
   
   No, it is purely a performance improvement.
   
   
   ### How was this patch tested?
   
   Tested manually.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
For additional commands, e-mail: reviews-h...@spark.apache.org
