Hi folks, I have a proposal to implement Spark stage resubmission to handle shuffle fetch failure in Celeborn
https://docs.google.com/document/d/1dkG6fww3g99VAb1wkphNlUES_MpngVPNg8601chmVp8 please have a look and let me know what you think Regards, Erik