Question on stage rerun in Celeborn 0.5.1

Sungwoo Park Sun, 06 Oct 2024 23:20:50 -0700

(I left this message in the Celeborn Slack channel, but perhaps this mailing list is the right place.)


Hello,

Previously we implemented an extension of MR3 (an execution engine) to support Celeborn 0.3.1. For a short introduction, please see:

https://mr3docs.datamonad.com/docs/mr3/features/celeborn/

Now we are upgrading Celeborn to 0.5.1 and working on supporting stage rerun, much like Spark-Celeborn.

To my (pleasant) surprise, upgrading Celeborn from 0.3.1 to 0.5.1 was quite smooth. After recompiling with Celeborn 0.5.1, MR3-Celeborn just worked fine. I was surprised because the current code does not obtain Celeborn shuffle IDs at all (because there was no notion of Celeborn shuffle IDs back in 0.3.1) and we use only application shuffle IDs which are generated by MR3 (similarly to Spark shuffle IDs).


I have a few questions.

1. Suppose that a reducer fails to read the output of a certain mapper. In such a case, should we re-execute all the mappers in the previous stage? Or, is it okay to re-execute only the mapper whose output is lost? In our previous implementation, MR3-Celeborn does not fully support task rerun (similar to stage rerun) because Celeborn does not return the identity of mapper tasks whose output has been lost.

2. When a reducer tries to read the output of mappers, when is it okay to use the application shuffle ID?

3. Along the same lines as question 2, should we always get Celeborn shuffle IDs when trying to read the output of mappers? Considering the fact that the current code of MR3-Celeborn works fine, it seems like this is not always necessary.


Thank you.

--- Sungwoo Park
