xuanyuanking edited a comment on issue #24110: [SPARK-25341][Core] Support 
rolling back a shuffle map stage and re-generate the shuffle files
URL: https://github.com/apache/spark/pull/24110#issuecomment-474424797
 
 
   ```
   To help other people review this patch, can you add the following 
information in your PR description?
   ```
   Thanks Wenchen, the PR description updated.
   ```
   A side question: can we skip the temp file when writing shuffle files?
   ```
   ~~Let me check this, it's related to 
https://issues.apache.org/jira/browse/SPARK-8029.~~
   As current implement, the answer is no, because we only keep attempt id for 
the indeterminate stage.
   For further checking, I think we can achieve the goal of skipping the temp 
file if we need. We need to do two more things below:
   - All the block id should be extended by adding attempt id, not only the 
indeterminate ones.
   - We should keep the mapping of (stage, partition) -> attemptId in 
MapOutputTracker, because, for the determinate stage, we will meet different 
stage attemptId in the single stage while fetch fail happened. Current 
implement just keeps the mapping of shuffleId to attemptId for each stage and 
only use them in the indeterminate stage.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to