Tengfei Huang created SPARK-54556:
-------------------------------------
Summary: Ensure succeeding stages will be re-submitted and
fully-retry when shuffle checksum detected
Key: SPARK-54556
URL: https://issues.apache.org/jira/browse/SPARK-54556
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 4.0.1
Reporter: Tengfei Huang
When a shuffle checksum mismatch is detected, we need to ensure that all
succeeding stages are re-submitted and fully retried:
# All succeeding stages should be re-submitted;
# Results from older attempts, which may have consumed inconsistent data,
should be ignored.
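
The two requirements above can be illustrated with a small, self-contained
sketch. This is not Spark's DAGScheduler code: the Stage class and the
functions succeeding_stages, on_checksum_mismatch and accept_result are
hypothetical names invented for illustration. It assumes each stage carries
an attempt counter and that a result is tagged with the attempt that
produced it.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field


@dataclass
class Stage:
    """Hypothetical stand-in for a scheduler stage; not a Spark class."""
    stage_id: int
    parents: list[int] = field(default_factory=list)
    attempt: int = 0
    # partition id -> (attempt that produced the result, result value)
    results: dict[int, tuple[int, str]] = field(default_factory=dict)


def succeeding_stages(stages: dict[int, Stage], mismatched_id: int) -> set[int]:
    """Return the ids of all stages that transitively depend on mismatched_id."""
    children: dict[int, list[int]] = {}
    for stage in stages.values():
        for parent in stage.parents:
            children.setdefault(parent, []).append(stage.stage_id)

    seen: set[int] = set()
    queue = deque([mismatched_id])
    while queue:
        for child in children.get(queue.popleft(), []):
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return seen


def on_checksum_mismatch(stages: dict[int, Stage], mismatched_id: int) -> list[int]:
    """Requirement 1: re-submit every succeeding stage as a full retry."""
    to_resubmit = succeeding_stages(stages, mismatched_id)
    for stage_id in to_resubmit:
        stage = stages[stage_id]
        stage.attempt += 1     # start a new attempt
        stage.results.clear()  # full retry: keep no partial output
    return sorted(to_resubmit)


def accept_result(stage: Stage, partition: int, attempt: int, value: str) -> bool:
    """Requirement 2: ignore results reported by an older attempt."""
    if attempt < stage.attempt:
        return False
    stage.results[partition] = (attempt, value)
    return True


# Stage 0 feeds stage 1, which feeds stage 2; stage 3 is unrelated.
stages = {
    0: Stage(0),
    1: Stage(1, parents=[0]),
    2: Stage(2, parents=[1]),
    3: Stage(3),
}
resubmitted = on_checksum_mismatch(stages, mismatched_id=0)
late = accept_result(stages[2], partition=0, attempt=0, value="stale")
fresh = accept_result(stages[2], partition=0, attempt=1, value="recomputed")
```

In this sketch, stages 1 and 2 are re-submitted while stage 3 is left alone,
and a late result from attempt 0 of stage 2 is rejected because the stage has
already moved on to attempt 1.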