Tengfei Huang created SPARK-54663:
-------------------------------------

             Summary: Computes RowBasedChecksum in ShuffleWriters
                 Key: SPARK-54663
                 URL: https://issues.apache.org/jira/browse/SPARK-54663
             Project: Spark
          Issue Type: Sub-task
          Components: Spark Core
    Affects Versions: 4.0.0
            Reporter: Tengfei Huang


To help detect the shuffle output changes due to indeterminate stage retry, we 
need to introduce an order independent checksum for the shuffle output.

Once the checksum changes, we'll know that this is from a indeterminate stage 
retry, will need to recompute all the succeeding stages for data correctness.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to