[ 
https://issues.apache.org/jira/browse/SPARK-38965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wan Kun updated SPARK-38965:
----------------------------
    Description: 
We should retry transfer blocks if *errorHandler.shouldRetryError(e)* return 
true, 

Even though that exception may not a IOException, for example:
{code:java}
org.apache.spark.network.server.BlockPushNonFatalFailure: Block 
shufflePush_0_0_3316_5647 experienced merge collision on the server side
{code}

  was:
For those exceptions which errorHandler.shouldRetryError(e) return true, we 
should retry transfer blocks.

Even though that exception may not a IOException, for example:
{code:java}
org.apache.spark.network.server.BlockPushNonFatalFailure: Block 
shufflePush_0_0_3316_5647 experienced merge collision on the server side
{code}


> Retry transfer blocks for exceptions listed in the error handler 
> -----------------------------------------------------------------
>
>                 Key: SPARK-38965
>                 URL: https://issues.apache.org/jira/browse/SPARK-38965
>             Project: Spark
>          Issue Type: Bug
>          Components: Shuffle
>    Affects Versions: 3.3.0
>            Reporter: Wan Kun
>            Priority: Minor
>
> We should retry transfer blocks if *errorHandler.shouldRetryError(e)* return 
> true, 
> Even though that exception may not a IOException, for example:
> {code:java}
> org.apache.spark.network.server.BlockPushNonFatalFailure: Block 
> shufflePush_0_0_3316_5647 experienced merge collision on the server side
> {code}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to