Xingbo Jiang created SPARK-37151:
------------------------------------
Summary: Avoid executor state sync attempt fail continuously in a
short timeframe
Key: SPARK-37151
URL: https://issues.apache.org/jira/browse/SPARK-37151
Project: Spark
Issue Type: Improvement
Components: Spark Core
Affects Versions: 3.2.0
Reporter: Xingbo Jiang
Assignee: Xingbo Jiang
An executor would retry sending the ExecutorStateChanged message when the
previous attempt failed. This would not be an issue when the attempt failed
with TimeoutException. But if the connection between the executor and the
Master is broken, the attempt would fail immediately, leading to the retry
attempt also fail, and quickly reaches the max attempt limitation.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]