[jira] [Created] (SPARK-40069) Extend the new heartbeat mechanism to Kubernetes

2022-08-12 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-40069: - Summary: Extend the new heartbeat mechanism to Kubernetes Key: SPARK-40069 URL: https://issues.apache.org/jira/browse/SPARK-40069 Project: Spark Issue Type

[jira] [Created] (SPARK-40068) Extend new heartbeat mechanism to YARN

2022-08-12 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-40068: - Summary: Extend new heartbeat mechanism to YARN Key: SPARK-40068 URL: https://issues.apache.org/jira/browse/SPARK-40068 Project: Spark Issue Type: Improvem

[jira] [Created] (SPARK-39984) Check workerLastHeartbeat with master before HeartbeatReceiver expires an executor

2022-08-04 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-39984: - Summary: Check workerLastHeartbeat with master before HeartbeatReceiver expires an executor Key: SPARK-39984 URL: https://issues.apache.org/jira/browse/SPARK-39984

[jira] [Created] (SPARK-39970) Introduce ThrottledLogger to prevent log message flooding caused by network issues

2022-08-03 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-39970: - Summary: Introduce ThrottledLogger to prevent log message flooding caused by network issues Key: SPARK-39970 URL: https://issues.apache.org/jira/browse/SPARK-39970

[jira] [Created] (SPARK-39957) Delay onDisconnected to enable Driver receives ExecutorExitCode

2022-08-02 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-39957: - Summary: Delay onDisconnected to enable Driver receives ExecutorExitCode Key: SPARK-39957 URL: https://issues.apache.org/jira/browse/SPARK-39957 Project: Spark

[jira] [Created] (SPARK-39956) Determine task failures based on ExecutorExitCode

2022-08-02 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-39956: - Summary: Determine task failures based on ExecutorExitCode Key: SPARK-39956 URL: https://issues.apache.org/jira/browse/SPARK-39956 Project: Spark Issue Typ

[jira] [Created] (SPARK-39955) Improve LaunchTask process to avoid Stage failures caused by fail-to-send LaunchTask messages

2022-08-02 Thread Kai-Hsun Chen (Jira)
Kai-Hsun Chen created SPARK-39955: - Summary: Improve LaunchTask process to avoid Stage failures caused by fail-to-send LaunchTask messages Key: SPARK-39955 URL: https://issues.apache.org/jira/browse/SPARK-39955