Zhu Zhu created FLINK-20626: ------------------------------- Summary: Canceling a job when it is failing will result in job hanging in CANCELING state Key: FLINK-20626 URL: https://issues.apache.org/jira/browse/FLINK-20626 Project: Flink Issue Type: Bug Components: Runtime / Coordination Affects Versions: 1.11.2, 1.12.0 Reporter: Zhu Zhu Assignee: Zhu Zhu Fix For: 1.13.0, 1.11.4, 1.12.1
If user manually cancels a job when the job is failing(here failing means the job encounters unrecoverable failure and is about to fail), the job will hang in CANCELING state and cannot terminate. The cause is that DefaultScheduler currently will always try to transition from `FAILING` to `FAILED` to terminate the job. However, job canceling will change job status to `CANCELING` so that the transition to `FAILED` will not success. -- This message was sent by Atlassian Jira (v8.3.4#803005)