Yuxin Tan created FLINK-29329: --------------------------------- Summary: Checkpoint can not be triggered if encountering OOM Key: FLINK-29329 URL: https://issues.apache.org/jira/browse/FLINK-29329 Project: Flink Issue Type: Bug Reporter: Yuxin Tan Fix For: 1.13.7 Attachments: job-exceptions.txt
When writing a checkpoint, an OOM error is thrown. But the JM is not failed and is restored because I found a log "No master state to restore". Then JM never makes checkpoints anymore. Currently, the root cause is not that clear, maybe this is a bug and we should deal with the OOM or other exceptions when making checkpoints. [^job-exceptions.txt] -- This message was sent by Atlassian Jira (v8.20.10#820010)