Ming Li created FLINK-31474:
-------------------------------
Summary: [Flink] Add failure information for out-of-order checkpoints
Key: FLINK-31474
URL: https://issues.apache.org/jira/browse/FLINK-31474
Project: Flink
Issue Type: Improvement
Components: Runtime / Checkpointing
Reporter: Ming Li
At present, when a checkpoint is out of order, the {{Task}} side only prints an
out-of-order log message. On the {{JM}} side, the checkpoint can then only fail
through a timeout, and the real reason for the failure cannot be determined.
Therefore, I think we should report failure information to the JM side for
out-of-order checkpoints. The current Task-side handling is:
{code:java}
if (lastCheckpointId >= metadata.getCheckpointId()) {
    LOG.info(
            "Out of order checkpoint barrier (aborted previously?): {} >= {}",
            lastCheckpointId,
            metadata.getCheckpointId());
    channelStateWriter.abort(metadata.getCheckpointId(), new CancellationException(), true);
    checkAndClearAbortedStatus(metadata.getCheckpointId());
    return;
}
{code}
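To illustrate the idea, here is a minimal, self-contained sketch of how the Task side might report the reason instead of only logging it. It does not use Flink classes: {{DeclineReporter}}, {{TaskCheckpointHandler}} and {{triggerCheckpoint}} are hypothetical stand-ins invented for this example, and the actual change would need to go through whatever mechanism Flink already uses to decline a checkpoint towards the JM.

```java
import java.util.ArrayList;
import java.util.List;

public class OutOfOrderCheckpointSketch {

    /** Hypothetical stand-in for the path that carries a decline message to the JM. */
    interface DeclineReporter {
        void declineCheckpoint(long checkpointId, String reason);
    }

    /** Hypothetical Task-side handler that remembers the newest checkpoint it has seen. */
    static final class TaskCheckpointHandler {
        private long lastCheckpointId = -1L;
        private final DeclineReporter reporter;

        TaskCheckpointHandler(DeclineReporter reporter) {
            this.reporter = reporter;
        }

        /** Returns true if the checkpoint is accepted, false if it is declined as out of order. */
        boolean triggerCheckpoint(long checkpointId) {
            if (lastCheckpointId >= checkpointId) {
                // Same condition as the existing check, but the reason is reported
                // to the JM side instead of only being written to the Task log.
                reporter.declineCheckpoint(
                        checkpointId,
                        "Out of order checkpoint barrier: "
                                + lastCheckpointId + " >= " + checkpointId);
                return false;
            }
            lastCheckpointId = checkpointId;
            return true;
        }
    }

    public static void main(String[] args) {
        // The list plays the role of the JM: it collects every decline with its reason.
        List<String> declinedOnJm = new ArrayList<>();
        TaskCheckpointHandler handler =
                new TaskCheckpointHandler(
                        (id, reason) -> declinedOnJm.add(id + ": " + reason));

        handler.triggerCheckpoint(5); // accepted
        handler.triggerCheckpoint(3); // declined, since 5 >= 3
        System.out.println(declinedOnJm);
    }
}
```

With something along these lines, the JM could fail the checkpoint immediately with an explicit reason rather than waiting for the timeout.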