[
https://issues.apache.org/jira/browse/FLINK-37131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
altenchen updated FLINK-37131:
------------------------------
Description:
The job does not fail as expected after its checkpoints have failed 6 times,
even though I set 'execution.checkpointing.tolerable-failed-checkpoints' to 5
in advance.
!image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!
The error log is as below:
!image-2025-01-15-13-45-50-068.png|width=812,height=201!
The log entry is timestamped after the sixth failed checkpoint, which is confusing.
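For reference, a minimal sketch of the setting described above, assuming it is set in flink-conf.yaml rather than in job code (where it was actually set is not stated here):

```yaml
# flink-conf.yaml (assumed location; the option could also be set in job code)
# Tolerable number of failed checkpoints, as configured for this job.
execution.checkpointing.tolerable-failed-checkpoints: 5
```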
was:
The job does not fail after its checkpoints have failed 6 times, even though
I set 'execution.checkpointing.tolerable-failed-checkpoints' to 5 in advance.
!image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!
> checkpoint tolerable-failed invalid
> -----------------------------------
>
> Key: FLINK-37131
> URL: https://issues.apache.org/jira/browse/FLINK-37131
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Checkpointing
> Affects Versions: 1.18.1
> Environment: Flink 1.18.1 + JDK 11
> Reporter: altenchen
> Priority: Major
> Attachments: image-2025-01-15-13-38-39-428.png,
> image-2025-01-15-13-39-16-990.png, image-2025-01-15-13-45-22-798.png,
> image-2025-01-15-13-45-50-068.png
>
>
> The job does not fail as expected after its checkpoints have failed 6 times,
> even though I set 'execution.checkpointing.tolerable-failed-checkpoints' to 5
> in advance.
> !image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!
> The error log is as below:
> !image-2025-01-15-13-45-50-068.png|width=812,height=201!
> The log entry is timestamped after the sixth failed checkpoint, which is confusing.
>
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)