[ 
https://issues.apache.org/jira/browse/FLINK-37131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

altenchen updated FLINK-37131:
------------------------------
    Description: 
The situation is that the job does not fail as expected when its checkpoint has 
failed 6 times, as I configured the parameter 
'execution.checkpointing.tolerable-failed-checkpoints' to 5 in advance.

!image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!

The error log is as below:

!image-2025-01-15-13-45-50-068.png|width=812,height=201!

The log time is after the sixth failed checkpoint which is so confused. 

 

 

 

  was:
The situation is that the job does not fail when its checkpoint has failed 6 
times, as I configured the parameter 
'execution.checkpointing.tolerable-failed-checkpoints' to 5 in advance.

!image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!


> checkpoint tolerable-failed invalid
> -----------------------------------
>
>                 Key: FLINK-37131
>                 URL: https://issues.apache.org/jira/browse/FLINK-37131
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.18.1
>         Environment: flink1.18.1 + jdk11
>            Reporter: altenchen
>            Priority: Major
>         Attachments: image-2025-01-15-13-38-39-428.png, 
> image-2025-01-15-13-39-16-990.png, image-2025-01-15-13-45-22-798.png, 
> image-2025-01-15-13-45-50-068.png
>
>
> The situation is that the job does not fail as expected when its checkpoint 
> has failed 6 times, as I configured the parameter 
> 'execution.checkpointing.tolerable-failed-checkpoints' to 5 in advance.
> !image-2025-01-15-13-38-39-428.png|width=548,height=325!!image-2025-01-15-13-39-16-990.png|width=479,height=311!
> The error log is as below:
> !image-2025-01-15-13-45-50-068.png|width=812,height=201!
> The log time is after the sixth failed checkpoint which is so confused. 
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to