[ https://issues.apache.org/jira/browse/FLINK-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16210019#comment-16210019 ]
Jing Fan commented on FLINK-4808: --------------------------------- Any update about this PR and ticket? I think this is an important feature. In industry, state checkpoint is used frequently and unable to skip failed checkpoints will block bringing flink into production environment. > Allow skipping failed checkpoints > --------------------------------- > > Key: FLINK-4808 > URL: https://issues.apache.org/jira/browse/FLINK-4808 > Project: Flink > Issue Type: New Feature > Components: State Backends, Checkpointing > Affects Versions: 1.1.2, 1.1.3 > Reporter: Stephan Ewen > Fix For: 1.4.0 > > > Currently, if Flink cannot complete a checkpoint, it results in a failure and > recovery. > To make the impact of less stable storage infrastructure on the performance > of Flink less severe, Flink should be able to tolerate a certain number of > failed checkpoints and simply keep executing. > This should be controllable via a parameter, for example: > {code} > env.getCheckpointConfig().setAllowedFailedCheckpoints(3); > {code} > A value of {{-1}} could indicate an infinite number of checkpoint failures > tolerated by Flink. > The default value should still be {{0}}, to keep compatibility with the > existing behavior. -- This message was sent by Atlassian JIRA (v6.4.14#64029)