[ https://issues.apache.org/jira/browse/FLINK-4809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16242916#comment-16242916 ]
Jing Fan edited comment on FLINK-4809 at 11/7/17 9:26 PM: ---------------------------------------------------------- Do we have any update on the PR? It has been hanging for weeks. was (Author: pangzhi): Do we have any update on the PR? It has been handing for weeks. > Operators should tolerate checkpoint failures > --------------------------------------------- > > Key: FLINK-4809 > URL: https://issues.apache.org/jira/browse/FLINK-4809 > Project: Flink > Issue Type: Sub-task > Components: State Backends, Checkpointing > Reporter: Stephan Ewen > Assignee: Stefan Richter > Fix For: 1.4.0 > > > Operators should try/catch exceptions in the synchronous and asynchronous > part of the checkpoint and send a {{DeclineCheckpoint}} message as a result. > The decline message should have the failure cause attached to it. > The checkpoint barrier should be sent anyways as a first step before > attempting to make a state checkpoint, to make sure that downstream operators > do not block in alignment. -- This message was sent by Atlassian JIRA (v6.4.14#64029)