[ https://issues.apache.org/jira/browse/FLINK-7783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aljoscha Krettek closed FLINK-7783.
-----------------------------------
    Resolution: Fixed

Fixed on release-1.3 in
3f85fa1bead4e19eda9fc7d36e6facb8cd9b27db

Fixed on master in
9071e3befb8c279f73c3094c9f6bddc0e7cce9e5

> Don't always remove checkpoints in ZooKeeperCompletedCheckpointStore#recover()
> ------------------------------------------------------------------------------
>
>                 Key: FLINK-7783
>                 URL: https://issues.apache.org/jira/browse/FLINK-7783
>             Project: Flink
>          Issue Type: Sub-task
>          Components: State Backends, Checkpointing
>    Affects Versions: 1.4.0, 1.3.2
>            Reporter: Aljoscha Krettek
>            Assignee: Aljoscha Krettek
>            Priority: Blocker
>             Fix For: 1.4.0, 1.3.3
>
>
> Currently, we always delete checkpoint handles if they (or the data from the
> DFS) cannot be read:
> https://github.com/apache/flink/blob/91a4b276171afb760bfff9ccf30593e648e91dfb/flink-runtime/src/main/java/org/apache/flink/runtime/checkpoint/ZooKeeperCompletedCheckpointStore.java#L180
> This can lead to problems in case the DFS is temporarily not available, i.e.
> we could inadvertently delete all checkpoints even though they are still
> valid.
> A user reported this problem on the mailing list:
> https://lists.apache.org/thread.html/9dc9b719cf8449067ad01114fedb75d1beac7b4dff171acdcc24903d@%3Cuser.flink.apache.org%3E

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
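
For illustration only, the behaviour the issue title asks for might look
roughly like the sketch below. It is not the code from the commits
referenced above, and it does not use Flink's classes:
CheckpointRecoverySketch, StateHandle and recover are hypothetical names,
and a plain Map stands in for the ZooKeeper-backed store. The only point it
makes is that a handle which cannot be read is skipped but kept, on the
assumption that the DFS may only be temporarily unavailable.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; none of these names are Flink APIs.
final class CheckpointRecoverySketch {

    /** Hypothetical stand-in for a handle whose state lives on the DFS. */
    interface StateHandle {
        String retrieve() throws IOException;
    }

    /**
     * Reads every handle in insertion order. A handle that cannot be read is
     * skipped but left in the store, so it can be retried once the DFS is back.
     */
    static List<String> recover(Map<String, StateHandle> store) {
        List<String> recovered = new ArrayList<>();
        for (Map.Entry<String, StateHandle> entry : store.entrySet()) {
            try {
                recovered.add(entry.getValue().retrieve());
            } catch (IOException e) {
                // Deliberately no store.remove(...) here: the failure may be temporary.
                System.err.println("Could not read " + entry.getKey() + ": " + e.getMessage());
            }
        }
        return recovered;
    }

    public static void main(String[] args) {
        Map<String, StateHandle> store = new LinkedHashMap<>();
        store.put("chk-1", () -> "state-1");
        store.put("chk-2", () -> { throw new IOException("DFS unavailable"); });

        List<String> recovered = recover(store);
        System.out.println(recovered.size() + " recovered, " + store.size() + " handles kept");
    }
}
```

Whether a real store should skip an unreadable checkpoint, retry it, or
fail the whole recovery is a separate design decision that this sketch
leaves open.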