Matthias Pohl created FLINK-31766: ------------------------------------- Summary: Restoring from a retained checkpoint that was generated with changelog backend enabled might fail due to missing files Key: FLINK-31766 URL: https://issues.apache.org/jira/browse/FLINK-31766 Project: Flink Issue Type: Bug Components: Runtime / Coordination Affects Versions: 1.16.1, 1.17.0, 1.18.0 Reporter: Matthias Pohl
in FLINK-31593 we discovered a instability when generating the test data for {{StatefulJobSavepointMigrationITCase}} and {{StatefulJobWBroadcastStateMigrationITCase}}. It appears that files are deleted that shouldn't be deleted (see [~Yanfei Lei]'s [comment in FLINK-31593|https://issues.apache.org/jira/browse/FLINK-31593?focusedCommentId=17706679&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17706679]). It's quite reproducible when generating the 1.17 test data for {{StatefulJobWBroadcastStateMigrationITCase}} and doing a test run to verify it. I'm attaching the debug logs of such two runs that I generated for FLINK-31593 in this issue as well. -- This message was sent by Atlassian Jira (v8.20.10#820010)