[ https://issues.apache.org/jira/browse/FLINK-25458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yuan Mei reassigned FLINK-25458: -------------------------------- Assignee: Yanfei Lei > Support local recovery > ---------------------- > > Key: FLINK-25458 > URL: https://issues.apache.org/jira/browse/FLINK-25458 > Project: Flink > Issue Type: Sub-task > Components: Runtime / Checkpointing, Runtime / State Backends > Reporter: Yun Tang > Assignee: Yanfei Lei > Priority: Major > Fix For: 1.16.0 > > > Currently, changelog state-backend doesn't support local recovery. Thus, > recovery times might be sub-optimal. > > Materialized state issues: > Current periodic materialization would call state backend snapshot method > with a materialization id. However, current local state managment would rely > on checkpoint id as storing, confirming and discarding. The gap between them > would break how local recovery works. > > Non-materialized state issues: > * non-materialized state (i.e. changelog) is shared across checkpoints, and > therefore needs some tracking (in TM or hard-linking in FS) > * the writer does not enforce boundary between checkpoints (when writing to > DFS); if local stream simply duplicates DFS stream then it would break on > cleanup > * files can be shared across tasks, which will also break on cleanup -- This message was sent by Atlassian Jira (v8.20.7#820007)