[ https://issues.apache.org/jira/browse/FLINK-26683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17512165#comment-17512165 ]
Liu commented on FLINK-26683:
-----------------------------

I wonder which situations cause the savepoint to complete but the completion notification to fail. In that case, for stop-with-savepoint, can we just restore and retry committing in both drain and no-drain modes, since all the tasks are ready to commit? If we do not do so for the no-drain mode, I am afraid that the next stop-with-savepoint may run into the same problem again, for example a disk error.

> Terminate the job anyway if savepoint finished when stop-with-savepoint
> -----------------------------------------------------------------------
>
>                 Key: FLINK-26683
>                 URL: https://issues.apache.org/jira/browse/FLINK-26683
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Checkpointing, Runtime / Coordination
>    Affects Versions: 1.15.0, 1.14.4
>            Reporter: Liu
>            Priority: Major
>             Fix For: 1.16.0
>
>
> When we stop with savepoint, the savepoint finishes. But some tasks fail over
> for some reason and are restarted into the running state. In the end, some
> tasks are finished and some tasks are running. In this case, I think that we
> should terminate all the tasks anyway instead of restarting them, since the
> savepoint is finished and the job has stopped consuming data. What do you think?

--
This message was sent by Atlassian Jira
(v8.20.1#820001)