[ https://issues.apache.org/jira/browse/FLINK-32700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17748092#comment-17748092 ]
Gyula Fora commented on FLINK-32700:
------------------------------------

Sounds good. We have to be careful with simply calling cancel on a savepoint timeout, as that may cause state loss. To allow dropping state, users need to switch to the stateless upgrade mode, which would call cancel anyway.

> Support job drain for Savepoint upgrade mode jobs in Flink Operator
> -------------------------------------------------------------------
>
>                 Key: FLINK-32700
>                 URL: https://issues.apache.org/jira/browse/FLINK-32700
>             Project: Flink
>          Issue Type: Improvement
>          Components: Kubernetes Operator
>    Affects Versions: kubernetes-operator-1.5.0
>            Reporter: Manan Mangal
>            Assignee: Manan Mangal
>            Priority: Major
>
> When a job is cancelled in savepoint upgrade mode, it can be allowed to drain
> by advancing the watermark to the end before it is stopped, so that
> in-flight data is not lost.
> If the job fails to drain, whether it hits a timeout or any other error, it
> can be cancelled without taking a savepoint.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
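To make the concern in the comment concrete, here is a minimal sketch of the fallback decision it argues for: when a drain (stop-with-savepoint) attempt fails or times out, cancel only if the job is in the stateless upgrade mode, and otherwise leave the job running so that state is not dropped. This is an illustration only, not the operator's implementation. The names `UpgradeMode`, `Action`, and `on_drain_failure` are hypothetical; the enum simply mirrors the three upgrade modes the operator documents.

```python
from enum import Enum


class UpgradeMode(Enum):
    # Hypothetical mirror of the operator's documented upgrade modes.
    STATELESS = "stateless"
    SAVEPOINT = "savepoint"
    LAST_STATE = "last-state"


class Action(Enum):
    # Hypothetical outcomes for a failed drain attempt.
    CANCEL_WITHOUT_SAVEPOINT = "cancel-without-savepoint"
    KEEP_RUNNING_AND_REPORT = "keep-running-and-report"


def on_drain_failure(mode: UpgradeMode) -> Action:
    """Decide what to do after a drain attempt fails or times out.

    Assumption: cancelling is only safe when the user has explicitly
    accepted state loss by choosing the stateless upgrade mode.
    """
    if mode is UpgradeMode.STATELESS:
        # State is dropped on upgrade anyway, so a plain cancel is acceptable.
        return Action.CANCEL_WITHOUT_SAVEPOINT
    # For stateful modes, cancelling here could lose state; keep the job
    # running and surface the error so the user can decide.
    return Action.KEEP_RUNNING_AND_REPORT
```

Under this sketch, a savepoint-mode job whose drain times out is never cancelled automatically; the user would have to switch the job to the stateless mode first.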