[jira] [Commented] (FLINK-32700) Support job drain for Savepoint upgrade mode jobs in Flink Operator

Gyula Fora (Jira) Thu, 27 Jul 2023 04:19:05 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-32700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17748092#comment-17748092
 ]


Gyula Fora commented on FLINK-32700:
------------------------------------

Sounds good, we have to be careful with simply calling cancel in a savepoint 
timeout, that may cause state loss. Users need to switch to a stateless upgrade 
mode to allow dropping state which would call cancel anyways.

> Support job drain for Savepoint upgrade mode jobs in Flink Operator
> -------------------------------------------------------------------
>
>                 Key: FLINK-32700
>                 URL: https://issues.apache.org/jira/browse/FLINK-32700
>             Project: Flink
>          Issue Type: Improvement
>          Components: Kubernetes Operator
>    Affects Versions: kubernetes-operator-1.5.0
>            Reporter: Manan Mangal
>            Assignee: Manan Mangal
>            Priority: Major
>
> During cancel job with savepoint upgrade mode, jobs can be allowed to drain 
> by advancing the watermark to the end, before they are stopped, so that the 
> in-flight data is not lost. 
> If the job fails to drain and hits timeout or any other error, it can be 
> cancelled without taking a savepoint.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (FLINK-32700) Support job drain for Savepoint upgrade mode jobs in Flink Operator

Reply via email to