Thomas Weise created FLINK-30004:
------------------------------------

             Summary: Cannot resume deployment after suspend with savepoint due 
to leftover confgmaps
                 Key: FLINK-30004
                 URL: https://issues.apache.org/jira/browse/FLINK-30004
             Project: Flink
          Issue Type: Bug
          Components: Kubernetes Operator
    Affects Versions: 1.2
            Reporter: Thomas Weise
            Assignee: Thomas Weise


Due to the possibility of incomplete cleanup of HA data in Flink 1.14, the 
deployment can get into a limbo state that requires manual intervention after 
suspend with savepoint. If the config maps are not cleaned up the resumed job 
will be considered finished and the operator recognize the JM deployment as 
missing. Due to check for HA data which are now cleaned up, the job fails to 
start and manual redeployment with initial savepoint is necessary.

This can be avoided by removing any leftover HA config maps after the job has 
successfully stopped with savepoint (upgrade mode savepoint).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to