Matthias Pohl created FLINK-27355:
-------------------------------------

             Summary: JobManagerRunnerRegistry.localCleanupAsync does not call 
the JobManagerRunner.close method repeatedly
                 Key: FLINK-27355
                 URL: https://issues.apache.org/jira/browse/FLINK-27355
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Coordination
    Affects Versions: 1.15.0
            Reporter: Matthias Pohl


The {{DefaultJobManagerRunner.localCleanupAsync}} method deregisters the 
JobManagerRunner and calls close on it. If close fails for whatever reason, it 
will be identified but the next retry would just notice that the 
JobManagerRunner is already deregistered and not do anything.

Hence, JobMaster shutdown won't be retriggered (i.e. errors in the 
{{CompletedCheckpointStore}} or the {{CheckpointIDCounter}} won't be handled). 
FLINK-26114 is related: Both components don't expose any errors right now, 
anyway.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to