[ https://issues.apache.org/jira/browse/FLINK-21667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17324636#comment-17324636 ]
Xintong Song commented on FLINK-21667: -------------------------------------- A PR has been opened for this ticket. We are waiting for the release 1.13 branch cut. > Standby RM might remove resources from Kubernetes > ------------------------------------------------- > > Key: FLINK-21667 > URL: https://issues.apache.org/jira/browse/FLINK-21667 > Project: Flink > Issue Type: Bug > Components: Deployment / Kubernetes, Runtime / Coordination > Affects Versions: 1.12.2 > Reporter: Xintong Song > Assignee: Xintong Song > Priority: Major > Labels: pull-request-available > Fix For: 1.14.0 > > > Currently, on initialization {{KubernetesResourceManagerDriver}} starts a > watch for receiving pod events. It could happen that it starts to receive > events before obtaining leadership. Consequently, a standby RM may remove > terminated pods from Kubernetes during handling the events. > This is not very damaging atm, since the removed pods are already terminated > anyway. However, it would still be good for a standby RM to strictly > following the contract and make no modifications before obtaining leadership. > We might consider to postpone starting of the watch to when the leadership is > granted. -- This message was sent by Atlassian Jira (v8.3.4#803005)