Fei Feng created FLINK-35123:
--------------------------------
Summary: Flink Kubernetes Operator should not do deleteHAData
Key: FLINK-35123
URL: https://issues.apache.org/jira/browse/FLINK-35123
Project: Flink
Issue Type: Technical Debt
Components: Kubernetes Operator
Affects Versions: kubernetes-operator-1.8.0, kubernetes-operator-1.7.0
Reporter: Fei Feng
Attachments: image-2024-04-16-15-56-33-426.png
we use flink HA based on zookeeper. when a lots of FlinkDeployment was
deleting, operator will be spend to many time in cleanHaData. the jstack show
that reconcile thread was hang on disconnect with zookeeper. this made deleting
flinkdeployment was slowly.
!image-2024-04-16-15-56-33-426.png|width=502,height=263!
I don't understand why flink kubernetes operator need cleanHAdata , as
[~aitozi] comment in PR [FLINK-26336 Call cancel on deletion & clean up
configmaps as well
#28|https://github.com/apache/flink-kubernetes-operator/pull/28#discussion_r815968841]
{quote}it's a bit of out of scope of the operator responsibility or ability
{quote}
and I'm totally agree with his point.
and I want to know why we call don't call RestClusterClient#shutDownCluster
interface, which is
1. more graceful and reasonable (operator need not care whether flink app
enable ha or not) 2. compatible across flink versions .
--
This message was sent by Atlassian Jira
(v8.20.10#820010)