Matt Casters created FLINK-28204:
------------------------------------

             Summary: Deleting a FlinkDeployment results in an error on the pod
                 Key: FLINK-28204
                 URL: https://issues.apache.org/jira/browse/FLINK-28204
             Project: Flink
          Issue Type: Bug
          Components: Kubernetes Operator
    Affects Versions: kubernetes-operator-1.0.0
         Environment: AWS EKS
{code:java}
kubectl version
Client Version: version.Info{Major:"1", Minor:"23", GitVersion:"v1.23.8", GitCommit:"a12b886b1da059e0190c54d09c5eab5219dd7acf", GitTreeState:"clean", BuildDate:"2022-06-17T22:27:29Z", GoVersion:"go1.17.10", Compiler:"gc", Platform:"linux/amd64"}
Server Version: version.Info{Major:"1", Minor:"22+", GitVersion:"v1.22.9-eks-a64ea69", GitCommit:"540410f9a2e24b7a2a870ebfacb3212744b5f878", GitTreeState:"clean", BuildDate:"2022-05-12T19:15:31Z", GoVersion:"go1.16.15", Compiler:"gc", Platform:"linux/amd64"}
{code}
            Reporter: Matt Casters

I didn't configure the memory settings of my Flink cluster correctly in the Flink deployment Yaml. So I thought I would delete the deployment but I'm getting this error in the log of the f-k-o pod:

{code:java}
2022-06-22 13:19:13,521 o.a.f.k.o.c.FlinkDeploymentController [INFO ][default/apache-hop-flink] Deleting FlinkDeployment
2022-06-22 13:19:13,521 i.j.o.p.e.ReconciliationDispatcher [ERROR][default/apache-hop-flink] Error during event processing ExecutionScope{ resource id: CustomResourceID{name='apache-hop-flink', namespace='default'}, version: 23709} failed.
java.lang.RuntimeException: Cannot create observe config before first deployment, this indicates a bug.
    at org.apache.flink.kubernetes.operator.config.FlinkConfigManager.getObserveConfig(FlinkConfigManager.java:137)
    at org.apache.flink.kubernetes.operator.service.FlinkService.cancelJob(FlinkService.java:357)
    at org.apache.flink.kubernetes.operator.reconciler.deployment.ApplicationReconciler.shutdown(ApplicationReconciler.java:327)
    at org.apache.flink.kubernetes.operator.reconciler.deployment.AbstractDeploymentReconciler.cleanup(AbstractDeploymentReconciler.java:56)
    at org.apache.flink.kubernetes.operator.reconciler.deployment.AbstractDeploymentReconciler.cleanup(AbstractDeploymentReconciler.java:37)
    at org.apache.flink.kubernetes.operator.controller.FlinkDeploymentController.cleanup(FlinkDeploymentController.java:107)
    at org.apache.flink.kubernetes.operator.controller.FlinkDeploymentController.cleanup(FlinkDeploymentController.java:59)
    at io.javaoperatorsdk.operator.processing.Controller$1.execute(Controller.java:68)
    at io.javaoperatorsdk.operator.processing.Controller$1.execute(Controller.java:50)
    at io.javaoperatorsdk.operator.api.monitoring.Metrics.timeControllerExecution(Metrics.java:34)
    at io.javaoperatorsdk.operator.processing.Controller.cleanup(Controller.java:49)
    at io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleCleanup(ReconciliationDispatcher.java:252)
    at io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleDispatch(ReconciliationDispatcher.java:72)
    at io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleExecution(ReconciliationDispatcher.java:50)
    at io.javaoperatorsdk.operator.processing.event.EventProcessor$ControllerExecution.run(EventProcessor.java:349)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
{code}

So in essence this leaves me in a state between not deployed and not able to delete the flinkdeployment.
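
For illustration, the failure path as I read it from the trace can be reduced to a sketch like the one below. The class and method bodies are hypothetical stand-ins written for this report, not the operator's actual code, and the guard shown is only one possible way such a cleanup could avoid the exception. It assumes the cleanup path asks for the observe config unconditionally, even when no deployment was ever recorded.

```java
import java.util.Optional;

public class CleanupSketch {

    /** Hypothetical stand-in for a config manager; not the operator's real class. */
    static class ConfigManager {
        // Stays empty until a first deployment has been recorded.
        private Optional<String> lastDeployedSpec = Optional.empty();

        void recordDeployment(String spec) {
            lastDeployedSpec = Optional.of(spec);
        }

        boolean hasDeployed() {
            return lastDeployedSpec.isPresent();
        }

        String getObserveConfig() {
            // Same message as in the log above; the logic around it is assumed.
            return lastDeployedSpec.orElseThrow(() -> new RuntimeException(
                    "Cannot create observe config before first deployment, this indicates a bug."));
        }
    }

    /** Mirrors the path in the trace: cleanup always requests the observe config. */
    static String cleanupUnguarded(ConfigManager cfg) {
        cfg.getObserveConfig(); // throws if nothing was ever deployed
        return "cancelled";
    }

    /** One possible guard (an assumption): skip cancellation when nothing was deployed. */
    static String cleanupGuarded(ConfigManager cfg) {
        if (!cfg.hasDeployed()) {
            return "nothing-to-cancel";
        }
        cfg.getObserveConfig();
        return "cancelled";
    }

    public static void main(String[] args) {
        ConfigManager neverDeployed = new ConfigManager();
        try {
            cleanupUnguarded(neverDeployed);
        } catch (RuntimeException e) {
            System.out.println("unguarded cleanup failed: " + e.getMessage());
        }
        System.out.println("guarded cleanup: " + cleanupGuarded(neverDeployed));
    }
}
```

With a resource that never deployed successfully, the unguarded variant fails on every delete attempt, which would match the stuck state described above, while the guarded variant lets the delete go through.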
--
This message was sent by Atlassian Jira
(v8.20.7#820007)