I haven’t used OKD but it sounds like OLM. If that’s the case, I’m assuming the operator was deployed to the “operators” namespace. In that case, you’ll need to create the RBACs and such in the Flink namespace for that deployment to work.
For example this needs to be in each namespace that you want to have Flink deployments in. kubectl apply -f - <<EOF apiVersion: v1 kind: ServiceAccount metadata: labels: app.kubernetes.io/name: flink-kubernetes-operator app.kubernetes.io/version: 1.5.0 name: flink --- apiVersion: rbac.authorization.k8s.io/v1 kind: Role metadata: labels: app.kubernetes.io/name: flink-kubernetes-operator app.kubernetes.io/version: 1.5.0 name: flink rules: - apiGroups: - "" resources: - pods - configmaps verbs: - '*' - apiGroups: - apps resources: - deployments - deployments/finalizers verbs: - '*' --- apiVersion: rbac.authorization.k8s.io/v1 kind: RoleBinding metadata: labels: app.kubernetes.io/name: flink-kubernetes-operator app.kubernetes.io/version: 1.5.0 name: flink-role-binding roleRef: apiGroup: rbac.authorization.k8s.io kind: Role name: flink subjects: - kind: ServiceAccount name: flink EOF Hopefully that helps. On Tue, Sep 19, 2023 at 5:40 PM Krzysztof Chmielewski < krzysiek.chmielew...@gmail.com> wrote: > Hi community, > I was wondering if anyone tried to deploy Flink using Flink k8s operator > on machine where OKD [1] is installed? > > We have tried to install Flink k8s operator version 1.6 which seems to > succeed, however when we try to deploy simple Flink deployment we are > getting an error. > > 2023-09-19 10:11:36,440 i.j.o.p.e.ReconciliationDispatcher > [ERROR][flink/test] Error during event processing ExecutionScope{ resource > id: ResourceID{name='test', namespace='flink'}, version: 684949788} failed. > > io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: > PUT at: > https://172.30.0.1:443/apis/flink.apache.org/v1beta1/namespaces/flink/flinkdeployments/test. > Message: FlinkDeployment.flink.apache.org "test" is invalid: > [spec.ingress: Invalid value: "null": spec.ingress in body must be of type > object: "null", spec.mode: Invalid value: "null": spec.mode in body must be > of type string: "null", spec.mode: Unsupported value: "null": supported > values: "native", "standalone", spec.logConfiguration: Invalid value: > "null": spec.logConfiguration in body must be of type object: "null", > spec.imagePullPolicy: Invalid value: "null": spec.imagePullPolicy in body > must be of type string: "null", spec.jobManager.podTemplate: Invalid value: > "null": spec.jobManager.podTemplate in body must be of type object: "null", > spec.jobManager.resource.ephemeralStorage: Invalid value: "null": > spec.jobManager.resource.ephemeralStorage in body must be of type string: > "null", spec.podTemplate: Invalid value: "null": spec.podTemplate in body > must be of type object: "null", spec.restartNonce: Invalid value: "null": > spec.restartNonce in body must be of type integer: "null", > spec.taskManager.replicas: Invalid value: "null": spec.taskManager.replicas > in body must be of type integer: "null", > spec.taskManager.resource.ephemeralStorage: Invalid value: "null": > spec.taskManager.resource.ephemeralStorage in body must be of type string: > "null", spec.taskManager.podTemplate: Invalid value: "null": > spec.taskManager.podTemplate in body must be of type object: "null", > spec.job: Invalid value: "null": spec.job in body must be of type object: > "null", .spec.taskManager.replicas: Invalid value: 0: > .spec.taskManager.replicas accessor error: <nil> is of the type <nil>, > expected int64]. Received status: Status(apiVersion=v1, code=422, > details=StatusDetails(causes=[StatusCause(field=spec.ingress, > message=Invalid value: "null": spec.ingress in body must be of type object: > "null", reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.mode, message=Invalid value: "null": spec.mode in > body must be of type string: "null", reason=FieldValueInvalid, > additionalProperties={}), StatusCause(field=spec.mode, message=Unsupported > value: "null": supported values: "native", "standalone", > reason=FieldValueNotSupported, additionalProperties={}), > StatusCause(field=spec.logConfiguration, message=Invalid value: "null": > spec.logConfiguration in body must be of type object: "null", > reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.imagePullPolicy, message=Invalid value: "null": > spec.imagePullPolicy in body must be of type string: "null", > reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.jobManager.podTemplate, message=Invalid value: > "null": spec.jobManager.podTemplate in body must be of type object: "null", > reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.jobManager.resource.ephemeralStorage, > message=Invalid value: "null": spec.jobManager.resource.ephemeralStorage in > body must be of type string: "null", reason=FieldValueInvalid, > additionalProperties={}), StatusCause(field=spec.podTemplate, > message=Invalid value: "null": spec.podTemplate in body must be of type > object: "null", reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.restartNonce, message=Invalid value: "null": > spec.restartNonce in body must be of type integer: "null", > reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.taskManager.replicas, message=Invalid value: "null": > spec.taskManager.replicas in body must be of type integer: "null", > reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.taskManager.resource.ephemeralStorage, > message=Invalid value: "null": spec.taskManager.resource.ephemeralStorage > in body must be of type string: "null", reason=FieldValueInvalid, > additionalProperties={}), StatusCause(field=spec.taskManager.podTemplate, > message=Invalid value: "null": spec.taskManager.podTemplate in body must be > of type object: "null", reason=FieldValueInvalid, additionalProperties={}), > StatusCause(field=spec.job, message=Invalid value: "null": spec.job in body > must be of type object: "null", reason=FieldValueInvalid, > additionalProperties={}), StatusCause(field=.spec.taskManager.replicas, > message=Invalid value: 0: .spec.taskManager.replicas accessor error: <nil> > is of the type <nil>, expected int64, reason=FieldValueInvalid, > additionalProperties={})], group=flink.apache.org, kind=FlinkDeployment, > name=test, retryAfterSeconds=null, uid=null, additionalProperties={}), > kind=Status, message=FlinkDeployment.flink.apache.org "test" is invalid: > [spec.ingress: Invalid value: "null": spec.ingress in body must be of type > object: "null", spec.mode: Invalid value: "null": spec.mode in body must be > of type string: "null", spec.mode: Unsupported value: "null": supported > values: "native", "standalone", spec.logConfiguration: Invalid value: > "null": spec.logConfiguration in body must be of type object: "null", > spec.imagePullPolicy: Invalid value: "null": spec.imagePullPolicy in body > must be of type string: "null", spec.jobManager.podTemplate: Invalid value: > "null": spec.jobManager.podTemplate in body must be of type object: "null", > spec.jobManager.resource.ephemeralStorage: Invalid value: "null": > spec.jobManager.resource.ephemeralStorage in body must be of type string: > "null", spec.podTemplate: Invalid value: "null": spec.podTemplate in body > must be of type object: "null", spec.restartNonce: Invalid value: "null": > spec.restartNonce in body must be of type integer: "null", > spec.taskManager.replicas: Invalid value: "null": spec.taskManager.replicas > in body must be of type integer: "null", > spec.taskManager.resource.ephemeralStorage: Invalid value: "null": > spec.taskManager.resource.ephemeralStorage in body must be of type string: > "null", spec.taskManager.podTemplate: Invalid value: "null": > spec.taskManager.podTemplate in body must be of type object: "null", > spec.job: Invalid value: "null": spec.job in body must be of type object: > "null", .spec.taskManager.replicas: Invalid value: 0: > .spec.taskManager.replicas accessor error: <nil> is of the type <nil>, > expected int64], metadata=ListMeta(_continue=null, remainingItemCount=null, > resourceVersion=null, selfLink=null, additionalProperties={}), > reason=Invalid, status=Failure, additionalProperties={}). > > at > io.fabric8.kubernetes.client.KubernetesClientException.copyAsCause(KubernetesClientException.java:238) > > at > io.fabric8.kubernetes.client.dsl.internal.OperationSupport.waitForResult(OperationSupport.java:518) > > at > io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleResponse(OperationSupport.java:535) > > at > io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleUpdate(OperationSupport.java:358) > > at > io.fabric8.kubernetes.client.dsl.internal.BaseOperation.handleUpdate(BaseOperation.java:708) > > at > io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.lambda$handleReplace$0(HasMetadataOperation.java:185) > > at > io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.handleReplace(HasMetadataOperation.java:190) > > at > io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.replace(HasMetadataOperation.java:101) > > at > io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.replace(HasMetadataOperation.java:45) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher$CustomResourceFacade.updateResource(ReconciliationDispatcher.java:387) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.conflictRetryingUpdate(ReconciliationDispatcher.java:343) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.updateCustomResourceWithFinalizer(ReconciliationDispatcher.java:316) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleReconcile(ReconciliationDispatcher.java:115) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleDispatch(ReconciliationDispatcher.java:89) > > at > io.javaoperatorsdk.operator.processing.event.ReconciliationDispatcher.handleExecution(ReconciliationDispatcher.java:62) > > at > io.javaoperatorsdk.operator.processing.event.EventProcessor$ReconcilerExecutor.run(EventProcessor.java:414) > > at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown > Source) > > at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown > Source) > > at java.base/java.lang.Thread.run(Unknown Source) > > The deployment we are trying to run is this: > > apiVersion: flink.apache.org/v1beta1 > > kind: FlinkDeployment > > metadata: > > namespace: flink > > name: test > > spec: > > mode: native > > image: flink:1.17 > > flinkVersion: v1_17 > > flinkConfiguration: > > taskmanager.numberOfTaskSlots: "2" > > serviceAccount: flink > > jobManager: > > resource: > > memory: "2048m" > > cpu: 1 > > taskManager: > > resource: > > memory: "2048m" > > cpu: 1 > > Regards, > Krzysztof Chmielewski > > > [1] https://lists.apache.org/thread/07d46txb6vttw7c8oyr6z4n676vgqh28 >