Re: Bulk Scheduler timeout when creating several jobs in flink kubernetes HA deployment

2021-08-26 Thread Gil De Grove
Hello Matthias, I'll extract the logs from the cluster and update the thread here. For the TMs, I'll try to find the relevant logs; we had many of them deployed at that time, and not all of the logs may be worth uploading. Regards, Gil On Thu, Aug 26, 2021, 12:31 Matthias Pohl wrote: > Hi

Re: Bulk Scheduler timeout when creating several jobs in flink kubernetes HA deployment

2021-08-26 Thread Matthias Pohl
Hi Gil, could you provide the complete logs (TaskManager & JobManager) so that we can investigate? The error itself and the behavior you're describing sound like expected behavior if there are not enough slots available for all the submitted jobs to be handled in time. Have you tried increasing the

Bulk Scheduler timeout when creating several jobs in flink kubernetes HA deployment

2021-08-25 Thread Gil De Grove
Hello, We are struggling a bit with an error in our Kubernetes deployment. The deployment is composed of 2 Flink JobManagers and 58 TaskManagers. When deploying the jobs, everything goes fine at first, but after the deployment of several jobs (a mix of batch and streaming jobs using the SQL