Re: flink kubernetes flink autoscale behavior

2024-06-24 Thread Rion Williams
Hi Eric,I believe you might be referring to use of the adaptive scheduler which should support these “in-place” scaling operations via:jobmanager.scheduler: adaptiveYou can see the documentation for Elastic Scaling here for additional details and configuration.On Jun 24, 2024, at 11:56 PM, Enric

Re:Re: Backpressure issue with Flink Sql Job

2024-06-24 Thread Xuyang
Hi, Ashish. Can you confirm whether, on the subtask label page of this sink materializer node, the input records for each subtask are approximately the same? If the input records for subtask number 5 are significantly larger compared to the others, it signifies a serious data skew, and it

Re: Understanding flink-autoscaler behavior

2024-06-24 Thread Zhanghao Chen
You can try session mode with only one job, but still with adaptive scheduler disabled. When stopping a session job, the TMs won't be released immediately and can be reused later. Best, Zhanghao Chen From: Chetas Joshi Sent: Tuesday, June 25, 2024 1:59 To:

flink kubernetes flink autoscale behavior

2024-06-24 Thread Enric Ott
Hello,Community: I??ve recently started using the Flink Kubernetes Operator,and I'd like to know if CPU and Job Parallelism autoscaling are supported without restarting the whole job,if it??s supported, please tell me how to configure and deploy it. Thanks.

Re: Understanding flink-autoscaler behavior

2024-06-24 Thread Chetas Joshi
Hello, After disabling the adaptive scheduler, I was able to have the operator stop the job with a savepoint, and resume the job from that savepoint after the upgrade. However I observed that the upgrade life cycle is quite slow as it takes down and then brings back up all the task managers. I am

Re: Backpressure issue with Flink Sql Job

2024-06-24 Thread Penny Rastogi
Hi Ashish, Can you check a few things. 1. Is your source broker count also 20 for both topics? 2. You can try increasing the state operation memory and reduce the disk I/O. - - Increase the number of CU resources in a single slot. - Set optimization parameters: -

Backpressure issue with Flink Sql Job

2024-06-24 Thread Ashish Khatkar via user
Hi all, We are facing backpressure in the flink sql job from the sink and the backpressure only comes from a single task. This causes the checkpoint to fail despite enabling unaligned checkpoints and using debloating buffers. We enabled flamegraph and the task spends most of the time doing