Re: Task Failure Strategy for Adaptive Scheduler

2023-04-18 Thread Talat Uyarer via user
Hi David, Yes, we have multiple disjoint DAGs in one job. We want better CPU utilization. Open-source Flink has a scheduling issue with those types of jobs. I made a fix on 1.13 with the adaptive scheduler (AS). Now we are scheduling evenly for all DAGs. However, when we get an exception on a DAG we don't want to a

Re: Task Failure Strategy for Adaptive Scheduler

2023-04-18 Thread David Morávek
> Our DAG has multiple sources which are not connected to each other. To double-check, are you saying the job consists of multiple disjoint DAGs? > Do you think somehow the adaptive scheduler supports region pipelines for streaming jobs? It's doable but might not be straightforward since the A

Re: Task Failure Strategy for Adaptive Scheduler

2023-04-14 Thread Talat Uyarer via user
Thanks David and others. Our DAG has multiple sources which are not connected to each other. If one of them fails, I believe Flink can restart a single region with the default scheduler, but that is not the case for the adaptive scheduler. Do you think somehow the adaptive scheduler supports region pipel

Re: Task Failure Strategy for Adaptive Scheduler

2023-04-12 Thread David Morávek
Hi Talat, For most streaming pipelines, we have to restart the whole pipeline no matter the scheduler used because they're a single pipelined region. One limitation of AdaptiveScheduler is the lack of support for local recovery. This will be addressed in Flink 1.18 [1]. [1] https://issues.apache.
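
To illustrate the point about pipelined regions, here is a rough sketch in Python. It is not Flink code and does not use any Flink API. It assumes a simplified model in which every edge of the job graph is pipelined (as in a typical streaming job), so a region is just a connected component of the graph. The function names `pipelined_regions` and `tasks_to_restart` and the vertex names are made up for the example.

```python
from collections import defaultdict


def pipelined_regions(vertices, edges):
    """Group vertices into regions, assuming every edge is pipelined.

    Under that assumption a region is a connected component of the
    undirected version of the job graph.
    """
    adjacency = defaultdict(set)
    for upstream, downstream in edges:
        adjacency[upstream].add(downstream)
        adjacency[downstream].add(upstream)

    regions = []
    seen = set()
    for start in vertices:
        if start in seen:
            continue
        # Depth-first walk collecting everything reachable from `start`.
        region = set()
        stack = [start]
        while stack:
            vertex = stack.pop()
            if vertex in region:
                continue
            region.add(vertex)
            stack.extend(adjacency[vertex] - region)
        seen |= region
        regions.append(region)
    return regions


def tasks_to_restart(failed_task, regions):
    """Return the region containing the failed task (hypothetical helper)."""
    for region in regions:
        if failed_task in region:
            return region
    raise KeyError(failed_task)


# Hypothetical job with two disjoint DAGs.
vertices = ["source_a", "map_a", "sink_a", "source_b", "sink_b"]
edges = [("source_a", "map_a"), ("map_a", "sink_a"), ("source_b", "sink_b")]

regions = pipelined_regions(vertices, edges)
restart = tasks_to_restart("map_a", regions)
```

In this model a job whose sources all feed one connected graph yields a single region, so any failure restarts everything. A job made of disjoint DAGs yields one region per DAG, which is the case where region-based recovery could restart less than the whole job.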

Re: Task Failure Strategy for Adaptive Scheduler

2023-04-10 Thread Weihua Hu
Hi, AFAIK, the reactive mode always restarts the whole pipeline now. Best, Weihua On Tue, Apr 11, 2023 at 8:38 AM Talat Uyarer via user wrote: > Hi All, > > We use Flink 1.13 with reactive mode for our streaming jobs. When we have > an issue/exception on our pipeline. Flink rescheduled all ta

Task Failure Strategy for Adaptive Scheduler

2023-04-10 Thread Talat Uyarer via user
Hi All, We use Flink 1.13 with reactive mode for our streaming jobs. When we have an issue/exception in our pipeline, Flink reschedules all tasks. Is there any way to reschedule only the tasks that had exceptions? Thanks