Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-11-08 Thread Lijie Wang
Hi all, So far, I think we have reached an agreement on this FLIP. I have started the voting thread [1]. Please cast your vote there or ask additional questions here. [1] https://lists.apache.org/thread/58yyxvnygw7oy5t556g8rm9y2xzb1l66 Best, Lijie Lijie Wang 于2021年11月5日周五 下午4:04写道: > Hi all,

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-11-05 Thread Lijie Wang
Hi all, Thanks for all the comments on this FLIP. Based on the discussion in the mailing list and comments in the wiki, I updated the FLIP doc, the mainly changes include: 1. Added the limitation that currently only supports ALL-EDGES-BLOCKING batch jobs*.* 2. Adopted some suggestions in the

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-11-03 Thread Till Rohrmann
I have to admit that I cannot think of a better name for the adaptive batch scheduler atm. Maybe it is good enough to call the two schedulers AdaptiveBatchScheduler and AdaptiveStreamingScheduler to tell which scheduler is used for which execution mode. It is true, though, that the former is

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-11-03 Thread Lijie Wang
Hi David, Thanks for your comments. I personally think that "Adaptive" means: Flink automatically determines the appropriate scheduling and execution plan based on some information. The information can include both resource information and workload information, rather than being limited to a

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-11-02 Thread David Morávek
Hi, thanks for drafting the FLIP, Lijie and Zhu Zhu. It already looks pretty solid and it will be a really great improvement to the batch scheduling. I'd second to the Till's feedback, especially when it comes to the consistent behavior between different deployment types / schedulers. What I'm

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-10-31 Thread Lijie Wang
Hi, Till & Zhu Thanks for your feedback. Also thanks for your comments and suggestions on wiki, which are very helpful for perfecting the FLIP. I also agree to provide our users with consistent and easy-to-understand deployment options. Regarding the three options proposed by Till, my opinion

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-10-30 Thread Zhu Zhu
Hi Till, Thanks for the comments! I agree with you that we should avoid an auto-scaled job not able to be scheduled in standalone/reactive mode. And I think it's great if we can expose a deployment option that is consistent for streaming and batch jobs, which can be easier to understand. Just

Re: [DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-10-29 Thread Till Rohrmann
Hi Lijie, Thanks for drafting this FLIP together with Zhu Zhu :-) I like the idea of making the parallelism of operators of a bounded job dependent on the data size. This makes the job adjust automatically when the data sources/sizes change. I can see this work well in combination with the

[DISCUSS] FLIP-187: Adaptive Batch Job Scheduler

2021-10-20 Thread Lijie Wang
Hi all, Zhu Zhu and I propose to introduce a new job scheduler to Flink: adaptive batch job scheduler. The new scheduler can automatically decide parallelisms of job vertices for batch jobs, according to the size of data volume each vertex needs to process. Major benefits of this scheduler