should have any impact on maste   --->  should not have any impact on maste

gabrywu <[email protected]> 于2020年5月21日周四 下午4:19写道:

> 100 sql tasks belonging to a business package. So heavyweight business
> logic ! It will be difficult to maintain.
> Maybe we can create a new JOB type called DAG, which split the 100 SQLs
> depend on their data lineage.
> Meanwhile the DAG job has some limitations ,for example ,25 subtask at
> most.
>
> Too many jobs under one label should have any impact on master,  jobs
> reached same schedule time will.
> After all ,Jobs under one label will not be triggered at the same time.
>
> Hemin Wen <[email protected]> 于2020年5月21日周四 上午10:34写道:
>
>> First, There are 100 (for example) sql tasks belonging to a business
>> package, Does not need to be split,
>> scheduling still has to serve the actual business.
>>
>> I agree with the concept of JOB, I understand what you mean is that the
>> master scheduling granularity is changed to job level,
>> But, If there are too many jobs under one label, it will affect the
>> execution of other label jobs,
>> So I think the current master is correct at the workflow level, parallel
>> between workflows, queued within workflows.
>>
>> I understand that this need to solve is the dependency between batch tasks
>> and batch creation tasks.
>> The list is for easier viewing of tasks. User manual maintenance of batch
>> tasks is a time-consuming operation.
>> So, I suggest to start with how to solve creating batch tasks and how to
>> solve dependencies between tasks.
>>
>>
>> --------------------
>> DolphinScheduler(Incubator) Commtter
>> Hemin Wen  温合民
>> [email protected]
>> --------------------
>>
>>
>> GabryWu <[email protected]> 于2020年5月20日周三 下午9:58写道:
>>
>> > If DAG removed ,master only need to pick up JOBs which reached schedule
>> > time, and dispatches them to one worker .However the JOBs will be in the
>> > queue before dispatched .
>> > The master will be lightweight , and complement work will be easy. The
>> > JOBs which have same label will be dispatched to workers and they will
>> > execute parallelly or in a sequence  relying on their DEPENDENCY
>> >
>> > ---Original---
>> > *From:* "GabryWu"<[email protected]>
>> > *Date:* Wed, May 20, 2020 18:41 PM
>> > *To:* "dev"<[email protected]>;
>> > *Cc:* "wenhemin"<[email protected]>;
>> > *Subject:* Re: How do you think Task DAG dependency and List dependency?
>> >
>> > In that case ,I recommend spliting the 100 SQLs into different JOB which
>> > has same label, each JOB has it's own business logic .Finally Add
>> > DEPENDENCY to each JOB. JOBs having same label can be shown in front
>> page
>> > through DAG
>> >
>> >
>> > ---Original---
>> > *From:* "Hemin Wen"<[email protected]>
>> > *Date:* Wed, May 20, 2020 17:38 PM
>> > *To:* "GabryWu"<[email protected]>;
>> > *Subject:* Re: How do you think Task DAG dependency and List dependency?
>> >
>> > For example, A workflow has 100 sql scripts task, tasks depend on each
>> > other through data lineage.
>> > This requirement is not suitable for DAG configuration.
>> >
>> >
>> > --------------------
>> > DolphinScheduler(Incubator) Commtter
>> > Hemin Wen  温合民
>> > [email protected]
>> > --------------------
>> >
>> >
>> > GabryWu <[email protected]> 于2020年5月20日周三 下午5:31写道:
>> >
>> >> what do you mean batch tasks
>> >>
>> >> ------------------ 原始邮件 ------------------
>> >> *发件人:* "wenhemin"<[email protected]>;
>> >> *发送时间:* 2020年5月20日(星期三) 下午5:04
>> >> *收件人:* "dev"<[email protected]>;
>> >> *主题:* Re: How do you think Task DAG dependency and List dependency?
>> >>
>> >> I do not recommend remove DAG.
>> >> I think, DS lacks support for batch tasks.
>> >> DAG solves the expression of different types of tasks, there is no way
>> to
>> >> express batch tasks of the same type.
>> >>
>> >> I don't know if what I understand is accurate, It is recommended to
>> >> understand in depth the problem that users
>> >> want to solve by using "List dependency”.
>> >>
>> >> --------------------
>> >> DolphinScheduler(Incubator) Commtter
>> >> Hemin Wen  温合民
>> >> [email protected]
>> >> --------------------
>> >>
>> >>
>> >> lidong dai <[email protected]> 于2020年5月20日周三 下午3:40写道:
>> >>
>> >> > I know what you said,  there maybe need more people to discuss this
>> >> topic,
>> >> > I want to know other's opinion, how do they think this question
>> >> >
>> >> >
>> >> >
>> >> > Best Regards
>> >> > ---------------
>> >> > DolphinScheduler(Incubator) PPMC
>> >> > Lidong Dai 代立冬
>> >> > [email protected]
>> >> > ---------------
>> >> >
>> >> >
>> >> > GabryWu <[email protected]> 于2020年5月19日周二 上午7:19写道:
>> >> >
>> >> > > not actually. I mean refactor backend code to remove DAG, where
>> only
>> >> JOB
>> >> > > and DEPENDENCY remained
>> >> > >
>> >> > > ---Original---
>> >> > > *From:* "lidong dai"<[email protected]>
>> >> > > *Date:* Mon, May 18, 2020 22:47 PM
>> >> > > *To:* "GabryWu"<[email protected]>;
>> >> > > *Cc:* "dev"<[email protected]>;
>> >> > > *Subject:* Re: How do you think Task DAG dependency and List
>> >> dependency?
>> >> > >
>> >> > > I think your said is "List dependency", List dependency is also
>> DAG,
>> >> it's
>> >> > > only different in use.  when use List dependency, you can add your
>> >> > upstream
>> >> > > dependency for the task, this is convenient for hugely tasks in one
>> >> > > workflow
>> >> > >
>> >> > >
>> >> > > Best Regards
>> >> > > ---------------
>> >> > > DolphinScheduler(Incubator) PPMC
>> >> > > Lidong Dai 代立冬
>> >> > > [email protected]
>> >> > > ---------------
>> >> > >
>> >> > >
>> >> > > GabryWu <[email protected]> 于2020年5月13日周三 上午11:08写道:
>> >> > >
>> >> > >> Until now ,DAG is one physical  concept, which means that DAG is
>> >> heavy
>> >> > >> class , and introduces other classes and stored in a big json
>> field.
>> >> > >> On the bigdata platform , DAG is not a good concept for
>> Schedulers,
>> >> > which
>> >> > >> can be abandoned.
>> >> > >> If DAG was abandoned,   JOB and DEPENDENCY will simply the
>> >> architecture
>> >> > >> and make DolphinScheduler stabilize and extended easily and
>> easier to
>> >> > >> search jobs
>> >> > >> However abandoning DAG doesn't mean remove DAG graph in the front
>> >> end,
>> >> > >> which is also an important visualization way for JOBs
>> >> > >> We can add one Job and Dependency manually ,and visualize them in
>> DAG
>> >> > >> graph automatically
>> >> > >>
>> >> > >>
>> >> > >>
>> >> > >> ------------------ 原始邮件 ------------------
>> >> > >> *发件人:* "lidong dai"<[email protected]>;
>> >> > >> *发送时间:* 2020年5月12日(星期二) 下午5:26
>> >> > >> *收件人:* "dev"<[email protected]>;
>> >> > >> *主题:* Re: How do you think Task DAG dependency and List
>> dependency?
>> >> > >>
>> >> > >> yes, your description is proper, thanks
>> >> > >>
>> >> > >>
>> >> > >> Best Regards
>> >> > >> ---------------
>> >> > >> DolphinScheduler(Incubator) PPMC
>> >> > >> Lidong Dai 代立冬
>> >> > >> [email protected]
>> >> > >> ---------------
>> >> > >>
>> >> > >>
>> >> > >> leon bao <[email protected]> 于2020年5月12日周二 下午5:06写道:
>> >> > >>
>> >> > >> > i think you want to show the DAG using list mode.
>> >> > >> > DS show DAG using graphic mode, that will have some problems:
>> >> > >> >
>> >> > >> > 1. once the number of tasks is large,  graphic mode will be
>> >> confusing.
>> >> > >> > 2. it is difficult to find the specified task in a complex DAG.
>> >> > >> >
>> >> > >> > lidong dai <[email protected]> 于2020年5月12日周二 下午4:31写道:
>> >> > >> >
>> >> > >> > > I want to say do we need to implement List dependency?
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > > Best Regards
>> >> > >> > > ---------------
>> >> > >> > > DolphinScheduler(Incubator) PPMC
>> >> > >> > > Lidong Dai 代立冬
>> >> > >> > > [email protected]
>> >> > >> > > ---------------
>> >> > >> > >
>> >> > >> > >
>> >> > >> > > JUN GAO <[email protected]> 于2020年5月12日周二 下午12:26写道:
>> >> > >> > >
>> >> > >> > > > Hi , @lidong dai
>> >> > >> > > > Sorry , I don`t know your mean about this .
>> >> > >> > > > You want to discuss how to implement Task DAG dependency and
>> >> List
>> >> > >> > > > dependency ? Or discuss what is Task DAG dependency and List
>> >> > >> > dependency ?
>> >> > >> > > > you can show us what`s your need and ideas , So that we can
>> >> know
>> >> > the
>> >> > >> > > topic
>> >> > >> > > > of discussion.
>> >> > >> > > >
>> >> > >> > > > Thank you !
>> >> > >> > > >
>> >> > >> > > > lidong dai <[email protected]> 于2020年5月11日周一 下午10:34写道:
>> >> > >> > > >
>> >> > >> > > > > hi ,
>> >> > >> > > > >
>> >> > >> > > > >    Task dependency between upstream and downstream  called
>> >> List
>> >> > >> > > > dependency
>> >> > >> > > > > do you have ideas about this topic?
>> >> > >> > > > >
>> >> > >> > > > >
>> >> > >> > > > >
>> >> > >> > > > > Best Regards
>> >> > >> > > > > ---------------
>> >> > >> > > > > DolphinScheduler(Incubator) PPMC
>> >> > >> > > > > Lidong Dai 代立冬
>> >> > >> > > > > [email protected]
>> >> > >> > > > > ---------------
>> >> > >> > > > >
>> >> > >> > > >
>> >> > >> > > >
>> >> > >> > > > --
>> >> > >> > > >
>> >> > >> > > > DolphinScheduler(Incubator)  PPMC
>> >> > >> > > > Jun Gao 高俊
>> >> > >> > > > [email protected]
>> >> > >> > > >
>> >> > >> > >
>> >> > >> >
>> >> > >> >
>> >> > >> > --
>> >> > >> > DolphinScheduler(Incubator)  PPMC
>> >> > >> > BaoLiang 鲍亮
>> >> > >> > [email protected]
>> >> > >> >
>> >> > >>
>> >> > >>
>> >> >
>> >>
>> >>
>>
>

Reply via email to