Hi Jeff,

I need it for a research prototype exploring joint DAG scheduling; so there
are cases where we want some stages to be delayed. The current DAGScheduler
(NaturalOrderControlled) immediately sends the map tasks to be scheduled.

I wrote up a DAGScheduler, which can read a flat file, and release tasks to
be scheduled when natural order and time constraints are satisfied. The
code is at (
https://github.com/raajay/tez/blob/crossquery/tez-dag/src/main/java/org/apache/tez/dag/app/dag/impl/DAGSchedulerCrossQuery.java)
if you are interested.


Basically, upon creating an instance of a DAG Scheduler, an Executor
Services is launched to periodically call a function in DAGScheduler that
clears out the pending events.

I would like to avoid reading the start times from a flat file; is it
possible to send it
as a string via "localResources"? If yes, what methods should I use ?

Thanks
Raajay

On Wed, Jan 13, 2016 at 11:58 AM, Jianfeng (Jeff) Zhang <
jzh...@hortonworks.com> wrote:

>
> Hi Raajay,
>
> I don’t think this would it be an easy change (Maybe you can do it
> VertexManager, but may still not be trivial change),  do you mind tell
> what’s your purpose for waiting for a period of time ?
>
>
> Best Regard,
> Jeff Zhang
>
>
> From: Raajay <raaja...@gmail.com>
> Reply-To: "user@tez.apache.org" <user@tez.apache.org>
> Date: Wednesday, January 13, 2016 at 9:20 AM
> To: "user@tez.apache.org" <user@tez.apache.org>
> Subject: DAGScheduler
>
> Hello,
>
> I want to implement a DAGScheduler that ensures that all (or some) map
> vertices of a given dag are not scheduled immediately, but say after a
> configurable amount of time, T.
>
> I can think to do this in 3 following ways:
>
> 1. I would buffer all the "scheduleTask" events and then raise
> "TaskAttemptEventSchedule" after time, T. For this, I would need a clock
> with in DAGScheduler. I can also subscribe to an event raised periodically.
> Is there such an eventGenerator in Tez ?
>
> 2. Using delayed events. Is there support to raised delayed events ?
>
> 3. Modify the priority levels in "DAGSchedulerNaturalOrderControlled"
> based on T, to delay the start of vertices ?
>
>
> Which of the three is easiest to implement and possibly have limited
> side-effects ? Any help/pointers is appreciated.
>
> Thanks
> Raajay
>
>
>

Reply via email to