Hi Siddharth,

I created the ticket:

https://issues.apache.org/jira/browse/AIRFLOW-392

Let me know what I can do to help.

Cheers,
David

On Tue, Aug 2, 2016 at 8:10 PM, siddharth anand <san...@apache.org> wrote:

> Interesting. If you haven't already, can you create a Jira issue and attach an
> example DAG that I can run to reproduce the problem (likely capturing the code
> you have above)? You can then assign the bug to me to look into.
>
> Also, please provide enough context on your use-case and why you are
> structuring your code this way. It will help identify any alternatives.
> It's not clear to me what you want to do exactly.
> -s
>
> On Tue, Aug 2, 2016 at 8:02 PM, David Klosowski <dav...@thinknear.com>
> wrote:
>
> > start_date being updated isn't the issue here.  I haven't changed it.  New
> > execution_dates keep getting created for the past before any dags or
> > start_dates existed.
> >
> > Cheers,
> > David
> >
> > On Tue, Aug 2, 2016 at 7:10 PM, siddharth anand <san...@apache.org> wrote:
> >
> > > > The problem might be that the start_date does not get updated. I work
> > > > around this by changing the name of my dag. I do lose history as well,
> > > > but it works.
> > > >
> > > > My dags are named "some_dag_v1". When I change a start date, I update
> > > > the version suffix to force a reload: "some_dag_v2".
> > >
> > > -s
> > >
> > > On Tue, Aug 2, 2016 at 6:49 PM, David Klosowski <dav...@thinknear.com>
> > > wrote:
> > >
> > > > > I have a DAG that I just deployed, and the scheduler keeps scheduling
> > > > > runs for it in the past, going back two months.
> > > >
> > > > start_date: 8/5/2016
> > > >
> > > > scheduled runs started:
> > > > 7/3/2016
> > > > 6/5/2016
> > > >
> > > > Here is the gist of this DAG's architecture:
> > > >
> > > > > The DAG depends on another DAG's tasks through 7 dynamically built
> > > > > ExternalTaskSensors, which represent the 'daily' jobs, and then has a
> > > > > DummyOperator task that aggregates them and triggers the 'weekly' job
> > > > > task upon completion.
> > > >
> > > > Some of the code showcasing this:
> > > >
> > > > run_for_date = datetime(2016, 8, 2)
> > > >
> > > > args = {'owner': 'airflow',
> > > >         'depends_on_past': False,
> > > >         'start_date': run_for_date,
> > > >         'email': [alert_email],
> > > >         'email_on_failure': True,
> > > >         'email_on_retry': False,
> > > >         'retries': 1,
> > > >         'trigger_rule' : 'all_success'}
> > > >
> > > > dag = DAG(dag_id='weekly_no_track', default_args=args,
> > > >           schedule_interval=timedelta(days=7),
> > > >           max_active_runs=1)
> > > >
> > > >
> > > > downstream_task = dag.get_task('wait-for-dailies')
> > > > for weekday in [MO, TU, WE, TH, FR, SA, SU]:
> > > >     task_id = 'wait-for-daily-{day}'.format(day=weekday)
> > > >
> > > > >     # weekday(-1) subtracts 1 relative week from the given weekday;
> > > > >     # however, if the calculated date is already Monday, for example,
> > > > >     # -1 won't change the day.
> > > >     delta = relativedelta(weekday=weekday(-1))
> > > >
> > > > >     sensor = ExternalTaskSensor(task_id=task_id, dag=dag,
> > > > >                                 external_dag_id='daily_no_track',
> > > > >                                 external_task_id='daily-no-track',
> > > > >                                 execution_delta=delta,
> > > > >                                 timeout=86400)  # 86400 = 24 hours
> > > >     sensor.set_downstream(downstream_task)
> > > >
> > > >
> > > > > I don't understand what is going on.  Why is the scheduler doing this?
> > > > > I want the DAG to start considering dates from today and on in UTC.
> > > >
> > > > Cheers,
> > > > David
> > > >
> > >
> >
>
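
For anyone reading the code comment about weekday(-1) in the quoted DAG: the "most recent given weekday, on or before the date" behaviour it describes can be sketched with the standard library alone. This is only an illustration of that semantics under my reading of the comment, not dateutil's implementation and not what the sensor runs; the helper name most_recent_weekday is hypothetical.

```python
from datetime import date, timedelta


def most_recent_weekday(day: date, target: int) -> date:
    """Return the latest date on or before `day` that falls on `target`.

    `target` follows date.weekday(): 0 = Monday ... 6 = Sunday.
    Hypothetical helper for illustration only.
    """
    # Number of days to step back; 0 when `day` already is the target weekday.
    days_back = (day.weekday() - target) % 7
    return day - timedelta(days=days_back)


if __name__ == "__main__":
    tuesday = date(2016, 8, 2)  # the start_date used in the quoted code
    for target in range(7):
        print(target, most_recent_weekday(tuesday, target))
```

Run for each of the seven weekdays, this yields seven consecutive dates ending on the reference date, which matches the intent of building one sensor per daily run.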
