[jira] [Commented] (AIRFLOW-210) sqlalchemy warns about task_instance.execution_date invoked with an empty sequence

2016-10-19 Thread Jeff Long (JIRA)

[ 
https://issues.apache.org/jira/browse/AIRFLOW-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590289#comment-15590289
 ] 

Jeff Long commented on AIRFLOW-210:
---

A quick fix for this in 1.7.1.3 is adding the following around models.py:2633:

{code:title=models.py:2633}
task_instances = []
if len(execution_dates) > 0:
    task_instances = (
{code}
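
The guard above can be illustrated outside Airflow. The following is only a minimal sketch using the standard-library sqlite3 module rather than SQLAlchemy; the helper name find_task_instances and the two-column task_instance table are assumptions made for illustration, not Airflow's actual code or schema. It shows the same idea: return an empty list early instead of issuing an IN predicate with an empty sequence.

```python
import sqlite3


def find_task_instances(conn, execution_dates):
    """Hypothetical helper: rows whose execution_date is in execution_dates."""
    # Guard: with no dates there is nothing to match, so skip the query
    # instead of sending an empty IN (...) predicate to the database.
    if not execution_dates:
        return []
    placeholders = ", ".join("?" for _ in execution_dates)
    sql = (
        "SELECT task_id, execution_date FROM task_instance "
        "WHERE execution_date IN ({}) ORDER BY task_id".format(placeholders)
    )
    return conn.execute(sql, list(execution_dates)).fetchall()


# Illustrative in-memory table (assumed schema, not Airflow's).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE task_instance (task_id TEXT, execution_date TEXT)")
conn.executemany(
    "INSERT INTO task_instance VALUES (?, ?)",
    [("a", "2016-10-18"), ("b", "2016-10-19")],
)

empty_result = find_task_instances(conn, [])
matched = find_task_instances(conn, ["2016-10-19"])
```

With an empty list the function never touches the database, which is the behaviour the quick fix aims for.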


> sqlalchemy warns about task_instance.execution_date invoked with an empty 
> sequence
> --
>
> Key: AIRFLOW-210
> URL: https://issues.apache.org/jira/browse/AIRFLOW-210
> Project: Apache Airflow
> Issue Type: Bug
> Components: scheduler
> Affects Versions: Airflow 1.7.0
> Environment: Linux, python 2.7, and mysql 5.5.47
> Reporter: Eric Johnson
> Assignee: Ajay Yadava
> Priority: Minor
>
> On a fresh installation of airflow, I'm noticing this warning while running 
> the scheduler. I'm using mysql as the storage for airflow.
> {{/opt/packages/python/2.7.5/lib/python2.7/site-packages/sqlalchemy/sql/default_comparator.py:153:
>  SAWarning: The IN-predicate on "task_instance.execution_date" was invoked 
> with an empty sequence. This results in a contradiction, which nonetheless 
> can be expensive to evaluate.  Consider alternative strategies for improved 
> performance.}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AIRFLOW-385) Scheduler logs should have a "latest" directory

2016-10-19 Thread Ben Tallman (JIRA)

[ 
https://issues.apache.org/jira/browse/AIRFLOW-385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589859#comment-15589859
 ] 

Ben Tallman commented on AIRFLOW-385:
-

Not sure I'm worried about the latest flag, but moving to 
dag_id/execution_date/task_id.log would be very helpful...

> Scheduler logs should have a "latest" directory
> ---
>
> Key: AIRFLOW-385
> URL: https://issues.apache.org/jira/browse/AIRFLOW-385
> Project: Apache Airflow
> Issue Type: Improvement
> Components: logging
> Reporter: Dan Davydov
> Assignee: Vijay Bhat
> Priority: Minor
> Labels: beginner, logging
>
> Right now the scheduler logs for each parsed file look like 
> /tmp/airflow/scheduler/logs/2016-07-17/some-dag/...
> The problem is this makes it hard to find the latest logs which is the most 
> frequent use case.
> Ideally we would create and have the scheduler keep up to date a symlink to 
> the latest date /tmp/airflow/scheduler/logs/latest which would point to e.g. 
> /tmp/airflow/scheduler/logs/2016-07-17
> We might also want to consider changing the structure to dag/date instead of 
> date/dag too, but that can be done as a separate task if desired.
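
The "latest" symlink proposed in the description could be maintained roughly as follows. This is a sketch under stated assumptions, not the scheduler's implementation: the helper name update_latest_symlink is hypothetical, a POSIX filesystem is assumed, and a temporary directory stands in for /tmp/airflow/scheduler/logs.

```python
import os
import tempfile


def update_latest_symlink(log_root, date_dir_name):
    """Hypothetical helper: point <log_root>/latest at <log_root>/<date_dir_name>."""
    link_path = os.path.join(log_root, "latest")
    tmp_link = link_path + ".tmp"
    # Create the new link under a temporary name, then rename it over the
    # old one so readers never observe a missing "latest" entry.
    if os.path.lexists(tmp_link):
        os.remove(tmp_link)
    os.symlink(date_dir_name, tmp_link)  # relative target inside log_root
    os.replace(tmp_link, link_path)
    return link_path


# Illustrative run: two dated directories, "latest" follows the newest one.
log_root = tempfile.mkdtemp()
for day in ("2016-07-16", "2016-07-17"):
    os.makedirs(os.path.join(log_root, day))
    latest = update_latest_symlink(log_root, day)
```

Using a relative link target keeps the link valid if the whole log directory is moved.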





[jira] [Assigned] (AIRFLOW-385) Scheduler logs should have a "latest" directory

2016-10-19 Thread Vijay Bhat (JIRA)

 [ 
https://issues.apache.org/jira/browse/AIRFLOW-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vijay Bhat reassigned AIRFLOW-385:
--

Assignee: Vijay Bhat






[jira] [Created] (AIRFLOW-582) ti.get_dagrun() should not filter on start_date

2016-10-19 Thread Ben Tallman (JIRA)

Ben Tallman created AIRFLOW-582:
---

 Summary: ti.get_dagrun() should not filter on start_date
 Key: AIRFLOW-582
 URL: https://issues.apache.org/jira/browse/AIRFLOW-582
 Project: Apache Airflow
 Issue Type: Bug
 Reporter: Ben Tallman


The filter in ti.get_dagrun() should not include start_date; it should 
search only on dag_id and execution_date. The current code is:

@provide_session
def get_dagrun(self, session):
    """
    Returns the DagRun for this TaskInstance
    :param session:
    :return: DagRun
    """
    dr = session.query(DagRun).filter(
        DagRun.dag_id == self.dag_id,
        DagRun.execution_date == self.execution_date,
        DagRun.start_date == self.start_date
    ).first()

    return dr
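
The proposed lookup, keyed on dag_id and execution_date only, can be sketched outside Airflow. This is a minimal illustration using the standard-library sqlite3 module, not a patch: the three-column dag_run table and the standalone get_dagrun function are assumptions for the example. It shows that a run is still found when its start_date differs from the value a task instance carries.

```python
import sqlite3


def get_dagrun(conn, dag_id, execution_date):
    """Hypothetical stand-in: look up a DAG run by dag_id and execution_date only."""
    # start_date is deliberately left out of the WHERE clause.
    return conn.execute(
        "SELECT dag_id, execution_date, start_date FROM dag_run "
        "WHERE dag_id = ? AND execution_date = ? LIMIT 1",
        (dag_id, execution_date),
    ).fetchone()


# Illustrative in-memory table (assumed schema, not Airflow's).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE dag_run (dag_id TEXT, execution_date TEXT, start_date TEXT)"
)
conn.execute(
    "INSERT INTO dag_run VALUES (?, ?, ?)",
    ("example_dag", "2016-10-18T00:00:00", "2016-10-19T00:05:00"),
)

found = get_dagrun(conn, "example_dag", "2016-10-18T00:00:00")
missing = get_dagrun(conn, "example_dag", "2016-10-17T00:00:00")
```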



