[jira] [Created] (AIRFLOW-2447) Fix TestHiveMetastoreHook to run all cases
Kengo Seki created AIRFLOW-2447: --- Summary: Fix TestHiveMetastoreHook to run all cases Key: AIRFLOW-2447 URL: https://issues.apache.org/jira/browse/AIRFLOW-2447 Project: Apache Airflow Issue Type: Bug Components: hive_hooks, tests Reporter: Kengo Seki Assignee: Kengo Seki TestHiveMetastoreHook has a method called {{get_databases}} but it isn't executed since its name doesn't start with {{test_}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (AIRFLOW-2429) Make Airflow Flake8 compliant
[ https://issues.apache.org/jira/browse/AIRFLOW-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Feng reassigned AIRFLOW-2429: - Assignee: Tao Feng > Make Airflow Flake8 compliant > -- > > Key: AIRFLOW-2429 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2429 > Project: Apache Airflow > Issue Type: Improvement >Reporter: Fokko Driesprong >Assignee: Tao Feng >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2446) Add S3ToRedshiftTransfer to the "Integration" doc
[ https://issues.apache.org/jira/browse/AIRFLOW-2446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kengo Seki updated AIRFLOW-2446: Summary: Add S3ToRedshiftTransfer to the "Integration" doc (was: Add S3ToRedshiftTransfer into the "Integration" doc) > Add S3ToRedshiftTransfer to the "Integration" doc > - > > Key: AIRFLOW-2446 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2446 > Project: Apache Airflow > Issue Type: Improvement > Components: aws, docs, Documentation, redshift >Reporter: Kengo Seki >Assignee: Kengo Seki >Priority: Minor > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (AIRFLOW-2402) Airflow 1.10 Logs UI throws oops error
[ https://issues.apache.org/jira/browse/AIRFLOW-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Yang reassigned AIRFLOW-2402: --- Assignee: Kevin Yang (was: Ramki Subramanian) > Airflow 1.10 Logs UI throws oops error > -- > > Key: AIRFLOW-2402 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2402 > Project: Apache Airflow > Issue Type: Bug > Components: authentication, ui >Affects Versions: 1.10.0 >Reporter: Ramki Subramanian >Assignee: Kevin Yang >Priority: Major > Fix For: 1.10.0 > > > Hi, > I am getting an error at > [incubator-airflow/airflow/www_rbac/views.py|https://github.com/apache/incubator-airflow/blob/4d64ad4928f0188f7532936e8da6612f5ec7170d/airflow/www_rbac/views.py#L454] > Line 454 in > [4d64ad4|https://github.com/apache/incubator-airflow/commit/4d64ad4928f0188f7532936e8da6612f5ec7170d] > | |logs[i] = log.decode('utf-8')| > > {{/home/user/lib/python2.7/site-packages/airflow/www_rbac/views.py", line > 454, in log logs[i] = log.decode('utf-8') AttributeError: 'list' object has > no attribute 'decode' }} > Not sure if someone is already looking into this, or I am missing some config? > Branch : 1.10_test > More Info here: > [https://github.com/apache/incubator-airflow/commit/05e1861e24de42f9a2c649cd93041c5c744504e1#diff-77df5adb32d964f37748c4557ffb3c4c] > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (AIRFLOW-2446) Add S3ToRedshiftTransfer into the "Integration" doc
Kengo Seki created AIRFLOW-2446: --- Summary: Add S3ToRedshiftTransfer into the "Integration" doc Key: AIRFLOW-2446 URL: https://issues.apache.org/jira/browse/AIRFLOW-2446 Project: Apache Airflow Issue Type: Improvement Components: aws, docs, Documentation, redshift Reporter: Kengo Seki Assignee: Kengo Seki -- This message was sent by Atlassian JIRA (v7.6.3#76005)
incubator-airflow git commit: closes apache/incubator-airflow#3325 *Need to be reopened and follow contributor guidelines*
Repository: incubator-airflow Updated Branches: refs/heads/master 2a55ffe0c -> 06017f66e closes apache/incubator-airflow#3325 *Need to be reopened and follow contributor guidelines* Project: http://git-wip-us.apache.org/repos/asf/incubator-airflow/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-airflow/commit/06017f66 Tree: http://git-wip-us.apache.org/repos/asf/incubator-airflow/tree/06017f66 Diff: http://git-wip-us.apache.org/repos/asf/incubator-airflow/diff/06017f66 Branch: refs/heads/master Commit: 06017f66eed0aef2c9c441bbd0ae5869059d011e Parents: 2a55ffe Author: r39132Authored: Wed May 9 14:14:04 2018 -0700 Committer: r39132 Committed: Wed May 9 14:14:04 2018 -0700 -- --
[jira] [Created] (AIRFLOW-2445) Allow templating for kubernetes operator
Sergio B created AIRFLOW-2445: - Summary: Allow templating for kubernetes operator Key: AIRFLOW-2445 URL: https://issues.apache.org/jira/browse/AIRFLOW-2445 Project: Apache Airflow Issue Type: Bug Reporter: Sergio B Allow templating in command, arguments and environment variables -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2445) Allow templating for kubernetes operator
[ https://issues.apache.org/jira/browse/AIRFLOW-2445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergio B updated AIRFLOW-2445: -- Issue Type: Improvement (was: Bug) > Allow templating for kubernetes operator > > > Key: AIRFLOW-2445 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2445 > Project: Apache Airflow > Issue Type: Improvement >Reporter: Sergio B >Priority: Major > > Allow templating in command, arguments and environment variables -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (AIRFLOW-2444) Remove unused option(include_adhoc) in cli backfill command
Tao Feng created AIRFLOW-2444: - Summary: Remove unused option(include_adhoc) in cli backfill command Key: AIRFLOW-2444 URL: https://issues.apache.org/jira/browse/AIRFLOW-2444 Project: Apache Airflow Issue Type: Bug Reporter: Tao Feng Assignee: Tao Feng Include_adhoc is removed in this pr([https://github.com/apache/incubator-airflow/pull/1667/files).] This option currently doesn't do anything. Just remove it. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (AIRFLOW-2253) Adding Airflow CLI instrumentation
[ https://issues.apache.org/jira/browse/AIRFLOW-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469396#comment-16469396 ] Jin Hyuk Chang commented on AIRFLOW-2253: - [~kevcampb], I could be wrong, but I believe it's a correct behavior of initdb to just create table but not adding any record in it. The error message is just for information but it does not actually make "initdb" operation fail. I've posted a PR to remove cli_logger on "initdb". Let me know if you have any issue. Thanks, Jin > Adding Airflow CLI instrumentation > -- > > Key: AIRFLOW-2253 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2253 > Project: Apache Airflow > Issue Type: Improvement >Reporter: Jin Hyuk Chang >Assignee: Jin Hyuk Chang >Priority: Major > > We are trying to improve user experience on Airflow particularly on Airflow > CLI where our user does most of testing in CLI. As a first step, we'd like to > instrument user's activity on CLI. > Currently, Airflow instruments user activity from web UI via > www.utils.action_logger, but not on CLI and we are trying to instrument user > activity from Airflow CLI as well. By default, we'd like to instrument same > as what current behavior from Web UI. > Should it need different instrumentation, we will provide a registry method > to register callback so that customized instrument can be easily plugged in. > (e.g: We have our customized event topic that already wired with ingestion > pipeline. Having pluggable callback, making integration with customized > instrumentation a breeze.) > Thanks! > Jin -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2442) Airflow run command leaves database connections open
[ https://issues.apache.org/jira/browse/AIRFLOW-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alejandro Fernandez updated AIRFLOW-2442: - Summary: Airflow run command leaves database connections open (was: Airflow run command leaves database connections open,) > Airflow run command leaves database connections open > > > Key: AIRFLOW-2442 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2442 > Project: Apache Airflow > Issue Type: Bug > Components: cli >Affects Versions: Airflow 1.8, 1.8.0 >Reporter: Alejandro Fernandez >Assignee: Alejandro Fernandez >Priority: Major > Fix For: Airflow 2.0 > > Attachments: connection_duration_1_hour.png, db_connections.png, > fixed_before_and_after.jpg, monthly_db_connections.png, running_tasks.png > > > *Summary* > The "airflow run" command creates a connection to the database and leaves it > open (until killed by SQLALchemy later). The number of these connections can > skyrocket whenever hundreds/thousands of tasks are launched simultaneously, > and potentially hit the database connection limit. > The problem is that in cli.py, the run() method first calls > {code:java} > settings.configure_orm(disable_connection_pool=True){code} > correctly > to use a NullPool, but then parses any custom configs and again calls > {code:java} > settings.configure_orm(){code} > , thereby overriding the desired behavior by instead using a QueuePool. > The QueuePool uses the default configs for SQL_ALCHEMY_POOL_SIZE and > SQL_ALCHEMY_POOL_RECYCLE. This means that while the task is running and the > executor is sending heartbeats, the sleeping connection is idle until it is > killed by SQLAlchemy. > This fixes a bug introduced by > [https://github.com/apache/incubator-airflow/pull/1934] in > [https://github.com/apache/incubator-airflow/pull/1934/commits/b380013634b02bb4c1b9d1cc587ccd12383820b6#diff-1c2404a3a60f829127232842250ff406R344] > > which is present in branches 1-8-stable, 1-9-stable, and 1-10-test > NOTE: Will create a PR once I've done more testing since I'm on an older > branch. For now, attaching a patch file [^AIRFLOW-2442.patch] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2442) Airflow run command leaves database connections open,
[ https://issues.apache.org/jira/browse/AIRFLOW-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alejandro Fernandez updated AIRFLOW-2442: - Summary: Airflow run command leaves database connections open, (was: Airflow run command leaves database connections open, which can hit the database limit) > Airflow run command leaves database connections open, > - > > Key: AIRFLOW-2442 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2442 > Project: Apache Airflow > Issue Type: Bug > Components: cli >Affects Versions: Airflow 1.8, 1.8.0 >Reporter: Alejandro Fernandez >Assignee: Alejandro Fernandez >Priority: Major > Fix For: Airflow 2.0 > > Attachments: connection_duration_1_hour.png, db_connections.png, > fixed_before_and_after.jpg, monthly_db_connections.png, running_tasks.png > > > *Summary* > The "airflow run" command creates a connection to the database and leaves it > open (until killed by SQLALchemy later). The number of these connections can > skyrocket whenever hundreds/thousands of tasks are launched simultaneously, > and potentially hit the database connection limit. > The problem is that in cli.py, the run() method first calls > {code:java} > settings.configure_orm(disable_connection_pool=True){code} > correctly > to use a NullPool, but then parses any custom configs and again calls > {code:java} > settings.configure_orm(){code} > , thereby overriding the desired behavior by instead using a QueuePool. > The QueuePool uses the default configs for SQL_ALCHEMY_POOL_SIZE and > SQL_ALCHEMY_POOL_RECYCLE. This means that while the task is running and the > executor is sending heartbeats, the sleeping connection is idle until it is > killed by SQLAlchemy. > This fixes a bug introduced by > [https://github.com/apache/incubator-airflow/pull/1934] in > [https://github.com/apache/incubator-airflow/pull/1934/commits/b380013634b02bb4c1b9d1cc587ccd12383820b6#diff-1c2404a3a60f829127232842250ff406R344] > > which is present in branches 1-8-stable, 1-9-stable, and 1-10-test > NOTE: Will create a PR once I've done more testing since I'm on an older > branch. For now, attaching a patch file [^AIRFLOW-2442.patch] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2442) Airflow run command leaves database connections open, which can hit the database limit
[ https://issues.apache.org/jira/browse/AIRFLOW-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alejandro Fernandez updated AIRFLOW-2442: - Attachment: (was: AIRFLOW-2442.patch) > Airflow run command leaves database connections open, which can hit the > database limit > -- > > Key: AIRFLOW-2442 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2442 > Project: Apache Airflow > Issue Type: Bug > Components: cli >Affects Versions: Airflow 1.8, 1.8.0 >Reporter: Alejandro Fernandez >Assignee: Alejandro Fernandez >Priority: Major > Fix For: Airflow 2.0 > > Attachments: connection_duration_1_hour.png, db_connections.png, > fixed_before_and_after.jpg, monthly_db_connections.png, running_tasks.png > > > *Summary* > The "airflow run" command creates a connection to the database and leaves it > open (until killed by SQLALchemy later). The number of these connections can > skyrocket whenever hundreds/thousands of tasks are launched simultaneously, > and potentially hit the database connection limit. > The problem is that in cli.py, the run() method first calls > {code:java} > settings.configure_orm(disable_connection_pool=True){code} > correctly > to use a NullPool, but then parses any custom configs and again calls > {code:java} > settings.configure_orm(){code} > , thereby overriding the desired behavior by instead using a QueuePool. > The QueuePool uses the default configs for SQL_ALCHEMY_POOL_SIZE and > SQL_ALCHEMY_POOL_RECYCLE. This means that while the task is running and the > executor is sending heartbeats, the sleeping connection is idle until it is > killed by SQLAlchemy. > This fixes a bug introduced by > [https://github.com/apache/incubator-airflow/pull/1934] in > [https://github.com/apache/incubator-airflow/pull/1934/commits/b380013634b02bb4c1b9d1cc587ccd12383820b6#diff-1c2404a3a60f829127232842250ff406R344] > > which is present in branches 1-8-stable, 1-9-stable, and 1-10-test > NOTE: Will create a PR once I've done more testing since I'm on an older > branch. For now, attaching a patch file [^AIRFLOW-2442.patch] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (AIRFLOW-2393) UI tree view struggles with large dags (60 tasks)x25 dag histories
[ https://issues.apache.org/jira/browse/AIRFLOW-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arthur Wiedmer resolved AIRFLOW-2393. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request #3279 [https://github.com/apache/incubator-airflow/pull/3279] > UI tree view struggles with large dags (60 tasks)x25 dag histories > -- > > Key: AIRFLOW-2393 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2393 > Project: Apache Airflow > Issue Type: Improvement > Components: ui >Affects Versions: 1.9.0 >Reporter: Badger >Priority: Major > Fix For: 2.0.0 > > > Hi, > We are noticing the tree view is taking a long time to render as our DAG has > become more complex. We will need to start breaking our dag apart in order to > continue to use the user interface. > The basic problem is that a reasonably complex DAG (60 operators) x the > standard 25 dag run histories on the tree view causes a 350MB json response > (compressed to 8MB) to be downloaded, this then needs the browser to render > it. > On quick observation this appears to be because, the response appears to > contain all meta-data for each task. > Is this something others think is a problem. We occasionally have to refresh > due to memory errors and have already increased the RAM allocated to the box. > A suggestion might be to load specific instance history when a user hovers > over the task, rather than exporting all of the history on page load. I'd > look at contributing a PR but haven't had chance to take a look at this area > of the code base. > Thanks -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (AIRFLOW-2393) UI tree view struggles with large dags (60 tasks)x25 dag histories
[ https://issues.apache.org/jira/browse/AIRFLOW-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468984#comment-16468984 ] ASF subversion and git services commented on AIRFLOW-2393: -- Commit 2a55ffe0cd2c04fde2eecdbeff2fa727067069e1 in incubator-airflow's branch refs/heads/master from Tao feng [ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=2a55ffe ] [AIRFLOW-2086][AIRFLOW-2393] Customize default dagrun number in tree view Closes #3279 from feng-tao/reduce-tree-view This introduces a new configuration variable to set the default number of dag runs displayed in the tree view. For large DAGs, this could cause timeouts in the webserver. > UI tree view struggles with large dags (60 tasks)x25 dag histories > -- > > Key: AIRFLOW-2393 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2393 > Project: Apache Airflow > Issue Type: Improvement > Components: ui >Affects Versions: 1.9.0 >Reporter: Badger >Priority: Major > > Hi, > We are noticing the tree view is taking a long time to render as our DAG has > become more complex. We will need to start breaking our dag apart in order to > continue to use the user interface. > The basic problem is that a reasonably complex DAG (60 operators) x the > standard 25 dag run histories on the tree view causes a 350MB json response > (compressed to 8MB) to be downloaded, this then needs the browser to render > it. > On quick observation this appears to be because, the response appears to > contain all meta-data for each task. > Is this something others think is a problem. We occasionally have to refresh > due to memory errors and have already increased the RAM allocated to the box. > A suggestion might be to load specific instance history when a user hovers > over the task, rather than exporting all of the history on page load. I'd > look at contributing a PR but haven't had chance to take a look at this area > of the code base. > Thanks -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (AIRFLOW-2086) The tree view page is too slow when display big dag.
[ https://issues.apache.org/jira/browse/AIRFLOW-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arthur Wiedmer resolved AIRFLOW-2086. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request #3279 [https://github.com/apache/incubator-airflow/pull/3279] > The tree view page is too slow when display big dag. > > > Key: AIRFLOW-2086 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2086 > Project: Apache Airflow > Issue Type: Improvement > Components: webserver >Reporter: Lintao LI >Priority: Major > Fix For: 2.0.0 > > > The tree view page is too slow for big(actually not too big) dag. > The page size will increase dramatically to hundreds of MB. > please refer to > [here|https://stackoverflow.com/questions/48656221/apache-airflow-webui-tree-view-is-too-slow] > for details. > I think the page contains a lot of redundant data. it's a bug or a flaw of > design. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (AIRFLOW-2086) The tree view page is too slow when display big dag.
[ https://issues.apache.org/jira/browse/AIRFLOW-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468983#comment-16468983 ] ASF subversion and git services commented on AIRFLOW-2086: -- Commit 2a55ffe0cd2c04fde2eecdbeff2fa727067069e1 in incubator-airflow's branch refs/heads/master from Tao feng [ https://git-wip-us.apache.org/repos/asf?p=incubator-airflow.git;h=2a55ffe ] [AIRFLOW-2086][AIRFLOW-2393] Customize default dagrun number in tree view Closes #3279 from feng-tao/reduce-tree-view This introduces a new configuration variable to set the default number of dag runs displayed in the tree view. For large DAGs, this could cause timeouts in the webserver. > The tree view page is too slow when display big dag. > > > Key: AIRFLOW-2086 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2086 > Project: Apache Airflow > Issue Type: Improvement > Components: webserver >Reporter: Lintao LI >Priority: Major > Fix For: 2.0.0 > > > The tree view page is too slow for big(actually not too big) dag. > The page size will increase dramatically to hundreds of MB. > please refer to > [here|https://stackoverflow.com/questions/48656221/apache-airflow-webui-tree-view-is-too-slow] > for details. > I think the page contains a lot of redundant data. it's a bug or a flaw of > design. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
incubator-airflow git commit: [AIRFLOW-2086][AIRFLOW-2393] Customize default dagrun number in tree view
Repository: incubator-airflow Updated Branches: refs/heads/master 2728138f1 -> 2a55ffe0c [AIRFLOW-2086][AIRFLOW-2393] Customize default dagrun number in tree view Closes #3279 from feng-tao/reduce-tree-view This introduces a new configuration variable to set the default number of dag runs displayed in the tree view. For large DAGs, this could cause timeouts in the webserver. Project: http://git-wip-us.apache.org/repos/asf/incubator-airflow/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-airflow/commit/2a55ffe0 Tree: http://git-wip-us.apache.org/repos/asf/incubator-airflow/tree/2a55ffe0 Diff: http://git-wip-us.apache.org/repos/asf/incubator-airflow/diff/2a55ffe0 Branch: refs/heads/master Commit: 2a55ffe0cd2c04fde2eecdbeff2fa727067069e1 Parents: 2728138 Author: Tao fengAuthored: Wed May 9 08:45:06 2018 -0700 Committer: Arthur Wiedmer Committed: Wed May 9 08:45:17 2018 -0700 -- UPDATING.md | 3 +++ airflow/config_templates/default_airflow.cfg | 3 +++ airflow/www/templates/airflow/dag.html | 2 +- airflow/www/templates/airflow/dags.html | 2 +- airflow/www/templates/airflow/list_dags.html | 2 +- airflow/www/views.py | 19 --- airflow/www_rbac/templates/airflow/dag.html | 2 +- airflow/www_rbac/templates/airflow/dags.html | 2 +- airflow/www_rbac/views.py| 14 +- 9 files changed, 32 insertions(+), 17 deletions(-) -- http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/UPDATING.md -- diff --git a/UPDATING.md b/UPDATING.md index defd95b..c9e1395 100644 --- a/UPDATING.md +++ b/UPDATING.md @@ -5,6 +5,9 @@ assists users migrating to a new version. ## Airflow Master +### Add a configuration variable(default_dag_run_display_number) to control numbers of dag run for display +Add a configuration variable(default_dag_run_display_number) under webserver section to control num of dag run to show in UI. + ### Default executor for SubDagOperator is changed to SequentialExecutor ### New Webserver UI with Role-Based Access Control http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/airflow/config_templates/default_airflow.cfg -- diff --git a/airflow/config_templates/default_airflow.cfg b/airflow/config_templates/default_airflow.cfg index b91961e..33b99ff 100644 --- a/airflow/config_templates/default_airflow.cfg +++ b/airflow/config_templates/default_airflow.cfg @@ -280,6 +280,9 @@ rbac = False # Define the color of navigation bar navbar_color = #007A87 +# Default dagrun to show in UI +default_dag_run_display_number = 25 + [email] email_backend = airflow.utils.email.send_email_smtp http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/airflow/www/templates/airflow/dag.html -- diff --git a/airflow/www/templates/airflow/dag.html b/airflow/www/templates/airflow/dag.html index ed84f27..18242d3 100644 --- a/airflow/www/templates/airflow/dag.html +++ b/airflow/www/templates/airflow/dag.html @@ -57,7 +57,7 @@ Graph View - + Tree View http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/airflow/www/templates/airflow/dags.html -- diff --git a/airflow/www/templates/airflow/dags.html b/airflow/www/templates/airflow/dags.html index d22bfb3..2397890 100644 --- a/airflow/www/templates/airflow/dags.html +++ b/airflow/www/templates/airflow/dags.html @@ -145,7 +145,7 @@ - + http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/airflow/www/templates/airflow/list_dags.html -- diff --git a/airflow/www/templates/airflow/list_dags.html b/airflow/www/templates/airflow/list_dags.html index e8533d7..c7f2497 100644 --- a/airflow/www/templates/airflow/list_dags.html +++ b/airflow/www/templates/airflow/list_dags.html @@ -147,7 +147,7 @@ - + http://git-wip-us.apache.org/repos/asf/incubator-airflow/blob/2a55ffe0/airflow/www/views.py -- diff --git a/airflow/www/views.py b/airflow/www/views.py index 6e2f1fc..c36d55f 100644 --- a/airflow/www/views.py +++ b/airflow/www/views.py @@ -7,9 +7,9 @@ # to you under the Apache
[jira] [Created] (AIRFLOW-2443) Fix doc links to pythonhosted
Anand created AIRFLOW-2443: -- Summary: Fix doc links to pythonhosted Key: AIRFLOW-2443 URL: https://issues.apache.org/jira/browse/AIRFLOW-2443 Project: Apache Airflow Issue Type: Bug Components: ui Affects Versions: 1.9.0 Reporter: Anand Fix For: Airflow 1.9.0 Airflow 1.9 UI still redirects to incorrect documentation link on pythonhosted.org, though seems to have been fixed in master in this bug - https://issues.apache.org/jira/browse/AIRFLOW-2115 . -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2250) execution_date logic seems broken in 1.9
[ https://issues.apache.org/jira/browse/AIRFLOW-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sulphide updated AIRFLOW-2250: -- Description: after upgrading to 1.9 new dags get execution_date's that are not aligned with our hardcoded start_date. among other things, this breaks our sensors... [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 in this case our ExternalTaskSensor has a start_date with a start_time of 2100 and is pointing at the upstream dag and task with an execution_delta of 1 hour. THE PROBLEM: the dagrun gets an execution_date of 2018-03-022T19:51:52.050448 but should be getting an execution_date of 2018-03-22T21:00:00 it seems that if we manually set the execution date on the first dag run then every subsequent dag run has the right execution date (so far), so this is a problem with generating an execution_date for a dag run with no history within the schedule interval. was: after upgrading to 1.9 some of our dags get execution_date's that are not aligned with our hardcoded start_date. this breaks our sensors, e.g. [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 in this case our ExternalTaskSensor has a start_date with a start_time of 2100 and is pointing at the upstream dag and task with an execution_delta of 1 hour. the problem: the dagrun gets an execution_date of 2018-03-022T19:51:52.050448 but should be getting an execution_date of 2018-03-22T21:00:00 it seems that if we manually set the execution date on the first dag run then every subsequent dag run has the right execution date (so far). > execution_date logic seems broken in 1.9 > > > Key: AIRFLOW-2250 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2250 > Project: Apache Airflow > Issue Type: Bug >Reporter: sulphide >Priority: Major > Attachments: Screen Shot 2018-03-23 at 4.47.03 PM.png > > > after upgrading to 1.9 new dags get execution_date's that are not aligned > with our hardcoded start_date. > among other things, this breaks our sensors... > [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: > [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for > upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 > > in this case our ExternalTaskSensor has a start_date with a start_time of > 2100 and is pointing at the upstream dag and task with an execution_delta of > 1 hour. > THE PROBLEM: the dagrun gets an execution_date of 2018-03-022T19:51:52.050448 > but should be getting an execution_date of 2018-03-22T21:00:00 > it seems that if we manually set the execution date on the first dag run then > every subsequent dag run has the right execution date (so far), so this is a > problem with generating an execution_date for a dag run with no history > within the schedule interval. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (AIRFLOW-2250) execution_date logic seems broken in 1.9
[ https://issues.apache.org/jira/browse/AIRFLOW-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sulphide updated AIRFLOW-2250: -- Description: after upgrading to 1.9 some of our dags get execution_date's that are not aligned with our hardcoded start_date. this breaks our sensors, e.g. [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 in this case our ExternalTaskSensor has a start_date with a start_time of 2100 and is pointing at the upstream dag and task with an execution_delta of 1 hour. the problem: the dagrun gets an execution_date of 2018-03-022T19:51:52.050448 but should be getting an execution_date of 2018-03-22T21:00:00 it seems that if we manually set the execution date on the first dag run then every subsequent dag run has the right execution date (so far). was: after upgrading to 1.9 some of our dags get execution_date's that are not aligned with our hardcoded start_date. this breaks our sensors, e.g. [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 in this case our ExternalTaskSensor has a start_date with a start_time of 2100 and is pointing at the upstream dag and task with an execution_delta of 1 hour. the problem: the task_instance gets an execution_date of 2018-03-022T19:51:52.050448 but should be getting an execution_date of 2018-03-22T21:00:00 it seems that if we manually set the execution date on the first dag run then every subsequent dag run has the right execution date (so far). > execution_date logic seems broken in 1.9 > > > Key: AIRFLOW-2250 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2250 > Project: Apache Airflow > Issue Type: Bug >Reporter: sulphide >Priority: Major > Attachments: Screen Shot 2018-03-23 at 4.47.03 PM.png > > > after upgrading to 1.9 some of our dags get execution_date's that are not > aligned with our hardcoded start_date. this breaks our sensors, e.g. > > [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: > [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for > upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 > > in this case our ExternalTaskSensor has a start_date with a start_time of > 2100 and is pointing at the upstream dag and task with an execution_delta of > 1 hour. > the problem: the dagrun gets an execution_date of 2018-03-022T19:51:52.050448 > but should be getting an execution_date of 2018-03-22T21:00:00 > it seems that if we manually set the execution date on the first dag run then > every subsequent dag run has the right execution date (so far). -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (AIRFLOW-2250) execution_date logic seems broken in 1.9
[ https://issues.apache.org/jira/browse/AIRFLOW-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468913#comment-16468913 ] sulphide commented on AIRFLOW-2250: --- we've noticed that even if we 'fix' a dag with the above workaround if we have a dag with catchup=False and more than a day has passed since the dag was turned on then the dag will create the broken execution dates when we turn it on. so this bug is related to the dag's ability to create its first execution date. > execution_date logic seems broken in 1.9 > > > Key: AIRFLOW-2250 > URL: https://issues.apache.org/jira/browse/AIRFLOW-2250 > Project: Apache Airflow > Issue Type: Bug >Reporter: sulphide >Priority: Major > Attachments: Screen Shot 2018-03-23 at 4.47.03 PM.png > > > after upgrading to 1.9 some of our dags get execution_date's that are not > aligned with our hardcoded start_date. this breaks our sensors, e.g. > > [2018-03-23 20:06:21,781] \{base_task_runner.py:98} INFO - Subtask: > [2018-03-23 20:06:21,780] \{sensors.py:243} INFO - Poking for > upstream_2017-12-06_2000.upstream_task on 2018-03-22T19:51:52.050448 > > in this case our ExternalTaskSensor has a start_date with a start_time of > 2100 and is pointing at the upstream dag and task with an execution_delta of > 1 hour. > the problem: the task_instance gets an execution_date of > 2018-03-022T19:51:52.050448 but should be getting an execution_date of > 2018-03-22T21:00:00 > it seems that if we manually set the execution date on the first dag run then > every subsequent dag run has the right execution date (so far). -- This message was sent by Atlassian JIRA (v7.6.3#76005)