t oo created AIRFLOW-6335:
-----------------------------

             Summary: dag_processor_manager timeout logs should be ERROR not 
INFO/WARN
                 Key: AIRFLOW-6335
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-6335
             Project: Apache Airflow
          Issue Type: Improvement
          Components: DAG, scheduler
    Affects Versions: 1.10.6
            Reporter: t oo


I triggered a large DAG with 18k tasks, after 30mins still nothing started 
going to scheduled/queued/running state.

All airflow processes (scheduler, worker.etc) were up. I then went searching 
through the logs for ERROR, nothing came up! After a lot of digging i found 
below in dag_processor_manager.log:
||File Path||PID||Runtime||# DAGs||# Errors||Last Runtime||Last Run||
|/home/ec2-user/airflow/dags/redact1.py|31366|261.22s|0|-1|361.32s|2019-12-24T17:10:44|
|/home/ec2-user/airflow/dags/redact2.py| | |1|0|1.00s|2019-12-24T17:16:29|

[2019-12-24 17:40:48,739] \{dag_processing.py:1324} INFO - Processor for 
/home/ec2-user/airflow/dags/redact1.py with PID 17307 started at 
2019-12-24T17:34:47.417660+00:00 has timed out, killing it.
[2019-12-24 17:40:49,696] \{dag_processing.py:1324} INFO - Processor for 
/home/ec2-user/airflow/dags/redact1.py with PID 17307 started at 
2019-12-24T17:34:47.417660+00:00 has timed out, killing it.
[2019-12-24 17:40:49,697] \{dag_processing.py:1191} WARNING - Processor for 
/home/ec2-user/airflow/dags/redact1.py exited with return code -9.

 

Solution:

Change from INFO to ERROR:

[https://github.com/apache/airflow/blob/1.10.6/airflow/utils/dag_processing.py#L1321]

Change from WARN to ERROR:

[https://github.com/apache/airflow/blob/1.10.6/airflow/utils/dag_processing.py#L1189]

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to