Gary Harpaz created AIRFLOW-5818:
------------------------------------

             Summary: Very bad webserver performance when defining many dags 
with many operators
                 Key: AIRFLOW-5818
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-5818
             Project: Apache Airflow
          Issue Type: Bug
          Components: webserver
    Affects Versions: 1.10.5, 1.10.4, 1.10.3, 1.10.2, 1.10.1, 1.10.0, 1.9.0
            Reporter: Gary Harpaz
         Attachments: dup_dags.py, my_dag.template

In my scenario I have defined 500 dags, each dag has approximately 1500 
operators.

This makes webserver impossible to work with even when all dags are paused and 
nothing is running. The cpu spikes all the time and webserver consumes huge 
amounts of  memory for no reason.

To reproduce this use the attched my_dag.template file and duplicate it using 
the attached dup_dags.py script.

 

The root cause of this issue is that dagbag will load all dags into memory 
which takes huge cpu and memory unnecessarily. 

I have already fixed this in:

[https://github.com/gary-harpaz/airflow/tree/improve-performance]

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to