Gary Harpaz created AIRFLOW-5818: ------------------------------------ Summary: Very bad webserver performance when defining many dags with many operators Key: AIRFLOW-5818 URL: https://issues.apache.org/jira/browse/AIRFLOW-5818 Project: Apache Airflow Issue Type: Bug Components: webserver Affects Versions: 1.10.5, 1.10.4, 1.10.3, 1.10.2, 1.10.1, 1.10.0, 1.9.0 Reporter: Gary Harpaz Attachments: dup_dags.py, my_dag.template
In my scenario I have defined 500 dags, each dag has approximately 1500 operators. This makes webserver impossible to work with even when all dags are paused and nothing is running. The cpu spikes all the time and webserver consumes huge amounts of memory for no reason. To reproduce this use the attched my_dag.template file and duplicate it using the attached dup_dags.py script. The root cause of this issue is that dagbag will load all dags into memory which takes huge cpu and memory unnecessarily. I have already fixed this in: [https://github.com/gary-harpaz/airflow/tree/improve-performance] -- This message was sent by Atlassian Jira (v8.3.4#803005)