We run ~50k tasks a day at Airbnb. How many tasks/day are you planning on running?
You can archive the `task_instance` and `job` tables down the line, but that shouldn't be a concern until you hit tens of millions of entries. At that point you can set up a daily Airflow job that archives some of these entries. I believe we do it based on `start_date` and move rows to some other table in the same db.

Max

On Mon, Apr 3, 2017 at 5:30 PM, Edgardo Vega <edgardo.v...@gmail.com> wrote:

> I have been playing with airflow for a few days and it's not obvious what
> will happen down the road when we have lots of dags over a long period of
> time. I set a fake dag to run once a minute for a few days and everything
> seems okay except the graph view dropdown, which works but takes a few
> seconds to show up.
>
> Is there a way to roll older data out of the system in order to clean things
> visually and keep the database at a smallish size?
>
> --
> Cheers,
>
> Edgardo
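
For illustration, the archiving approach mentioned in the reply (move rows older than some `start_date` cutoff into another table in the same database) might look roughly like the sketch below. This is not Airbnb's actual job: it uses an in-memory SQLite database with a simplified, made-up `task_instance` schema, and the helper `archive_old_rows` and the `task_instance_archive` table name are hypothetical. A real deployment would run equivalent SQL against the Airflow metadata database from a daily DAG.

```python
import sqlite3
from datetime import datetime, timedelta


def archive_old_rows(conn, table, cutoff):
    """Move rows whose start_date is older than `cutoff` into <table>_archive.

    `table` is interpolated into the SQL, so it must be a trusted name.
    `cutoff` is an ISO-8601 string; ISO strings sort chronologically.
    """
    archive = f"{table}_archive"  # hypothetical naming convention
    with conn:  # one transaction: the copy and the delete commit together
        # Create an empty archive table with the same columns if it is missing.
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS {archive} AS SELECT * FROM {table} WHERE 0"
        )
        conn.execute(
            f"INSERT INTO {archive} SELECT * FROM {table} WHERE start_date < ?",
            (cutoff,),
        )
        deleted = conn.execute(
            f"DELETE FROM {table} WHERE start_date < ?", (cutoff,)
        ).rowcount
    return deleted


# Demo with a simplified stand-in for the real task_instance table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE task_instance (task_id TEXT, dag_id TEXT, start_date TEXT)"
)
now = datetime(2017, 4, 3)
rows = [
    ("t", "fake_dag", (now - timedelta(days=d)).isoformat()) for d in range(10)
]
conn.executemany("INSERT INTO task_instance VALUES (?, ?, ?)", rows)

# Keep the last 7 days, archive anything strictly older.
cutoff = (now - timedelta(days=7)).isoformat()
moved = archive_old_rows(conn, "task_instance", cutoff)
```

Doing the copy and the delete in one transaction means a failure partway through leaves the live table untouched, so the job can simply be retried on the next run.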