Flávio Clésio created AIRFLOW-2620:
--------------------------------------

             Summary: New property called "RetainSameConnection"
                 Key: AIRFLOW-2620
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2620
             Project: Apache Airflow
          Issue Type: Wish
          Components: configuration, operators, redshift
    Affects Versions: Airflow 2.0
            Reporter: Flávio Clésio


Some ETL workloads rely heavily on reusing the same connection in the 
middle of the data flow, and part of this is achieved using temporary tables.

I performed a test [1] in Airflow, and the behaviour is that each command 
run through the "PostgresOperator" opens a new connection to the database. 
If the DAG opens a lot of connections, it can be a nightmare. 
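
To illustrate why a fresh connection per command breaks temporary tables, 
here is a minimal sketch. It uses Python's built-in sqlite3 module as a 
stand-in for Postgres/Redshift (an assumption made only so the example runs 
anywhere); the function name and the "staging" table are hypothetical. 
Temporary tables are scoped to the connection that created them, so a second 
connection cannot see them.

```python
import os
import sqlite3
import tempfile


def temp_table_visible_from_second_connection(db_path):
    """Create a temp table on one connection and probe it from another.

    Returns (row count seen by the creating connection,
             whether the second connection can see the table).
    """
    first = sqlite3.connect(db_path)
    second = sqlite3.connect(db_path)
    try:
        first.execute("CREATE TEMP TABLE staging (id INTEGER)")
        first.execute("INSERT INTO staging VALUES (1)")
        # Same connection: the temporary table is usable.
        same_conn_rows = first.execute(
            "SELECT COUNT(*) FROM staging"
        ).fetchone()[0]
        try:
            # Different connection: the temporary table does not exist here.
            second.execute("SELECT COUNT(*) FROM staging")
            visible = True
        except sqlite3.OperationalError:
            visible = False
        return same_conn_rows, visible
    finally:
        first.close()
        second.close()


with tempfile.TemporaryDirectory() as tmp_dir:
    result = temp_table_visible_from_second_connection(
        os.path.join(tmp_dir, "demo.db")
    )
```

Here "first" plays the role of one task's connection and "second" the role 
of the next task's connection.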

As I said on SO, when tables are processed in stages, reusing a temporary 
table across multiple tasks can be very useful. As a plus, it can remove a 
lot of pressure from the RDBMS, because you won't need to hit it several 
times.

That's why I propose a connection feature called "RetainSameConnection" 
that will keep the same connection open during the execution of the DAG.
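
As a rough sketch of the intended behaviour (not an existing Airflow API), 
the idea could look like a cache that hands back the same live connection 
for a given connection id. The class "RetainedConnections", its methods, and 
the "warehouse" id are all hypothetical, and sqlite3 again stands in for the 
real database. The sketch only covers steps running in a single process; 
Airflow tasks may run in separate processes or on separate workers, which a 
real implementation would have to address.

```python
import sqlite3


class RetainedConnections:
    """Hypothetical per-process cache: one live connection per id."""

    def __init__(self, factory):
        self._factory = factory  # callable: conn_id -> DB-API connection
        self._open = {}

    def get(self, conn_id):
        # Open the connection on first use, then keep returning it.
        if conn_id not in self._open:
            self._open[conn_id] = self._factory(conn_id)
        return self._open[conn_id]

    def close_all(self):
        # Called once, at the end of the whole run.
        for conn in self._open.values():
            conn.close()
        self._open.clear()


registry = RetainedConnections(lambda conn_id: sqlite3.connect(":memory:"))

# Step one creates and fills a temporary table.
step_one = registry.get("warehouse")
step_one.execute("CREATE TEMP TABLE staging (id INTEGER)")
step_one.execute("INSERT INTO staging VALUES (1), (2)")

# Step two gets the same connection back, so the table is still there.
step_two = registry.get("warehouse")
row_count = step_two.execute("SELECT COUNT(*) FROM staging").fetchone()[0]
same_object = step_one is step_two

registry.close_all()
```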
 
Reference: [1] - [My question on Stack Overflow|https://stackoverflow.com/questions/50858770/airflow-retain-the-same-database-connection/] about it



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
