mobuchowski opened a new pull request, #40078:
URL: https://github.com/apache/airflow/pull/40078

   This PR builds on https://github.com/apache/airflow/pull/39890
   
   After this change, OpenLineage will execute metadata extraction in separate, 
forked process. 
   It's a technique modeled to what interaction between `LocalTaskJobRunner` 
and `StandardTaskRunner` looks like - a process, in this case process of 
`StandardTaskRunner` watches over OpenLineage listener process during metadata 
extraction. 
   
   This adds a layer of isolation between task execution and OpenLineage, 
adding a level of assurance that OpenLineage execution does not interfere with 
task execution in a way other than taking time.
   Additionally, this allows us to add configurable timeout for OL execute 
methods.
   
   The reason for that is, beyond configurability, that sometimes metadata 
extraction code can hang - for example, when dealing with Snowflake connection 
issue https://github.com/snowflakedb/snowflake-connector-python/pull/1898 - and 
we want to give as much guarantees that OL will not cause task to fail.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@airflow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to