> Non DAG, Non module assets as part of the DAG folder are out of scope. So say for example for some reason you include a GIF. This will not automatically be available without changes to your code.
What about SQL files a task uses, either as a template or via something else such as dbt? How about YAML based dag generators? (This might be mentioned in the wiki page, but it's not loading for me right now) -ash On 26 May 2024 08:55:11 BST, Bolke de Bruin <bdbr...@gmail.com> wrote: >Hi All, > >I would like to discuss a new AIP aimed at enhancing the DAG loading >mechanism to support reading DAGs from ephemeral storage solutions. This >proposal is intended to supersede AIP-5 Remote DAG Fetcher and provide a >more flexible and scalable approach and to prepare for AIP-63. > >https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-71+Generalizing+DAG+Loader+and+Processor+for+Ephemeral+Storage > >*Abstract* >This proposal aims to generalize the DAG loader and processor to use >pathlib.Path for file operations instead of assuming direct OS filesystem >access. It includes implementing a custom module loader that supports >loading from ObjectStoragePath locations and other Path-like abstractions, >with caching capabilities provided by fsspec. Furthermore, while this AIP >does not directly implement DAG versioning, it creates a foundational layer >that can be extended to support DAG versioning as outlined in AIP-63. > >A work in progress PR can be found here: >https://github.com/apache/airflow/pull/39647 > >*Key points for discussion* > >Previous proposals, like AIP-5, suggested using a Fetcher mechanism. Kind >of like an in-process git-sync. This proposal is about making that >redundant by fully supporting object storage locations by leveraging >ObjectStoragePath and fsspec caching mechanisms. > >Earlier feedback on AIP-5 was that we thought that having an additional >Fetcher process was out of scope of the project. With the transient >integration of pathlib.Path and ObjectStoragePath I think this argument >does not hold anymore and the demand is there. In addition the added >flexibility allows AIP-63 to be implemented easier (what that looks like >remains to be seen). > >Airflow scans DAGs often. This very likely requires a caching mechanism for >both the DAGs and their modules. Fsspec does implement caching, and it is >planned to leverage this. > >Non DAG, Non module assets as part of the DAG folder are out of scope. So >say for example for some reason you include a GIF. This will not >automatically be available without changes to your code. > >I kindly request your thoughts :-). > >Bolke > >-- > >-- >Bolke de Bruin >bdbr...@gmail.com