Yes to both the below questions @Elad Kalif <elad...@apache.org>. The
upcoming Data-Awareness AIPs the first one and the 2nd should be covered by
AIP-66 once it is out of draft.

1. Should datasets be also versioned?
> 2. Should we support executing more than 1 DAG version at a given time?


On Tue, 28 May 2024 at 10:07, Elad Kalif <elad...@apache.org> wrote:

> I have a general question about (maybe somehow related to the DAG Bundle
> concept introduced in the AIPs)
> The way I see it DAGs are tightly coupled with Datasets. Tasks take
> dependency on dataset or/and produce a dataset.
> We are focused on the versions of the code (DAG) but to make this play
> nicely we should consider also applying versions to datasets.
> Granted not every change to DAG code means change in dataset version but we
> should consider if we want to leave datasets versionless.
>
> I previously worked with some data products that allow versioning of tables
> and it was really nice! It enabled the concept of Data Contract (treating
> tables much like you treat API) and it made things much easier.
> I sometimes even had two versions of the same workflow running one for the
> new version and one for the deprecated version thus allowing my customers
> the flexibility to migrate between the table versions before the deprecated
> version is discontinued.
>
> I am raising two main questions here:
> 1. Should datasets be also versioned?
> 2. Should we support executing more than 1 DAG version at a given time?
> (allow user to declare Draft/Production/Deprecated/Deleted) state for each
> version.
>
> On Wed, Mar 6, 2024 at 1:58 AM Jed Cunningham <jedcunning...@apache.org>
> wrote:
>
> > Hello everyone!
> >
> > I'm excited to start a discussion around DAG Versioning in Airflow. It's
> > been the most requested feature in the last 3 community surveys!
> >
> > AIP-63: DAG Versioning
> > <
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-63%3A+DAG+Versioning
> > >
> >
> > As this topic quickly becomes rather large, I've made AIP-63 an umbrella
> > AIP and split the specifics into separate AIPs:
> >
> > AIP-64: Keep TaskInstance try history
> > <
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-64%3A+Keep+TaskInstance+try+history
> > >
> > AIP-65: Improve DAG history in UI
> > <
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-65%3A+Improve+DAG+history+in+UI
> > >
> > [WIP] AIP-66: Execution of specific DAG code versions
> > <
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/%5BWIP%5D+AIP-66%3A+Execution+of+specific+DAG+versions
> > >
> >
> > AIP-64 and AIP-65 are ready to be discussed in depth, while AIP-66 is
> there
> > to provide an intentionally high level vision of what we may want to
> tackle
> > before Airflow's "DAG versioning" story is complete.
> >
> > Thanks,
> > Jed
> >
>

Reply via email to