Good summary. Thanks, Kaxil! On Wed, Sep 16, 2020 at 5:12 PM Kaxil Naik <kaxiln...@apache.org> wrote: > > Hi all, > > I have created a document to summarize the discussion from our third dev > call for Airflow 2.0. > > Thank you all who joined the call. > > *Doc Link*: > https://cwiki.apache.org/confluence/display/AIRFLOW/Meeting+Notes#MeetingNotes-#4:14Sep2020 > > To all those who attended, can you please double-check and add if I have > missed anything? > > To all those who didn't join, if you disagree to anything in > the Summary please voice your opinion. > > Also please let me know if someone wants to include an item in Next call's > Agenda. > > Including the Summary here too (might potentially break formatting): > > *Key Decisions* > > - *Updates* > - Airflow v2-0-test branch > <https://github.com/apache/airflow/commits/v2-0-test> has already > been cut and currently manually rebased on top of the Master. > Currently, we > don't run CI as the branch is in-sync with Master. As soon as we > have a PR > / commit that we don't want to have it in 2.0 we will diverge v2-0-test > branch from Master and start running tests against it. > - The upgrade-check PR <https://github.com/apache/airflow/pull/9467> was > merged, we now need to define more rules to add more checks. > - *API* > - Progress: > - Project Board: https://github.com/apache/airflow/projects/1 > - The issues labelled with "Enhancement" are not a requirement > for 2.0 > - Endpoints: > - Task Instance Endpoint > <https://github.com/apache/airflow/pull/9597> is WIP, all the > other endpoints have been implemented. > - Permissions Model: > - On-going discussion on the PR > <https://github.com/apache/airflow/pull/10594> but close to > completion. > - The next piece of work to be done is migrating existing Views > to use resource-based permissions. (Github issue > <https://github.com/apache/airflow/issues/10469>). This is > mainly for standardizing the permissions model across API and UI. > - *Improvements to SubDags / Concept of TaskGroup* > - AIP-34 > > <https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-34+TaskGroup%3A+A+UI+task+grouping+concept+as+an+alternative+to+SubDagOperator> > | PR <https://github.com/apache/airflow/pull/10153> introduced the > concepts of TaskGroup and will be *included in Airflow 2.0*. > - The PR implements TaskGroups for Graph View, the Tree View will > be implemented in follow-up PRs. > - Follow-up items from the discussion: > - Discuss on mailing list whether we should deprecate SubDags in > favour of TaskGroup in 2.0 or wait until Airflow 2.1 or 2.2 > - Add docs around when to use TaskGroup vs SubDag and potentially > listing PROs and CONS. > - *Scheduler HA *(AIP-15 > > <https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=103092651> > ) > - A Draft PR <https://github.com/apache/airflow/pull/10956> has been > created to enable code reviews and to allow the members of the > community to > start testing it with various setups. > - To get the most benefit of Scheduler HA on MySQL, users will need > to use MySQL 8. This is because MySQL 5.7 does not support SKIP LOCK > > <https://dev.mysql.com/doc/refman/8.0/en/innodb-locking-reads.html#innodb-locking-reads-nowait-skip-locked>feature > but note that *MySQL 5.7 will still continue to work with at least > the same or improved performance as now*. > - Astronomer has done performance testing with different Scenarios > and will publish benchmarks over the coming weeks. Google Composer Team > + > Polidea said that they would be happy to carry out various tests for > Scheduler HA as well. > - There were some concerns raised around LOCKING Timeout periods and > the usage of DAG Serialization. More testing in the upcoming weeks > should > help mitigate any concerns and help fix the bugs if discovered. > - *Docs:* > - Explicitly mention that for HA Scheduler reads some of the > properties from serialized_dag table. Users can turn on/off DAG > Serialization in the Webserver but the Scheduler will > continue using it. > - Do we recommend 2 schedulers for Production deployments? > - X Schedulers vs single Scheduler. Use case when one would be > better than the other. > - Some kind of Bell Curve showing an increase in Schedulers > stops improving performance and maybe also degrades. This > is intended to > give guidance around what number of schedulers to run > based on expected > load, since this decision could be based on multiple factors. > - Follow up items: > - Create mailing list thread to discuss "Removing Pickling from > Airflow 2.0". Currently, pickled dags are only supported by > CeleryExecutor > and we have a flag on *airflow scheduler > <https://airflow.readthedocs.io/en/latest/cli-ref.html#scheduler> > *(--do-pickle) > and "--ship-dag" on *airflow tasks run > <https://airflow.readthedocs.io/en/latest/cli-ref.html#run> *command. > If we want to remove pickling Airflow 2.0 is the right time > or we shouldn't > do it until 3.0 > - *Helm Chart* > - We will continue focusing on getting Airflow 2.0 out so the first > official release of Helm Chart might need to wait. > - The issue with Helm Chart sources was fixed and there are no > blockers currently if we were to release it at some point in the near > future. > - Enhancements (but not blockers) are: > - Better Test Coverage with integration tests > - Docs pointing to the chart on the Airflow Website or the docsite > - The artifacts for the Helm chart would be published at > https://downloads.apache.org/airflow/ > - There is still an open question around *Helm Chart Versioning > Policy *i.e. do we want to tie-in Airflow Versions with Helm Chart? > Or do we just start from *1.0.0? * This needs to be decided before > the release of the Helm Chart. > > > > *Things to Discuss Next* > > - *21 September (Subject to Change)* > - Finish up open discussion items from the earlier meeting if not yet > resolved: > - Providers versioning, > - SubDag deprecation, > - Helm Chart release, > - REST API permissions > - Docs changes > - UI Changes for 2.0 > - Minimum effort changes: CSS/colours/spacing to make the UI look > a bit modern > - Process: > - When should we defer the in-scope items to post-2.0 > - Completion by a date? > - Progress by a date? > > > Regards, > Kaxil
-- Jarek Potiuk Polidea | Principal Software Engineer M: +48 660 796 129