[RESULT][VOTE] AIP-59 Performance Testing Framework

2024-06-03 Thread Bartosz Jankiewicz
Apache Airflow Performance tests framework has been accepted.
5 "+1" binding votes received:
- Scheffler Jens
- Jarek Potiuk
- Vincent Beck
- Amogh Desai
- Xiaodong DENG
6 "+1" non-binding votes received:
- Rajeshwar Bishundeo
- Aritra Basu
- Wei Lee
- Rahul Vats
- Mehta, Shu

Re: [VOTE] AIP-59 Performance testing framework

2024-03-18 Thread Hussain, Syed
+1 (non-binding) From: Bartosz Jankiewicz Sent: Tuesday, March 12, 2024 11:40:13 PM To: dev@airflow.apache.org Subject: [VOTE] AIP-59 Performance testing framework

Re: [VOTE] AIP-59 Performance testing framework

2024-03-14 Thread Mehta, Shubham
> > Thanks & Regards, > Amogh Desai > > On Thu, Mar 14, 2024 at 12:28 AM Rahul Vats <mailto:rah.sharm...@gmail.com> wrote: > >> +1 (non-binding) >> >> On Tue, 12 Mar, 2024, 23:40 Bartosz Jankiewicz, >> <mailto:bjankiew...@google.com.invalid>

Re: [VOTE] AIP-59 Performance testing framework

2024-03-14 Thread Xiaodong (XD) DENG
gh Desai > > On Thu, Mar 14, 2024 at 12:28 AM Rahul Vats wrote: > >> +1 (non-binding) >> >> On Tue, 12 Mar, 2024, 23:40 Bartosz Jankiewicz, >> wrote: >> >>> Hi folks, >>> >>> The AIP for performance testing has been in review fo

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Amogh Desai
+1 binding Glad to see this happen. Go for it! Thanks & Regards, Amogh Desai On Thu, Mar 14, 2024 at 12:28 AM Rahul Vats wrote: > +1 (non-binding) > > On Tue, 12 Mar, 2024, 23:40 Bartosz Jankiewicz, > wrote: > > > Hi folks, > > > > The AIP for perfo

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Rahul Vats
+1 (non-binding) On Tue, 12 Mar, 2024, 23:40 Bartosz Jankiewicz, wrote: > Hi folks, > > The AIP for performance testing has been in review for quite some time and > I've included your feedback in the document. > > I'd like to call a vote, and if you agree I

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Wei Lee
<mailto:jens.scheff...@de.bosch.com> > > > > > > Registered office: Stuttgart, Registration court: Amtsgericht Stuttgart, HRB 14000; > > > Chairman of the Supervisory Board: Prof. Dr. Stefan Asenkerschbaumer; > > > Managing Directors: Dr. Stefan Hartung, Dr. Christian Fischer, Dr

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Aritra Basu
mer; > > Managing Directors: Dr. Stefan Hartung, Dr. Christian Fischer, Dr. Markus > Forschner, > > Stefan Grosch, Dr. Markus Heyn, Dr. Frank Meyer, Dr. Tanja Rückert > > > > -Original Message- > > From: Jarek Potiuk <mailto:ja...@potiuk.com> > >

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Bishundeo, Rajeshwar
> -Original Message- > From: Jarek Potiuk <mailto:ja...@potiuk.com> > Sent: Wednesday, 13 March 2024 10:05

RE: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Vincent Beck
> -Original Message- > From: Jarek Potiuk > Sent: Wednesday, 13 March 2024 10:05 > To: dev@airflow.apache.org > Subject: R

RE: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Scheffler Jens (XC-AS/EAE-ADA-T)
- From: Jarek Potiuk Sent: Wednesday, 13 March 2024 10:05 To: dev@airflow.apache.org Subject: Re: [VOTE] AIP-59 Performance testing framework +1 (binding) On Wed, Mar 13, 2024 at 7:40 AM Bartosz Jankiewicz wrote: > > Hi folks, > > The AIP for performance testing has been in revi

Re: [VOTE] AIP-59 Performance testing framework

2024-03-13 Thread Jarek Potiuk
+1 (binding) On Wed, Mar 13, 2024 at 7:40 AM Bartosz Jankiewicz wrote: > > Hi folks, > > The AIP for performance testing has been in review for quite some time and > I've included your feedback in the document. > > I'd like to call a vote, and if you agree

[VOTE] AIP-59 Performance testing framework

2024-03-12 Thread Bartosz Jankiewicz
Hi folks, The AIP for performance testing has been in review for quite some time and I've included your feedback in the document. I'd like to call a vote, and if you agree I'd start a development of the framework. The AIP can be found below: https://cwiki.apache.org/confluence/

Re: Performance.

2022-02-01 Thread Jarek Potiuk
thread. I propose you open separate threads for each of the questions you >> have (in Github Discussions) this is a much more efficient way of asking >> for help. But Again DevList is likely not the best place to ask those >> questions. >> >> J >> >> >> On

Re: Performance.

2022-02-01 Thread Giorgio Zoppi
>> Hi devs, >> I have tried multiple LocalExecutors in 16GB vm and sometimes >> I have experienced workers consuming a lot of memory, Is this normal? >> Is there any incoming activity to improve worker performance for python >> operators? >> Is there any java/go operator available? >> Best Regards, >> Giorgio >> >> >> >> -- Life is a chess game - Anonymous.

Re: Performance.

2022-02-01 Thread Jarek Potiuk
in 16GB vm and sometimes > I have experienced workers consuming a lot of memory, Is this normal? > Is there any incoming activity to improve worker performance for python > operators? > Is there any java/go operator available? > Best Regards, > Giorgio > > > >

Performance.

2022-02-01 Thread Giorgio Zoppi
Hi devs, I have tried multiple LocalExecutors in 16GB vm and sometimes I have experienced workers consuming a lot of memory, Is this normal? Is there any incoming activity to improve worker performance for python operators? Is there any java/go operator available? Best Regards, Giorgio

[RESULT] [VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-30 Thread Ash Berlin-Taylor
This vote has passed with 5 +1 votes and no -1 votes. Work will begin on this soon :) -ash On Mar 19 2020, at 8:53 pm, Maxime Beauchemin wrote: > +1 (binding) > > Solid work! > > On Tue, Mar 17, 2020 at 10:43 PM Jarek Potiuk > wrote: > >> +1 (binding) >> >> On Tue, Mar 17, 2020 at 11:16 P

Re: [VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-19 Thread Maxime Beauchemin
+1 (binding) Solid work! On Tue, Mar 17, 2020 at 10:43 PM Jarek Potiuk wrote: > +1 (binding) > > On Tue, Mar 17, 2020 at 11:16 PM Kaxil Naik wrote: > > > > +1 (binding) > > > > On Tue, Mar 17, 2020 at 10:06 PM Deng Xiaodong > wrote: > > > > > +1 (binding). > > > > > > Thanks for proceeding th

Re: [VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-17 Thread Jarek Potiuk
+1 (binding) On Tue, Mar 17, 2020 at 11:16 PM Kaxil Naik wrote: > > +1 (binding) > > On Tue, Mar 17, 2020 at 10:06 PM Deng Xiaodong wrote: > > > +1 (binding). > > > > Thanks for proceeding this AIP, Ash. > > > > > > XD > > > > On Tue, Mar 17, 2020 at 22:40 Ash Berlin-Taylor wrote: > > > > > Hi

Re: [VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-17 Thread Kaxil Naik
+1 (binding) On Tue, Mar 17, 2020 at 10:06 PM Deng Xiaodong wrote: > +1 (binding). > > Thanks for proceeding this AIP, Ash. > > > XD > > On Tue, Mar 17, 2020 at 22:40 Ash Berlin-Taylor wrote: > > > Hi all, > > > > This email calls for a vote on the design proposed in AIP-15, found here > > > >

Re: [VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-17 Thread Deng Xiaodong
+1 (binding). Thanks for proceeding this AIP, Ash. XD On Tue, Mar 17, 2020 at 22:40 Ash Berlin-Taylor wrote: > Hi all, > > This email calls for a vote on the design proposed in AIP-15, found here > > > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=103092651 > > A few notes

[VOTE] AIP-15: Support Multiple-Schedulers for HA & Better Scheduling Performance

2020-03-17 Thread Ash Berlin-Taylor
Hi all, This email calls for a vote on the design proposed in AIP-15, found here https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=103092651 A few notes - The proposed architecture is to use an active/active architecture where each scheduler is fully capable - Nothing in thi

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-17 Thread Ash Berlin-Taylor
ss stable and predictable than PostgreSQL and this can > > > >cause > > > various stability problems. I know that Astronomer uses > >PostgreSQL, > > but > > > Airflow supports RDMS in a production environment and > >both must work > > > p

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-16 Thread Dan Davydov
> various stability problems. I know that Astronomer uses > >PostgreSQL, > but > > Airflow supports RDMS in a production environment and > >both must work > > properly in this AIP. > > > >Best regards, > >Kamil > > > >On Tue, Mar 3, > >

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-16 Thread Deng Xiaodong
> >both must work > properly in this AIP. > > > >Best regards, > >Kamil > > > >On Tue, Mar 3, > 2020 at 12:50 PM Kaxil Naik wrote: > >> > >> Good work on the Proposal Ash > & Vikram. > >> > >> > >>

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-16 Thread Ash Berlin-Taylor
2020 at 12:50 PM Kaxil > Naik wrote: > >> > >> Good work on the Proposal Ash & Vikram. > >> > >> > >> > > >> On Fri, Feb 28, 2020 at 10:39 PM Vikram Koka > > > >> wrote: > >> > >> > > Team,

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-03 Thread Kaxil Naik
regards, > >Kamil > > > >On Tue, Mar 3, 2020 at 12:50 PM Kaxil Naik wrote: > >> > >> Good work on the Proposal Ash & Vikram. > >> > >> > >> > >> On Fri, Feb 28, 2020 at 10:39 PM Vikram Koka > > > >> wrote: > >>

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-03 Thread Ash Berlin-Taylor
e, Mar 3, 2020 at 12:50 PM Kaxil Naik wrote: >> >> Good work on the Proposal Ash & Vikram. >> >> >> >> On Fri, Feb 28, 2020 at 10:39 PM Vikram Koka > >> wrote: >> >> > Team, >> > >> > >> > >>

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-03 Thread Kamil Breguła
> Team, > > > > > > > > We just updated 'AIP-15 Support Multiple-Schedulers for HA & Better > > Scheduling Performance' on Confluence and would very much appreciate > > feedback and suggestions from the community. > > > > >

Re: [PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-03-03 Thread Kaxil Naik
Good work on the Proposal Ash & Vikram. On Fri, Feb 28, 2020 at 10:39 PM Vikram Koka wrote: > Team, > > > > We just updated 'AIP-15 Support Multiple-Schedulers for HA & Better > Scheduling Performance' on Confluence and would very much appreciate > feed

[PROPOSAL][AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance]

2020-02-28 Thread Vikram Koka
Team, We just updated 'AIP-15 Support Multiple-Schedulers for HA & Better Scheduling Performance' on Confluence and would very much appreciate feedback and suggestions from the community. https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=103092651 The original

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-27 Thread Kamil Breguła
Yes. Here is the PR: https://github.com/apache/airflow/pull/6697 It made this work easier. On Thu, Feb 27, 2020 at 12:18 PM Ash Berlin-Taylor wrote: > We're just finalizing the AIP document around scheduler HA and hope to > have it published by the end of this week. > > DagFileProcessor is a new cl

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-27 Thread Ash Berlin-Taylor
We're just finalizing the AIP document around scheduler HA and hope to have it published by the end of this week. DagFileProcessor is a new class, but the code in it is not new -- it's the same, gradually evolving scheduler and parsing code, but refactored/moved to live elsewhere, right? -ash

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-26 Thread Kamil Breguła
Hello, In my opinion, it will be easier because the database will be less loaded, but more importantly, I added wait_for_update in one method, which means that another scheduler will not be able to damage the state of the database. Most of these changes even streamline HA, because more things are

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-26 Thread Maxime Beauchemin
Hey, I wanted to echo the awesomeness once more, but also bring up the question as to whether any of this work may make it harder to distribute / HA the scheduler down the line (?) I almost started analyzing the code and thought it'd just be easier to ask the authors. Max On Wed, Feb 26, 2020 at

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-26 Thread Felix Uellendall
Really awesome indeed! I hadn’t enough time to look into all of it yet but will definitely do. Thanks Polidea and Databand for your efforts you put into this! Kamil especially! -Felix Sent from ProtonMail Mobile On Wed, Feb 26, 2020 at 08:54, Sumit Maheshwari wrote: > Awesome work guys!! Ku

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-25 Thread Sumit Maheshwari
Awesome work guys!! Kudos to all of you 👏 On Wed, Feb 26, 2020 at 6:59 AM Jiajie Zhong wrote: > Good work! Thanks Kamil > > Best Wish > — Jiajie

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-25 Thread Jiajie Zhong
Good work! Thanks Kamil Best Wish — Jiajie

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-25 Thread Maxime Beauchemin
hose optimizations into our > > > > production environment. > > > > > > > > Thanks, > > > > QP Hou > > > > > > > > On Mon, Feb 24, 2020 at 8:35 AM Kamil Breguła < > > kamil.breg...@polidea.com > > > > >

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-25 Thread Robin Edwards
AM Kamil Breguła < > kamil.breg...@polidea.com > > > > > > wrote: > > > > > > > > Hello, > > > > > > > > Polidea [1] together with Databand [2] has taken steps to optimize > > > > scheduler performance. >

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Driesprong, Fokko
> On Mon, Feb 24, 2020 at 8:35 AM Kamil Breguła > > > wrote: > > > > > > Hello, > > > > > > Polidea [1] together with Databand [2] has taken steps to optimize > > > scheduler performance. > > > I made many changes last weekend: >

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Tao Feng
eploy those optimizations into our > production environment. > > Thanks, > QP Hou > > On Mon, Feb 24, 2020 at 8:35 AM Kamil Breguła > wrote: > > > > Hello, > > > > Polidea [1] together with Databand [2] has taken steps to optimize > > scheduler performan

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Qingping Hou
] has taken steps to optimize > scheduler performance. > I made many changes last weekend: > 1. [AIRFLOW-6856] Bulk fetch paused_dag_ids > https://github.com/apache/airflow/pull/7476 > 2. [AIRFLOW-6857] Bulk sync DAGs > https://github.com/apache/airflow/pull/7477 > 3. [AIRFLOW-686

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Evgeny Shulman
> wrote: > > > Hello, > > > > Polidea [1] together with Databand [2] has taken steps to optimize > > scheduler performance. > > I made many changes last weekend: > > 1. [AIRFLOW-6856] Bulk fetch paused_dag_ids > > https://github.com/apache/airflow/pull/74

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Jarek Potiuk
Those are all great improvements Kamil! It would be great to have them reviewed, tested and merged for 2.0 ! J. On Mon, Feb 24, 2020 at 5:35 PM Kamil Breguła wrote: > Hello, > > Polidea [1] together with Databand [2] has taken steps to optimize > scheduler performance. > I ma

Re: Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Tomasz Urbaszek
imize > scheduler performance. > I made many changes last weekend: > 1. [AIRFLOW-6856] Bulk fetch paused_dag_ids > https://github.com/apache/airflow/pull/7476 > 2. [AIRFLOW-6857] Bulk sync DAGs > https://github.com/apache/airflow/pull/7477 > 3. [AIRFLOW-6862] Do not check the fresh

Big performance optimization of Scheduler - 10x faster , 2000+ fewer queries count

2020-02-24 Thread Kamil Breguła
Hello, Polidea [1] together with Databand [2] has taken steps to optimize scheduler performance. I made many changes last weekend: 1. [AIRFLOW-6856] Bulk fetch paused_dag_ids https://github.com/apache/airflow/pull/7476 2. [AIRFLOW-6857] Bulk sync DAGs https://github.com/apache/airflow/pull/7477

Re: [DISCUSS] Performance aspects of Airflow (2.0 + 1.10 backport)

2019-12-27 Thread Jarek Potiuk
> > > As I mentioned in that article I think it's critical both for performance > testing of changes, and "regression" testing of releases that we have a > repeatable performance test framework. I somewhat simplified the script I > used in the article, so I need

Re: Test fixtures for repeatable (performance) tests?

2019-12-13 Thread Kaxil Naik
3:42 PM Ash Berlin-Taylor wrote: > As part of my performance testing work I created a simple driver script, > one of the steps it did was to "reset" the DB (at least for a given dag) so > that the code was operating on the same rows in the same state each time -- > I ran

Test fixtures for repeatable (performance) tests?

2019-12-13 Thread Ash Berlin-Taylor
As part of my performance testing work I created a simple driver script, one of the steps it did was to "reset" the DB (at least for a given dag) so that the code was operating on the same rows in the same state each time -- I ran all my tests about 10 times to get mean and variance.
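
The driver script itself is not shown in this thread, so the following is only a minimal sketch of the pattern the message describes: reset state, run the code under test, repeat about 10 times, and report mean and variance. The names `benchmark`, `reset_state` and `run_once` are hypothetical and are not Airflow APIs; in a real setup `reset_state` would presumably restore the database rows for the DAG under test.

```python
import statistics
import time
from typing import Callable, List, Tuple


def benchmark(
    reset_state: Callable[[], None],
    run_once: Callable[[], None],
    repeats: int = 10,
) -> Tuple[float, float, List[float]]:
    """Time run_once() `repeats` times, calling reset_state() before each run.

    Returns (mean, sample variance, individual durations) in seconds.
    """
    durations: List[float] = []
    for _ in range(repeats):
        # Hypothetical hook: put the system back into the same starting
        # state so that every run operates on identical data.
        reset_state()
        start = time.perf_counter()
        run_once()
        durations.append(time.perf_counter() - start)

    mean = statistics.mean(durations)
    # Sample variance needs at least two data points.
    variance = statistics.variance(durations) if len(durations) > 1 else 0.0
    return mean, variance, durations


if __name__ == "__main__":
    # Stand-in workload: an in-memory dict plays the role of the database.
    state = {"rows": []}

    def reset() -> None:
        state["rows"] = list(range(1000))

    def work() -> None:
        state["rows"] = [r * 2 for r in state["rows"]]

    mean, variance, _ = benchmark(reset, work, repeats=10)
    print(f"mean={mean:.6f}s variance={variance:.3e}")
```

Keeping the reset step outside the timed region means only the code under test contributes to the measurement, and reporting variance alongside the mean makes it easier to tell a real regression from run-to-run noise.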

Re: [DISCUSS] Performance aspects of Airflow (2.0 + 1.10 backport)

2019-12-11 Thread Ash Berlin-Taylor
This is obviously a subject dear to my heart :) As I mentioned in that article I think it's critical both for performance testing of changes, and "regression" testing of releases that we have a repeatable performance test framework. I somewhat simplified the script I used in th

[DISCUSS] Performance aspects of Airflow (2.0 + 1.10 backport)

2019-12-11 Thread Jarek Potiuk
*TL;DR;* We are gearing up @ Polidea to work on Apache Airflow performance and I wanted to start a discussion that might lead to creating a new AIP and implementing it :). Here is a high-level summary of the discussions we had so far, so this might be a starting point to get some details worked