Ping,

I am quite interested in this topic and am trying to understand how the
"scheduling delay" metric you propose differs from the "task latency"
(aka "task lag") metric which we have been using so far.

As you may recall, since Airflow 2.0 we have been using two specific
metrics to benchmark Scheduler performance: "task latency" and "task
throughput".
These were described in the 2.0 Scheduler blog post:
<https://www.astronomer.io/blog/airflow-2-scheduler/>
Specifically, in that post we defined task latency as the time it takes for
a task to begin executing once its dependencies are all met.
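
To make that definition concrete, here is a rough sketch in Python. It is
only an illustration: the function name, its arguments, and the sample
timestamps are hypothetical and not taken from Airflow, and it assumes the
dependencies count as "met" at the moment the last upstream task finishes.

```python
from datetime import datetime, timedelta


def task_latency(upstream_end_times, start_time):
    """Seconds between the last upstream task finishing and this task starting.

    Hypothetical helper for illustration; not part of the Airflow API.
    Assumes dependencies are met when the last upstream task ends.
    """
    deps_met_at = max(upstream_end_times)
    return (start_time - deps_met_at).total_seconds()


# Made-up timestamps: two upstream tasks, the later one ends 5s after t0,
# and the downstream task starts 7s after t0.
t0 = datetime(2022, 6, 8, 10, 0, 0)
upstream = [t0, t0 + timedelta(seconds=5)]
latency = task_latency(upstream, t0 + timedelta(seconds=7))
print(latency)
```

It would help to know which of these two timestamps the "scheduling delay"
metric measures differently.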

Thanks,
Vikram




On Wed, Jun 8, 2022 at 10:25 AM Ping Zhang <pin...@umich.edu> wrote:

> Hi Airflow Community,
>
> Airflow is a scheduling platform for data pipelines; however, there is no
> good metric for measuring scheduling delay in production or in the
> stress test environment. This makes it hard to catch regressions in the
> scheduler during the stress test stage.
>
> I would like to propose an airflow scheduling delay metric
> definition. Here is the detailed design of the metric and its
> implementation:
>
>
> https://docs.google.com/document/d/1NhO26kgWkIZJEe50M60yh_jgROaU84dRJ5qGFqbkNbU/edit?usp=sharing
>
> Please take a look and any feedback is welcome.
>
> Thanks,
>
> Ping
>
>
