Monitoring performance for releases

Maximilian Michels Thu, 09 Jul 2020 10:22:18 -0700

Hi,

We recently saw an increase in latency when migrating from Beam 2.18.0 to 2.21.0 (Python SDK with Flink Runner). This proved very hard to debug, and it looks like each version in between the two led to increased latency.

This is not the first time we have seen issues when migrating. Another time we had a decline in checkpointing performance and thus added a checkpointing test [1] and dashboard [2] (see checkpointing widget).

That makes me wonder if we should monitor performance (throughput / latency) for basic use cases as part of the release testing. Currently, our release guide [3] mentions running examples but not evaluating the performance. I think it would be good practice to check relevant charts with performance measurements as part of the release process. The release guide should reflect that.


WDYT?

-Max

PS: Of course, this requires tests and metrics to be available. This PR adds latency measurements to the load tests [4].



[1] https://github.com/apache/beam/pull/11558

[2] https://apache-beam-testing.appspot.com/explore?dashboard=5751884853805056

[3] https://beam.apache.org/contribute/release-guide/
[4] https://github.com/apache/beam/pull/12065
