Re: [gdal-dev] Official dataset for benchmarking GDAL I/O?

2024-02-25 Thread Kurt Schwehr via gdal-dev
As Even said, this is a really tough topic. I have tried some micro-benchmarking for small bits, and for short-term dev this is sort of ok. The biggest problem is getting a stable test env for benchmarking. Even a single-user machine doing only benchmarking is all over the place. And if you are benc
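
A minimal sketch of one common way to cope with the timing noise mentioned above: repeat the measurement and report the minimum, the median, and the relative spread rather than a single number. This is not taken from the thread. The helper names (`time_repeated`, `summarize`, `workload`) are hypothetical, and `workload` is a CPU-bound stand-in for whatever read call would actually be measured.

```python
import statistics
import time


def time_repeated(func, repeats=7, warmup=1):
    """Call func `warmup` times untimed, then return `repeats` wall-clock durations in seconds."""
    for _ in range(warmup):
        func()  # warm caches before measuring
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations):
    """Report min, median, and (max - min) / median as a rough noise indicator."""
    median = statistics.median(durations)
    spread = (max(durations) - min(durations)) / median if median > 0 else 0.0
    return {"min": min(durations), "median": median, "spread": spread}


def workload():
    # Hypothetical stand-in for the I/O call under test (assumption, not from the thread).
    return sum(i * i for i in range(50_000))


if __name__ == "__main__":
    print(summarize(time_repeated(workload)))
```

A large `spread` suggests the environment was too noisy for the run to be trusted, whatever the absolute timings were.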

Re: [gdal-dev] Official dataset for benchmarking GDAL I/O?

2024-02-25 Thread Adam Stewart via gdal-dev
Thanks Even, I think what I'm envisioning is more of an integration test than a unit test. We don't intend to use this in TorchGeo CI on every commit, only on PRs that we know may impact I/O (much less frequent than in GDAL). We would also run it before each release and publish performance metr

Re: [gdal-dev] Official dataset for benchmarking GDAL I/O?

2024-02-25 Thread Even Rouault via gdal-dev
Adam, Automated performance regression testing is probably one of the aspects of testing that could be enhanced. While the GDAL autotest suite is quite comprehensive functionally, performance testing has traditionally been a bit lagging. That said, this is an aspect we have improved latel

[gdal-dev] Official dataset for benchmarking GDAL I/O?

2024-02-25 Thread Adam Stewart via gdal-dev
Hi, Background: I'm the developer of the TorchGeo software library. TorchGeo is a machine learning library that heavily relies on GDAL (via rasterio/fiona) for satellite imagery I/O. One of our primary concerns is ensuring that we can load data from disk