On 9/20/12 2:37 PM, Jacob Carlborg wrote:
> On 2012-09-20 14:36, Andrei Alexandrescu wrote:
>> Let's use the minimum. It is understood it's not what you'll see in
>> production, but it is an excellent proxy for indicative and reproducible
>> performance numbers.
>
> Why not min, max and average?

For a very simple reason: unless the algorithm under benchmark is very long-running, the max is completely useless, and it ruins the average as well.

For virtually all benchmarks I've run, the distribution of timings is a half-Gaussian concentrated tightly around the minimum. Say the minimum is 73 µs. Then there will be a lot of results close to that; the mode of the distribution will be very near it, e.g. 75 µs, and the more measurements you take, the closer the mode gets to the minimum. Then you have a few timings up to, say, 90 µs. And finally you will inevitably get a few outliers in the milliseconds, orders of magnitude larger than anything of interest, caused by system interrupts that happened to fall in the middle of a measurement.

Folding those outliers into an average simply injects useless noise into the measurement. For example, a single 5 ms interrupt hit among a thousand ~75 µs runs drags the average up to roughly 80 µs, an error of about 7% that says nothing about the code under test.
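To make the point concrete, here is a minimal D sketch of the measure-many-times-and-keep-the-minimum approach. The helper name minTime is mine, not std.benchmark's API, and it assumes a current Phobos with std.datetime.stopwatch:

import core.time : Duration;
import std.datetime.stopwatch : AutoStart, StopWatch;
import std.stdio : writeln;

// Hypothetical helper, not std.benchmark's API: run `fun` `trials`
// times and keep only the best (minimum) observed timing.
Duration minTime(void delegate() fun, size_t trials = 1000)
{
    auto best = Duration.max;
    foreach (i; 0 .. trials)
    {
        auto sw = StopWatch(AutoStart.yes);
        fun();
        sw.stop();
        if (sw.peek < best)
            best = sw.peek;
    }
    return best;
}

void main()
{
    import std.algorithm.sorting : sort;

    auto data = new int[](10_000);
    foreach (i, ref x; data)
        x = cast(int) (data.length - i);

    // Sort a fresh copy on every trial so each run does the same work.
    writeln("min: ", minTime({ auto copy = data.dup; copy.sort(); }));
}

Only the best observation is kept, so an interrupt-inflated trial can never contaminate the reported number.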


Andrei

