Re: Performance regression testing?

Nicholas Nethercote Tue, 29 Nov 2005 09:17:24 -0800

On Mon, 28 Nov 2005, Joe Buck wrote:

On Mon, 28 Nov 2005, Mark Mitchell wrote:
We're collectively putting a lot of energy into performance improvements in GCC. Sometimes, a performance gain from one patch gets undone by another patch -- which is itself often doing something else beneficial. People have mentioned to me that we require people to run regression tests for correctness, but that we don't really have anything equivalent for performance.
It would be possible to detect performance regressions after the fact, but
soon enough to look at reverting patches.  For example, given multiple
machines doing SPEC benchmark runs every night, the alarm could be raised
if a significant performance regression is detected.  To guard against
noise from machine hiccups, two different machines would have to report
a regression to raise the alarm.  But the big problem is the non-freeness
of SPEC; ideally there would be a benchmark that ...
... everyone can download and run
... is reasonably fast
... is non-trivial
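
A minimal sketch of the two-machine rule described above, assuming each
nightly run yields one score per machine where higher is better. The
function names, the 5% threshold, and the report layout are all
hypothetical; nothing here comes from an existing GCC tool.

```python
def regressed(baseline, current, threshold=0.05):
    """Return True if `current` is worse than `baseline` by more than
    `threshold` (a fraction). Scores are assumed to be higher-is-better."""
    return current < baseline * (1.0 - threshold)


def should_raise_alarm(reports, threshold=0.05, min_machines=2):
    """Decide whether to raise the alarm for one nightly run.

    `reports` maps a machine name to a (baseline_score, current_score)
    pair. The alarm is raised only if at least `min_machines` machines
    independently see a regression, which filters out a hiccup on a
    single machine.
    """
    flagged = sorted(
        name
        for name, (base, cur) in reports.items()
        if regressed(base, cur, threshold)
    )
    return len(flagged) >= min_machines, flagged


if __name__ == "__main__":
    nightly = {"box-a": (100.0, 90.0), "box-b": (200.0, 180.0)}
    alarm, machines = should_raise_alarm(nightly)
    print("alarm" if alarm else "ok", machines)
```

Requiring agreement trades some sensitivity for fewer false alarms; a
regression that only shows up on one architecture would be missed by
this rule.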


Yes!  This would be very useful for other free software projects.

Another possible requirement is that the tests are not too large; it would be nice to include them in the source code of one's project for easier integration.

As a strawman, perhaps we could add a small integer program (bzip?) and
a small floating-point program to the testsuite, and have DejaGNU print
out the number of iterations of each that run in 10 seconds.
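
The strawman amounts to counting how many times a workload completes
within a fixed time budget. A rough sketch in Python rather than
DejaGNU, with a hypothetical `iterations_in` helper and Python's `bz2`
module standing in for the small integer program:

```python
import bz2
import time


def iterations_in(workload, seconds=10.0, clock=time.perf_counter):
    """Run `workload` repeatedly and return how many calls started
    before `seconds` elapsed. `clock` is injectable so the loop can be
    exercised without actually waiting."""
    deadline = clock() + seconds
    count = 0
    while clock() < deadline:
        workload()
        count += 1
    return count


def compress_workload(data=b"performance regression testing " * 4096):
    """Stand-in integer workload: compress a fixed buffer with bzip2."""
    return bz2.compress(data)


if __name__ == "__main__":
    # A short budget keeps the demo quick; the strawman suggests 10 seconds.
    print(iterations_in(compress_workload, seconds=1.0))
```

The last iteration may overrun the deadline, so the count is only
meaningful when a single iteration is short compared with the budget.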


Would that really catch much?

I've been thinking about this kind of thing recently for Valgrind. I was thinking that a combination of real programs and artificial microbenchmarks would be good. The microbenchmarks would be like the GCC (correctness) torture tests -- a collection of programs, added to over time, each one demonstrating a prior performance bug. You could start it off with a few tests containing things like key inner loops extracted from programs such as bzip2.

Measuring the programs and categorizing regressions is tricky. It's possible that the artificial tests would be small enough that any regression would be obvious (e.g. failing to remove that extra instruction would cause a 10% slowdown). And CSiBE-style graphing is very effective for seeing trends.
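
One way the microbenchmark comparison could look, as a sketch only: time
each test several times, keep the best run to reduce noise, and flag any
test that is more than 10% slower than a stored baseline. The helper
names and the dictionary layout are hypothetical, and the 10% tolerance
is just the figure from the example above.

```python
import time


def best_time(func, repeats=5, clock=time.perf_counter):
    """Return the shortest of `repeats` timings of func(), in seconds.
    Taking the minimum discards runs disturbed by other system activity."""
    best = float("inf")
    for _ in range(repeats):
        start = clock()
        func()
        best = min(best, clock() - start)
    return best


def find_regressions(baseline, current, tolerance=0.10):
    """Compare two {test_name: seconds} mappings.

    Returns {test_name: relative_slowdown} for every test whose current
    time exceeds its baseline by more than `tolerance`. Tests missing
    from `current` are skipped rather than reported.
    """
    slower = {}
    for name, old in baseline.items():
        new = current.get(name)
        if new is None or old <= 0:
            continue
        slowdown = (new - old) / old
        if slowdown > tolerance:
            slower[name] = slowdown
    return slower


if __name__ == "__main__":
    tests = {"inner_loop": lambda: sum(i * i for i in range(100000))}
    timings = {name: best_time(func) for name, func in tests.items()}
    # Comparing a run against itself reports no regressions.
    print(find_regressions(timings, timings))
```

Recording the per-test times for every revision, rather than only the
pass/fail result, is what would make trend graphs possible later.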


Nick
