What you seem to think would be better is to repeatedly update
SunSpider every time something gets faster, ignoring entirely
that the value of SunSpider is precisely that it has not changed.
Not quite what I'm saying :-)
I'd like benchmarks to:
a) have meaning even as browsers change over time
b) evolve. As new areas of JS (or whatever) become important,
the benchmark should have a way to include them.
Fair? Good? Bad?
It's not unreasonable, but it can't be done on a whim, and changes
cannot be made trivially. Both re-weighting SunSpider and adding new
tests as things are made faster are incredibly hard to do soundly,
because it is easy to end up obscuring meaningful data.
Take regex as an example: say SunSpider had been re-weighted for the
current generation of JS engines before anyone had looked at regex.
Regex would not have stood out as substantially slower, so it would
likely never have been investigated, leaving everyone with regex
performance an order of magnitude slower than current engines.
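
To make that concrete, here is a toy illustration of how re-weighting
an aggregate score can hide a slow subsystem. The numbers are
invented for the sake of the example, not actual SunSpider timings:

    // Hypothetical per-category times for an engine, in ms.
    var times = { string: 100, math: 95, regexp: 900 };

    // With equal weights the slow regexp category dominates the
    // total, so it stands out:
    var equalTotal = times.string + times.math + times.regexp; // 1095

    // Suppose the suite had instead been "rebalanced" so that
    // regexp contributes only a tenth as much:
    var weights = { string: 1.0, math: 1.0, regexp: 0.1 };
    var rebalanced = times.string * weights.string
                   + times.math * weights.math
                   + times.regexp * weights.regexp; // 285

    // A 10x regexp speedup saves 810 of 1095 ms (~74%) under equal
    // weights, but only 81 of 285 ms (~28%) after rebalancing;
    // the slow subsystem no longer screams for attention.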
That's why SunSpider has not been updated: after, what, a year and a
half (?), it can still show areas where performance can be improved,
and as long as it does that it remains useful.
So determining when it is sensible to update SunSpider is difficult.
You may be right, and rebalancing may reveal new areas where
performance can be improved; but if you're wrong, you run the risk of
turning the benchmark from a genuinely useful development tool into
something that is only good for producing a number at the end.
If we see one section of the test taking dramatically longer than
another, then we can assume that we have not been paying enough
attention to performance in that area; this is how we originally
noticed just how slow the regex engine was. If we had been
continually rebalancing the test over and over again, we would not
have noticed this or the other areas where performance could be
(and has been) improved. It would also break SunSpider as a means
of tracking and/or preventing performance regressions.
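
As a rough sketch of what that tracking amounts to (the file name and
the 5% threshold here are made up for illustration, not any actual
WebKit tooling): regression checking against a frozen suite is just
comparing a run to a stored baseline, and it only works if the suite
itself never moves:

    // Hypothetical regression check against a frozen suite.
    var fs = require('fs');

    function checkRegressions(baselineFile, current, threshold) {
      threshold = threshold || 1.05; // flag anything >5% slower
      var baseline = JSON.parse(fs.readFileSync(baselineFile, 'utf8'));
      var regressions = [];
      for (var test in baseline) {
        if (current[test] > baseline[test] * threshold)
          regressions.push(test);
      }
      return regressions;
    }

    // Rebalance or swap out the tests and every stored baseline
    // becomes incomparable, which is exactly the breakage described
    // above.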
Of course, iterating a benchmark does not prohibit using old versions
of it for regression testing.
But what happens when the benchmarks disagree about what counts as an
improvement? You can't improve performance against one benchmark
while testing for regressions with another.
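
A toy example of that conflict (all numbers invented): the same
change can read as a win on one suite and a regression on another,
and neither number tells you which to believe:

    // Times in ms before and after one hypothetical optimization.
    var suiteA = { before: 500, after: 460 };   // 8% faster: ship it?
    var suiteB = { before: 1100, after: 1160 }; // 5% slower: revert it?

    // Optimize against A while gating regressions on B, and any
    // change that shifts work between differently-weighted tests
    // can trigger this contradiction.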
--Oliver