Scott asked me if I could take a look at the impact of changing HZ on some simple TCP performance tests. I ran the first couple and got some surprising results, so I thought I'd post about them and ask interested people to do some investigation of their own. The short version: we had speculated that the increased CPU overhead of a higher HZ would show up as a measurable performance cost, but in fact I measure improved performance under high HTTP load with a higher HZ. That benefit was, of course, the reason we first looked at increasing HZ: improved timer granularity helps the performance of network protocols such as TCP. Recent popular opinion has swung in the opposite direction, holding that the overhead of a higher HZ outweighs this benefit, and I think we should be cautious and do a lot more investigating before assuming that is true.
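
To make the granularity argument concrete, here's a quick standalone sketch (userland C, not kernel code) of how a millisecond timeout gets rounded up to whole ticks when scheduled on a tick-based clock; the interval values are made up for illustration:

/*
 * Sketch only: kernel timeouts are scheduled in whole ticks, so a
 * requested interval is rounded up to the next tick boundary.
 */
#include <stdio.h>

/* Round a millisecond interval up to ticks, as a tick-based timeout would. */
static int
ms_to_ticks(int ms, int hz)
{
        return (ms * hz + 999) / 1000;
}

int
main(void)
{
        const int intervals[] = { 3, 12, 200 };         /* made-up timeouts, ms */
        const int hzs[] = { 100, 1000 };

        for (int i = 0; i < 3; i++)
                for (int j = 0; j < 2; j++) {
                        int t = ms_to_ticks(intervals[i], hzs[j]);
                        printf("%3d ms at HZ=%-4d -> %3d ticks = %3d ms actual\n",
                            intervals[i], hzs[j], t, t * 1000 / hzs[j]);
                }
        return (0);
}

At HZ=100 the 3ms request rounds up to a full 10ms tick, while at HZ=1000 it stays at 3ms; that quantization is what limits how finely TCP timers can be scheduled.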

Simple performance results below. Two boxes on a gig-e network with if_em ethernet cards: one running a simple web server hosting 100-byte pages, the other downloading them in parallel (netrate/http and netrate/httpd). The performance difference is marginal, but at least in the SMP case it is likely more than measurement error or a cache alignment fluke. Results are transactions/second sustained over a 30-second test (bigger is better); the box is a dual Xeon P4 with HTT; 'vendor.*' are kernels with the default 7-CURRENT HZ setting (1000) and 'hz.*' are the HZ=100 versions of the same kernels. In any case, there was no obvious performance improvement from reducing HZ from 1000 to 100. Results may vary, use only as directed.
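
For anyone who wants to reproduce this: HZ is set at kernel build time via a kernel configuration option, and recent kernels can also override it at boot with a loader tunable, which should work on 7-CURRENT:

options HZ=100          (in the kernel config file; rebuild the kernel)
kern.hz="100"           (in /boot/loader.conf)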

What we might want to explore is using a programmable timer to set up high-precision timeouts, such as TCP timers, while keeping the statistics and profiling clocks and context switching at 100Hz. I think phk has previously proposed doing this with the HPET timer.
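
As a userland analogue of that model (arm a one-shot timer for a specific deadline instead of relying on a periodic tick), here's a sketch using POSIX timers; the in-kernel HPET design would of course look nothing like this, and the 3.2ms deadline is just an example (link with -lrt on systems that need it):

#include <signal.h>
#include <stdio.h>
#include <string.h>
#include <time.h>

int
main(void)
{
        sigset_t set;
        timer_t t;
        struct itimerspec its;
        int sig;

        /* Block SIGALRM and collect it synchronously with sigwait(). */
        sigemptyset(&set);
        sigaddset(&set, SIGALRM);
        sigprocmask(SIG_BLOCK, &set, NULL);

        /* A NULL sigevent means the timer delivers SIGALRM on expiry. */
        timer_create(CLOCK_MONOTONIC, NULL, &t);

        /*
         * One-shot: fire once, 3.2ms from now.  The deadline is expressed
         * in nanoseconds rather than ticks, so its precision is not bound
         * to HZ.
         */
        memset(&its, 0, sizeof(its));
        its.it_value.tv_nsec = 3200000;
        timer_settime(t, 0, &its, NULL);

        sigwait(&set, &sig);
        printf("one-shot timer expired\n");
        timer_delete(t);
        return (0);
}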

I'll run some more diverse tests today, such as raw bandwidth tests, UDP packets-per-second, and so on, and see where things sit. The reduced overhead of HZ=100 should be measurable in cases where the test is CPU-bound and, unlike with TCP, there is no clear benefit to more accurate timing, but it would be good to confirm that.
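
For the UDP pps runs I'll use the netrate tools, but for anyone who wants a feel for what the sender side measures, it boils down to roughly this sketch (the sink address and port below are made up):

#include <sys/socket.h>
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <time.h>
#include <unistd.h>

int
main(void)
{
        struct sockaddr_in sin;
        char payload[32];       /* small packets, to stress pps not bandwidth */
        long sent = 0;
        time_t start;

        int s = socket(AF_INET, SOCK_DGRAM, 0);
        if (s < 0)
                return (1);

        memset(&sin, 0, sizeof(sin));
        sin.sin_family = AF_INET;
        sin.sin_port = htons(9000);                     /* made-up sink port */
        inet_pton(AF_INET, "10.0.0.2", &sin.sin_addr);  /* made-up sink host */
        memset(payload, 0, sizeof(payload));

        /* Blast packets for 30 seconds, matching the TCP test duration. */
        start = time(NULL);
        while (time(NULL) - start < 30)
                if (sendto(s, payload, sizeof(payload), 0,
                    (struct sockaddr *)&sin, sizeof(sin)) > 0)
                        sent++;

        printf("%ld packets in 30s = %ld pps\n", sent, sent / 30);
        close(s);
        return (0);
}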

Robert N M Watson
Computer Laboratory
University of Cambridge


peppercorn:~/tmp/netperf/hz> ministat *SMP
x hz.SMP
+ vendor.SMP
+--------------------------------------------------------------------------+
|xx x xx   x       xx  x     +              +   +  +   +    ++ +         ++|
|  |_______A________|                     |_____________A___M________|     |
+--------------------------------------------------------------------------+
    N           Min           Max        Median           Avg        Stddev
x  10         13715         13793         13750       13751.1     29.319883
+  10         13813         13970         13921       13906.5     47.551726
Difference at 95.0% confidence
        155.4 +/- 37.1159
        1.13009% +/- 0.269913%
        (Student's t, pooled s = 39.502)

peppercorn:~/tmp/netperf/hz> ministat *UP
x hz.UP
+ vendor.UP
+--------------------------------------------------------------------------+
|x           x xx   x      xx+   ++x+   ++  * +    +                      +|
|         |_________M_A_______|___|______M_A____________|                  |
+--------------------------------------------------------------------------+
    N           Min           Max        Median           Avg        Stddev
x  10         14067         14178         14116       14121.2     31.279386
+  10         14141         14257         14170       14175.9     33.248058
Difference at 95.0% confidence
        54.7 +/- 30.329
        0.387361% +/- 0.214776%
        (Student's t, pooled s = 32.2787)
