"meaningful" spinlock contention when bound to non-intr CPU?

Rick Jones Thu, 01 Feb 2007 11:44:46 -0800

For various nefarious porpoises relating to comparing and contrasting a single 10G NIC with N 1G ports and hopefully finding interesting processor cache (mis)behaviour in the stack, I got my hands on a pair of 8 core systems with plenty of RAM and I/O slots. (rx6600 with 1.6 GHz dual-core Itanium2, aka Montecito)


A 2.6.10-rc5 kernel went onto each system, thanks to pointers from Dan Frazier.

Into each went a quartet of dual-port 1G NICs driven by e1000 7.3.15-k2-NAPI and I connected them back to back. I tweaked smp_affinity to have each port's interrupts go to a separate core.
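For reference, steering each port's interrupts to its own core comes down to writing a one-bit hex CPU mask into /proc/irq/<irq>/smp_affinity. A minimal Python sketch of that bookkeeping follows. The IRQ numbers are made up for illustration (the real ones would come from /proc/interrupts), and the function names are hypothetical.

```python
def cpu_mask(cpu):
    """Return the hex bitmask string selecting a single CPU (bit N = CPU N)."""
    if cpu < 0:
        raise ValueError("cpu index must be non-negative")
    return format(1 << cpu, "x")


def affinity_plan(irqs):
    """Give each IRQ its own CPU, in order: first IRQ -> CPU 0, and so on."""
    return {irq: cpu_mask(cpu) for cpu, irq in enumerate(irqs)}


def apply_plan(plan, proc_root="/proc/irq"):
    """Write each mask to <proc_root>/<irq>/smp_affinity (needs root)."""
    for irq, mask in plan.items():
        with open(f"{proc_root}/{irq}/smp_affinity", "w") as f:
            f.write(mask)


# Hypothetical IRQ numbers for the eight e1000 ports (assumption, not from
# the test systems).
HYPOTHETICAL_IRQS = [48, 49, 50, 51, 52, 53, 54, 55]
plan = affinity_plan(HYPOTHETICAL_IRQS)

for irq, mask in plan.items():
    print(f"irq {irq} -> smp_affinity {mask}")
```

apply_plan() is deliberately not called here; it would only make sense as root on a system where those IRQs exist.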


Netperf2 configured with --enable-burst.

When I run eight concurrent netperf TCP_RR tests, each doing 24 concurrent single-byte transactions (test-specific -b 24), TCP_NODELAY set (test-specific -D), and bind each netserver/netperf to the same CPU as is taking the interrupts of the NIC handling that connection (global -T), I see things looking pretty good. Decent aggregate transactions per second, and nothing in the CPU profiles to suggest spinlock contention.
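To make the option placement concrete, here is a rough Python sketch that builds the eight command lines. The netserver addresses are placeholders, and the sketch assumes port i is interrupting CPU i on both systems; only the netperf options themselves (-t, -T, -b, -D) come from the description above.

```python
def netperf_argv(host, cpu, burst=24):
    """Build one netperf TCP_RR command line.

    Global options go before "--", test-specific options after it.
    """
    return [
        "netperf",
        "-H", host,             # remote netserver (placeholder address)
        "-t", "TCP_RR",
        "-T", f"{cpu},{cpu}",   # bind netperf and netserver to this CPU
        "--",
        "-b", str(burst),       # burst size, needs --enable-burst
        "-D",                   # set TCP_NODELAY
    ]


# Placeholder addressing: one subnet per back-to-back link (assumption).
commands = [netperf_argv(f"192.168.{i}.2", i) for i in range(8)]

for argv in commands:
    print(" ".join(argv))
```

For the "bound elsewhere" case the same helper could be called with a CPU other than the one taking the port's interrupts, for example (i + 1) % 8.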

Happiness and joy. An N CPU system behaving (at this level at least) like N 1-CPU systems.

When I then decide to bind the netperf/netservers to CPU(s) other than the ones taking the interrupts from the NIC(s), the aggregate transactions per second drops by roughly 40/135 or ~30%. I was indeed expecting a delta - no idea if that is in the realm of "to be expected" - but decided to go ahead and look at the profiles.

The profiles (either via q-syscollect or caliper) show upwards of 3% of the CPU consumed by spinlock contention (ie time spent in ia64_spinlock_contention). (I'm guessing some of the rest of the perf drop comes from those "interesting" cache behaviours still to be sought)

With some help from Lee Schermerhorn and Alan Brunelle I got a lockmeter kernel going, and it is suggesting that the greatest spinlock contention comes from the routines:


SPINLOCKS         HOLD            WAIT
  UTIL   CON    MEAN(  MAX )   MEAN(  MAX )(% CPU)     TOTAL NOWAIT  SPIN RJECT  NAME
  7.4%  2.8%   0.1us( 143us)  3.3us( 147us)( 1.4%)  75262432  97.2%  2.8%    0%  lock_sock_nested+0x30
 29.5%  6.6%   0.5us( 148us)  0.9us( 143us)(0.49%)  37622512  93.4%  6.6%    0%  tcp_v4_rcv+0xb30
  3.0%  5.6%   0.1us( 142us)  0.9us( 143us)(0.14%)  13911325  94.4%  5.6%    0%  release_sock+0x120
  9.6% 0.75%   0.1us( 144us)  0.7us( 139us)(0.08%)  75262432  99.2% 0.75%    0%  release_sock+0x30
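As a quick cross-check against the profiles, the WAIT (% CPU) figures for these four call sites can simply be added up. A small Python sketch, with the values copied by hand from the lockmeter output; it assumes the lockmeter and profiler percentages are measured on a comparable basis, which has not been verified.

```python
# (call site, WAIT % CPU) copied from the lockmeter output
WAIT_PCT_CPU = [
    ("lock_sock_nested+0x30", 1.4),
    ("tcp_v4_rcv+0xb30", 0.49),
    ("release_sock+0x120", 0.14),
    ("release_sock+0x30", 0.08),
]


def total_wait_pct(rows):
    """Sum the WAIT % CPU column over the given call sites."""
    return sum(pct for _, pct in rows)


total = total_wait_pct(WAIT_PCT_CPU)
print(f"{total:.2f}% of CPU spent waiting at these four call sites")
```

That comes to about 2.1%, so under that assumption these four socket-related call sites would account for most, though not all, of the "upwards of 3%" the profiles attribute to ia64_spinlock_contention.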

I suppose it stands to some reason that there would be contention associated with the socket since there will be two things going for the socket (a netperf/netserver and an interrupt/upthestack) each running on separate CPUs. Some of it looks like it _may_ be inevitable? - waking-up the user who will now be racing to grab the socket before the stack releases it? (I may have been mis-interpreting some of the code I was checking)
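A user-space analogy of that situation, sketched in Python: two threads (standing in for the interrupt path and the netperf/netserver process) take the same lock, and a thin wrapper counts how many acquisitions went straight through versus had to wait, roughly what the NOWAIT and SPIN columns report. This is only an illustration; MeteredLock is a made-up name, Python threads are not kernel contexts, and the counts vary from run to run.

```python
import threading


class MeteredLock:
    """A lock that records how many acquisitions had to wait."""

    def __init__(self):
        self._lock = threading.Lock()
        self.nowait = 0   # acquired on the first try
        self.waited = 0   # someone else held it, so we had to wait

    def __enter__(self):
        contended = not self._lock.acquire(blocking=False)
        if contended:
            self._lock.acquire()
        # Counters are updated while holding the lock, so no extra locking.
        if contended:
            self.waited += 1
        else:
            self.nowait += 1
        return self

    def __exit__(self, *exc):
        self._lock.release()
        return False


def run(iterations=10000):
    sock_lock = MeteredLock()   # stands in for the per-socket lock
    state = {"touched": 0}

    def worker():
        for _ in range(iterations):
            with sock_lock:
                state["touched"] += 1

    threads = [threading.Thread(target=worker, name=name)
               for name in ("softirq", "user")]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sock_lock, state["touched"]


if __name__ == "__main__":
    lock, touched = run()
    total = lock.nowait + lock.waited
    print(f"total {total}, nowait {lock.nowait}, waited {lock.waited}")
```

Binding both workers to one CPU would serialize them and remove most of the waiting, which is the user-space counterpart of the "bound to the interrupt CPU" case.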

Still, does this look like something worth pursuing? In a past life/OS when one was able to eliminate one percentage point of spinlock contention, two percentage points of improvement ensued.


rick jones
