I tested the two proposed patches from earlier this thread on a 4S system. This was just with my (worst case) micro.
Unfortunately both patches scale much worse than the duplicated pools, and can be even worse than the baseline (not sure why). The base line peaks at slightly above 200K ops/s with less than 20 CPUs. And the it gets slower until settling around 100K ops/s with more CPUs. Ted's patch peaks at 350K with four CPUs, but then quickly degrades to 50K ops/s at 20+ CPUs. At 144 CPUs it is slightly faster again at ~80K. Spelvin's patch peaks at only 140K at 2 CPUs (so it's slower than base line), stays around 120K upto 20, then degrades quickly to 50K and then slowly improves again to ~80K. The duplicated pool patch is ~200K upto 20 CPus, 400K upto 40, 600K at slightly below 60 CPUs, and then very slowly degrades to 520K at 144. -Andi -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/