> That said, another thing that strikes me is that you have 32 CPU
> threads, and the stupid test-program I sent out had MAX_THREADS set to
> 16. Did you change that? Because if not, then some of the extreme
> performance profile might be about how the threads get scheduled on
> your machine (HT threads vs full cores etc).

I'll try to get new numbers with 32 threads[*] - but even if they look
good, I'd be upset about the 16 thread case being worse with the
cmpxchg/no-cpu-relax case than the original code.

-Tony

[*] probably not till tomorrow

