On Wed, Mar 03, 2021 at 09:42:06PM -0300, André Almeida wrote:
> ** Performance
>
> - For comparing futex() and futex2() performance, I used the artificial
>   benchmarks implemented at perf (wake, wake-parallel, hash and
>   requeue). The setup was 200 runs for each test and using 8, 80, 800,
>   8000 for the number of threads, Note that for this test, I'm not using
>   patch 14 ("kernel: Enable waitpid() for futex2") , for reasons explained
>   at "The patchset" section.

How heavily contended were the benchmarks?  One of the benefits of the
original futex was that no system call was necessary in the happy path
when the lock is uncontended.  This matters especially on non-NUMA
systems (which are by far the more common case), since that's where
relying on a single memory access was a huge win for the original
futex.

I would expect futex2 to fare worse in this particular case, since it
requires a system call entry for all operations; the question is how
large the delta is between this worst case (for futex2) and best case
(for futex) scenario.

Cheers,

					- Ted
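
For readers who want the uncontended fast path spelled out, here is a minimal sketch of a lock built on the original futex() call. It is an illustration only, not code from the patchset or from perf: the names demo_lock, demo_unlock, futex_op and the futex_syscalls counter are hypothetical, and the three-state scheme (0 = unlocked, 1 = locked, 2 = locked with possible waiters) is one common userspace design, assumed here for the example.

```c
#define _GNU_SOURCE
#include <linux/futex.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/syscall.h>
#include <unistd.h>

/* Hypothetical counter, present only to make kernel entries visible. */
static atomic_ulong futex_syscalls;

/* Thin wrapper around the raw futex system call (Linux only). */
static long futex_op(atomic_uint *uaddr, int op, uint32_t val)
{
    atomic_fetch_add(&futex_syscalls, 1);
    return syscall(SYS_futex, uaddr, op, val, NULL, NULL, 0);
}

/* Lock word states: 0 = unlocked, 1 = locked, 2 = locked, maybe waiters. */
static void demo_lock(atomic_uint *m)
{
    unsigned int c = 0;

    /* Fast path: a single atomic compare-and-swap, no system call. */
    if (atomic_compare_exchange_strong(m, &c, 1))
        return;

    /* Slow path: mark the lock contended, then sleep in the kernel. */
    if (c != 2)
        c = atomic_exchange(m, 2);
    while (c != 0) {
        /* Sleeps only if the word still holds 2; otherwise returns. */
        futex_op(m, FUTEX_WAIT_PRIVATE, 2);
        c = atomic_exchange(m, 2);
    }
}

static void demo_unlock(atomic_uint *m)
{
    /* Fast path: nobody was marked as waiting, so no system call. */
    if (atomic_exchange(m, 0) == 2)
        futex_op(m, FUTEX_WAKE_PRIVATE, 1);
}
```

In this sketch a single thread that takes and releases the lock never enters the kernel, so futex_syscalls stays at zero; only a contended acquire or a release with waiters pays for a system call. That zero-syscall path is the best case for futex that the question above asks to have compared against futex2.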