On 2019/10/7 23:46, Will Deacon wrote:
> Hi all,
>
> This is version three of the patches I previously posted here:
>
>   v1: https://lkml.kernel.org/r/20190802101000.12958-1-w...@kernel.org
>   v2: https://lkml.kernel.org/r/20190827163204.29903-1-w...@kernel.org
>
> Changes since v2 include:
>
>   - Remove the x86 assembly version and enable this code unconditionally
>   - Move saturation warnings out-of-line to reduce image bloat
>
> Cheers,
>
> Will
>
> Cc: Kees Cook <keesc...@chromium.org>
> Cc: Ingo Molnar <mi...@kernel.org>
> Cc: Elena Reshetova <elena.reshet...@intel.com>
> Cc: Peter Zijlstra <pet...@infradead.org>
> Cc: Ard Biesheuvel <ard.biesheu...@linaro.org>
> Cc: Hanjun Guo <guohan...@huawei.com>
> Cc: Jan Glauber <jglau...@marvell.com>
>
I tested this on top of 5.4-rc2 (with Jan's open-read-close file test
case) on a 96-core ARM64 server. I see little difference within 24
cores (each set of 24 cores is one NUMA node), but a +5.9% performance
improvement on 48 cores and +8.4% on 96 cores.

For the ARM64 arch,

Tested-by: Hanjun Guo <guohan...@huawei.com>

Thanks
Hanjun