Optimize visibilitymap_count() with AVX-512 instructions.

Commit 792752af4e added infrastructure for using AVX-512 intrinsic
functions, and this commit uses that infrastructure to optimize
visibilitymap_count().  Specifically, a new pg_popcount_masked()
function is introduced that applies a bitmask to every byte in the
buffer prior to calculating the population count, which is used to
filter out the all-visible or all-frozen bits as needed.  Platforms
without AVX-512 support should also see a nice speedup due to the
reduced number of calls to a function pointer.

Co-authored-by: Ants Aasma
Discussion: https://postgr.es/m/BL1PR11MB5304097DF7EA81D04C33F3D1DCA6A%40BL1PR11MB5304.namprd11.prod.outlook.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/41c51f0c68b21b4603bd2a9c3d3ad017fdd22627

Modified Files
--------------
src/backend/access/heap/visibilitymap.c | 25 ++-----
src/include/port/pg_bitutils.h          | 34 +++++++++
src/port/pg_bitutils.c                  | 126 ++++++++++++++++++++++++++++++++
src/port/pg_popcount_avx512.c           | 60 +++++++++++++++
4 files changed, 225 insertions(+), 20 deletions(-)
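
For readers unfamiliar with the masked population count described in the
commit message, the following is a minimal portable sketch of the idea, not
the code from this commit.  The names popcount64_sketch() and
popcount_masked_sketch() are hypothetical, and the sketch uses no AVX-512
intrinsics or function pointers; it only illustrates applying one byte-wide
mask to every byte of a buffer before counting the set bits.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: count set bits by clearing the lowest one each pass. */
static int
popcount64_sketch(uint64_t word)
{
    int         count = 0;

    while (word != 0)
    {
        word &= word - 1;       /* clears the lowest set bit */
        count++;
    }
    return count;
}

/*
 * Hypothetical sketch: return the number of set bits in buf after ANDing
 * every byte with mask.  This is an illustration of the technique, not
 * PostgreSQL's implementation.
 */
static uint64_t
popcount_masked_sketch(const char *buf, size_t bytes, uint8_t mask)
{
    uint64_t    popcnt = 0;

    /* Replicate the mask byte into all eight bytes of a 64-bit word. */
    uint64_t    mask64 = ~UINT64_C(0) / 0xFF * mask;

    /* Process eight bytes at a time; memcpy avoids alignment assumptions. */
    while (bytes >= sizeof(uint64_t))
    {
        uint64_t    word;

        memcpy(&word, buf, sizeof(word));
        popcnt += popcount64_sketch(word & mask64);
        buf += sizeof(word);
        bytes -= sizeof(word);
    }

    /* Handle any remaining bytes one at a time. */
    while (bytes-- > 0)
        popcnt += popcount64_sketch((uint8_t) *buf++ & mask);

    return popcnt;
}
```

Assuming the visibility map stores two bits per heap page, with the
all-visible flag in the low bit and the all-frozen flag in the high bit of
each pair, a mask of 0x55 would count only the all-visible bits and a mask
of 0xAA only the all-frozen bits.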