Re: [PATCH v3 2/2] util: add util function buffer_zero_avx512()

2020-02-28 Thread Robert Hoo
On Fri, 2020-02-28 at 18:09 -0800, Richard Henderson wrote:
> On 2/27/20 6:24 PM, Robert Hoo wrote:
> >      if ((bv & 6) == 6 && (b & bit_AVX2)) {
> >          cache |= CACHE_AVX2;
> >      }
> > +    if ((bv & 6) == 6 && (b & bit_AVX512F)) {
> > +

Re: [PATCH v3 2/2] util: add util function buffer_zero_avx512()

2020-02-28 Thread Richard Henderson
On 2/27/20 6:24 PM, Robert Hoo wrote:
>      if ((bv & 6) == 6 && (b & bit_AVX2)) {
>          cache |= CACHE_AVX2;
>      }
> +    if ((bv & 6) == 6 && (b & bit_AVX512F)) {
> +        cache |= CACHE_AVX512F;
> +    }

Oh, one more thing I missed --
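For readers following the quoted hunk, here is a minimal, portable sketch of the flag-selection logic it shows. It assumes `b` holds EBX from CPUID leaf 7 (subleaf 0) and `bv` holds the low 32 bits of XCR0 as read with XGETBV. The function name `select_cache()` and the `CACHE_*` values are hypothetical; the patch's own definitions are not visible in this excerpt. The sketch mirrors the hunk as posted. Separately from this thread, Intel's documentation also ties AVX-512 register state to XCR0 bits 5 to 7, which this sketch does not test; that remark is not a reconstruction of the truncated reply above.

```c
#include <stdint.h>

/* CPUID leaf 7, subleaf 0, EBX feature bits (x86 architectural values). */
#define bit_AVX2     (1u << 5)
#define bit_AVX512F  (1u << 16)

/* Hypothetical flag values, chosen for illustration only. */
#define CACHE_AVX2     (1u << 0)
#define CACHE_AVX512F  (1u << 1)

/*
 * b:  EBX from CPUID leaf 7 (assumed to be read by the caller).
 * bv: low 32 bits of XCR0; bits 1 and 2 (mask 6) report that the OS
 *     saves and restores SSE and AVX register state.
 */
static unsigned select_cache(uint32_t b, uint32_t bv)
{
    unsigned cache = 0;

    if ((bv & 6) == 6 && (b & bit_AVX2)) {
        cache |= CACHE_AVX2;
    }
    /* Same OS-support test as in the quoted hunk. */
    if ((bv & 6) == 6 && (b & bit_AVX512F)) {
        cache |= CACHE_AVX512F;
    }
    return cache;
}
```

Taking the register values as parameters keeps the sketch testable on any host; real code would obtain them from CPUID and XGETBV.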

[PATCH v3 2/2] util: add util function buffer_zero_avx512()

2020-02-27 Thread Robert Hoo
And initialize buffer_is_zero() with it when Intel AVX512F is available on the host. This function uses Intel AVX512 fundamental instructions and is faster than the AVX2 implementation (in my unit test, with a 4K buffer, on CascadeLake SP, ~36% faster, buffer_zero_avx512() vs.
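As a rough illustration of the idea described above (test a buffer for all-zero content in 64-byte steps, the width of one 512-bit register, and pick the implementation once at start-up), here is a portable sketch. It is not the patch's code: it uses plain 64-bit integer ORs instead of AVX-512 instructions so it runs on any host, and the names `buffer_zero_chunk64()`, `buffer_zero_bytes()`, and `select_zero_fn()` are hypothetical.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

typedef bool (*zero_fn)(const void *buf, size_t len);

/* Baseline: check one byte at a time. */
static bool buffer_zero_bytes(const void *buf, size_t len)
{
    const unsigned char *p = buf;

    for (size_t i = 0; i < len; i++) {
        if (p[i]) {
            return false;
        }
    }
    return true;
}

/* Wide variant: OR together eight 64-bit words (64 bytes) per step. */
static bool buffer_zero_chunk64(const void *buf, size_t len)
{
    const unsigned char *p = buf;

    while (len >= 64) {
        uint64_t w[8];

        memcpy(w, p, sizeof(w));   /* alignment-safe load of 64 bytes */
        if (w[0] | w[1] | w[2] | w[3] | w[4] | w[5] | w[6] | w[7]) {
            return false;
        }
        p += 64;
        len -= 64;
    }
    /* Fewer than 64 bytes remain; finish with the byte loop. */
    return buffer_zero_bytes(p, len);
}

/* Choose an implementation once, based on a capability flag. */
static zero_fn select_zero_fn(bool have_wide)
{
    return have_wide ? buffer_zero_chunk64 : buffer_zero_bytes;
}
```

A caller would store the result of `select_zero_fn()` in a function pointer during initialization and call through it afterwards, so the capability check is not repeated for every buffer.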