Re: [PATCH] Optimize multiplication for V8QI,V16QI,V32QI under TARGET_AVX512BW [target/95488]

2020-06-12 Thread Hongtao Liu via Gcc-patches
Thanks for the review. On Fri, Jun 12, 2020 at 11:28 AM Jeff Law wrote: > > On Fri, 2020-06-05 at 13:46 +0800, Hongtao Liu via Gcc-patches wrote: > > Hi: > > > > +/* Optimize vector MUL generation for V8QI, V16QI and V32QI > > + under TARGET_AVX512BW. i.e. for v16qi a * b, it has > > + > > +

Re: [PATCH] Optimize multiplication for V8QI,V16QI,V32QI under TARGET_AVX512BW [target/95488]

2020-06-11 Thread Jeff Law via Gcc-patches
On Fri, 2020-06-05 at 13:46 +0800, Hongtao Liu via Gcc-patches wrote: > Hi: > > +/* Optimize vector MUL generation for V8QI, V16QI and V32QI > + under TARGET_AVX512BW. i.e. for v16qi a * b, it has > + > + vpmovzxbw ymm2, xmm0 > + vpmovzxbw ymm3, xmm1 > + vpmullw ymm4, ymm2, ymm3 > +

[PATCH] Optimize multiplication for V8QI,V16QI,V32QI under TARGET_AVX512BW [target/95488]

2020-06-04 Thread Hongtao Liu via Gcc-patches
Hi: +/* Optimize vector MUL generation for V8QI, V16QI and V32QI + under TARGET_AVX512BW. i.e. for v16qi a * b, it has + + vpmovzxbw ymm2, xmm0 + vpmovzxbw ymm3, xmm1 + vpmullw ymm4, ymm2, ymm3 + vpmovwb xmm0, ymm4 + + it would take less instructions than ix86_expand_vecop_qihi. +