Re: [PATCH][AVX512] Optimize vpexpand* to mask mov when mask have all ones in it's lower part (including 0 and -1).

2021-06-21 Thread Hongtao Liu via Gcc-patches
This is the patch I'm going to push to the trunk.

On Wed, May 12, 2021 at 3:28 PM Hongtao Liu wrote:
>
> ping
>
> On Fri, Apr 30, 2021 at 12:49 PM Hongtao Liu wrote:
> >
> > Hi:
> > For v{,p}expand*, when the mask is 0, -1, or has all one bits in its
> > lower part, it can be optimized to simpl

Re: [PATCH][AVX512] Optimize vpexpand* to mask mov when mask have all ones in it's lower part (including 0 and -1).

2021-05-12 Thread Hongtao Liu via Gcc-patches
ping

On Fri, Apr 30, 2021 at 12:49 PM Hongtao Liu wrote:
>
> Hi:
> For v{,p}expand*, when the mask is 0, -1, or has all one bits in its
> lower part, it can be optimized to a simple mov or mask mov.
> Bootstrapped and regtested on x86_64-linux-gnu{-m32,} and
> x86_64-linux-gnu{-m32\ -march=casca

[PATCH][AVX512] Optimize vpexpand* to mask mov when mask have all ones in it's lower part (including 0 and -1).

2021-04-29 Thread Hongtao Liu via Gcc-patches
Hi:
For v{,p}expand*, when the mask is 0, -1, or has all one bits in its lower part, it can be optimized to a simple mov or mask mov.
Bootstrapped and regtested on x86_64-linux-gnu{-m32,} and x86_64-linux-gnu{-m32\ -march=cascadelake,-m64\ -march=cascadelake}.

gcc/ChangeLog:
* config/i38
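
To illustrate why the mask shapes named above allow this rewrite, here is a minimal scalar sketch in C. It is not taken from the patch and does not use GCC internals or AVX-512 intrinsics. The helper names (model_expand, model_mask_mov, mask_is_low_ones, verify_equivalence) and the 16-lane width are assumptions made for illustration only. An expand writes consecutive source elements to the lanes selected by the mask, so when the set bits form one run starting at bit 0, lane i receives source element i, which is what a masked move does.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define LANES 16  /* assumption: 16 x 32-bit elements, as in a 512-bit vector */

/* Scalar model of a merge-masked expand: consecutive elements of src are
   written to the lanes whose mask bit is set; other lanes keep passthru. */
static void model_expand(int32_t dst[LANES], const int32_t passthru[LANES],
                         uint16_t mask, const int32_t src[LANES])
{
    int next = 0;  /* index of the next unread source element */
    for (int i = 0; i < LANES; i++)
        dst[i] = ((mask >> i) & 1) ? src[next++] : passthru[i];
}

/* Scalar model of a merge-masked move: lane i takes src[i] when selected. */
static void model_mask_mov(int32_t dst[LANES], const int32_t passthru[LANES],
                           uint16_t mask, const int32_t src[LANES])
{
    for (int i = 0; i < LANES; i++)
        dst[i] = ((mask >> i) & 1) ? src[i] : passthru[i];
}

/* True for 0, all ones, and any mask whose set bits form one run from bit 0. */
static bool mask_is_low_ones(uint16_t mask)
{
    uint32_t m = mask;  /* widen so that m + 1 cannot wrap for 0xFFFF */
    return (m & (m + 1)) == 0;
}

/* Compares both models for every 16-bit mask and returns how many masks
   had the low-ones form. */
static int verify_equivalence(void)
{
    int32_t src[LANES], passthru[LANES], a[LANES], b[LANES];
    int count = 0;

    for (int i = 0; i < LANES; i++) {
        src[i] = 100 + i;     /* distinct values so any lane shift is visible */
        passthru[i] = -1 - i;
    }
    for (uint32_t mask = 0; mask <= 0xFFFF; mask++) {
        bool low_ones = mask_is_low_ones((uint16_t)mask);
        model_expand(a, passthru, (uint16_t)mask, src);
        model_mask_mov(b, passthru, (uint16_t)mask, src);
        /* With distinct source elements, the two models agree exactly
           when the mask has the low-ones form. */
        assert((memcmp(a, b, sizeof a) == 0) == low_ones);
        count += low_ones;
    }
    return count;
}
```

Calling verify_equivalence() walks all 65536 masks. In this model, mask 0 leaves every lane at its pass-through value and mask -1 (all ones) copies every lane, which correspond to the plain-move cases, while a partial run of low ones corresponds to the mask-mov case.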