[Bug tree-optimization/109885] gcc does not generate movmskps and testps instructions (clang does)

pinskia at gcc dot gnu.org via Gcc-bugs Sat, 10 Feb 2024 01:53:29 -0800

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109885


Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|target                      |tree-optimization
                 CC|                            |pinskia at gcc dot gnu.org
             Blocks|                            |53947

--- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
What is even funnier on the LLVM side is if we have:
```
void f(unsigned int * __restrict a, unsigned int * __restrict b)
{
  unsigned int t = 0;
  t += (a[0] == b[0]);
  t += (a[1] == b[1])<<1;
  t += (a[2] == b[2])<<2;
  t += (a[3] == b[3])<<3;
  *a = t;
}
```
LLVM can produce movmskps for x86_64, but for aarch64 it then does a trick
similar to the one it did for the sum.
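
For comparison, here is a hand-written sketch of the x86_64 form that the
function above would ideally reduce to: one packed compare followed by
movmskps. It is not compiler output. The names `f_scalar` and `f_movmsk` are
made up for illustration, both return the mask instead of storing it to `*a`,
and the SSE2 path is only used when the compiler defines `__SSE2__`.

```c
#ifdef __SSE2__
#include <emmintrin.h>
#endif

/* Scalar reference: same bit layout as f() above (bit i set when
   a[i] == b[i]).  The bits are distinct, so |= matches the += there. */
static unsigned int f_scalar(const unsigned int *a, const unsigned int *b)
{
  unsigned int t = 0;
  for (int i = 0; i < 4; i++)
    t |= (unsigned int)(a[i] == b[i]) << i;
  return t;
}

/* Hand-vectorized form (hypothetical name, assumes SSE2 when available). */
static unsigned int f_movmsk(const unsigned int *a, const unsigned int *b)
{
#ifdef __SSE2__
  __m128i va = _mm_loadu_si128((const __m128i *)a);
  __m128i vb = _mm_loadu_si128((const __m128i *)b);
  /* pcmpeqd: each 32-bit lane becomes all-ones when equal, else zero. */
  __m128i eq = _mm_cmpeq_epi32(va, vb);
  /* movmskps: gather the sign bit of each lane into bits 0..3. */
  return (unsigned int)_mm_movemask_ps(_mm_castsi128_ps(eq));
#else
  /* Fallback for targets without SSE2. */
  return f_scalar(a, b);
#endif
}
```

Comparing the assembly generated for `f` with that for `f_movmsk` is one way to
see how close the vectorizer gets to this form.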

Note that GCC does not handle reductions well in SLP either.


Referenced Bugs:

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53947
[Bug 53947] [meta-bug] vectorizer missed-optimizations
