https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117875
Richard Biener <rguenth at gcc dot gnu.org> changed:
What    |Removed                       |Added
----------------------------------------------------------------------------
Target  |x86_64-*-*, aarch64*          |x86_64-*-*, aarch64*, loongarch
Assignee|unassigned at gcc dot gnu.org |rguenth at gcc dot gnu.org
Status  |NEW                           |ASSIGNED
--- Comment #17 from Richard Biener <rguenth at gcc dot gnu.org> ---
My bisect on a Zen2 machine lands on r15-5340-g4b8c5b337e007e ...
So --param vect-force-slp=0 vs. 1 shows the regression there. I also
re-confirmed it on trunk head by toggling that param.
The interesting difference is probably
-fast_algorithms.c:133:19: optimized: loop vectorized using 32 byte vectors
-fast_algorithms.c:133:19: optimized: loop versioned for vectorization because of possible aliasing
-fast_algorithms.c:133:19: optimized: loop vectorized using 16 byte vectors
but there's also
-plan7.c:137:17: optimized: loop vectorized using 16 byte vectors
-sre_math.c:174:17: optimized: loop vectorized using 16 byte vectors
-sre_math.c:192:17: optimized: loop vectorized using 16 byte vectors
I will have a closer look.
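
For anyone wanting to reproduce this kind of comparison, here is a minimal sketch of a script that lists the "optimized:" report lines present in one vectorizer report but missing from another. It assumes the two reports were captured as text, for example from builds using -fopt-info-vec-optimized with --param vect-force-slp=0 and =1; the function names parse_report and lost_vectorizations are hypothetical helpers, not part of GCC, and this is not necessarily how the lines above were produced.

```python
import re

# Matches report lines of the form shown above, e.g.
#   fast_algorithms.c:133:19: optimized: loop vectorized using 32 byte vectors
# A leading "-" or "+" from a diff is tolerated and ignored.
REPORT_RE = re.compile(
    r"^[-+]?(?P<file>[^:\s]+):(?P<line>\d+):(?P<col>\d+): optimized: (?P<msg>.+)$"
)


def parse_report(text):
    """Return the set of (file, line, col, message) entries found in a report."""
    entries = set()
    for raw in text.splitlines():
        m = REPORT_RE.match(raw.strip())
        if m:
            entries.add((m["file"], int(m["line"]), int(m["col"]), m["msg"]))
    return entries


def lost_vectorizations(baseline, candidate):
    """Entries present in the baseline report but absent from the candidate."""
    return sorted(parse_report(baseline) - parse_report(candidate))


if __name__ == "__main__":
    # Hypothetical file names: one report per --param vect-force-slp setting.
    with open("report-slp0.txt") as f0, open("report-slp1.txt") as f1:
        for file, line, col, msg in lost_vectorizations(f0.read(), f1.read()):
            print(f"-{file}:{line}:{col}: optimized: {msg}")
```

Comparing sets rather than running a textual diff makes the result independent of the order in which translation units were compiled, at the cost of collapsing exact duplicate lines.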