[Bug tree-optimization/107247] SLP reduction results fail to reduce to a single accumulator
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107247

Richard Biener changed:

           What    |Removed                       |Added
----------------------------------------------------------------------------
     Ever confirmed|0                             |1
             Status|UNCONFIRMED                   |ASSIGNED
           Assignee|unassigned at gcc dot gnu.org |rguenth at gcc dot gnu.org
   Last reconfirmed|                              |2022-10-13

--- Comment #1 from Richard Biener ---
I have a patch.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107247

--- Comment #2 from CVS Commits ---
The master branch has been updated by Richard Biener:

https://gcc.gnu.org/g:e5139d18dfb8130876ea59178e8471fb1b34bb80

commit r13-3276-ge5139d18dfb8130876ea59178e8471fb1b34bb80
Author: Richard Biener
Date:   Thu Oct 13 14:56:01 2022 +0200

    tree-optimization/107247 - reduce SLP reduction accumulator

    The following makes sure to reduce a multi-vector SLP reduction
    accumulator to a single vector using vector operations if
    easily possible (if the number of lanes in the vector type is
    a multiple of the number of scalar accumulators).

            PR tree-optimization/107247
            * tree-vect-loop.cc (vect_create_epilog_for_reduction): Reduce
            multi vector SLP reduction accumulators.  Check the adjusted
            number of accumulator vectors against one for the re-use
            in the epilogue.
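
The condition in the commit message (lanes a multiple of the number of scalar accumulators) can be illustrated with a small scalar model. This is only a sketch and not the code in tree-vect-loop.cc: the `vec4i` type, the helper names, and the choice of 4 lanes with 2 accumulators are assumptions made for the example. Because 4 is a multiple of 2, lane `l` of every accumulator vector belongs to scalar accumulator `l % 2`, so the vectors can be combined with plain element-wise adds before any scalar extraction.

```c
#include <stddef.h>

#define LANES 4   /* lanes per vector (assumed for this sketch) */
#define GROUP 2   /* scalar accumulators in the SLP reduction (assumed) */

/* Stand-in for a vector register; hypothetical type, not a GCC internal. */
typedef struct { int lane[LANES]; } vec4i;

/* Element-wise add, modelling one vector add instruction. */
static vec4i vec_add(vec4i a, vec4i b)
{
    vec4i r;
    for (int l = 0; l < LANES; ++l)
        r.lane[l] = a.lane[l] + b.lane[l];
    return r;
}

/* Combine nvecs accumulator vectors into one using only vector adds.
   This is valid because LANES % GROUP == 0: the same lane of every
   vector holds a partial sum for the same scalar accumulator. */
static vec4i reduce_to_single_vector(const vec4i *acc, size_t nvecs)
{
    vec4i sum = acc[0];
    for (size_t i = 1; i < nvecs; ++i)
        sum = vec_add(sum, acc[i]);
    return sum;
}

/* Final scalar step: add up the lanes that belong to accumulator g. */
static int extract_accumulator(vec4i v, int g)
{
    int s = 0;
    for (int l = g; l < LANES; l += GROUP)
        s += v.lane[l];
    return s;
}
```

With two accumulator vectors {1,2,3,4} and {5,6,7,8}, the single combined vector is {6,8,10,12}, and the two scalar results are taken from lanes 0,2 and lanes 1,3 respectively.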
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107247

Richard Biener changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|ASSIGNED                    |RESOLVED

--- Comment #3 from Richard Biener ---
This is now fixed for the cases not requiring permutations.  One could think
of a three-lane SLP reduction with three four component vectors being reduced
by first "expanding" to four "three component" vectors, summing them and then
extracting from the lower three lanes.  Likewise for a six-lane SLP reduction
which would get a more complex extraction of two-vector, six-lane pairs.

Unless a compelling case comes along I don't consider these important.
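
The three-lane case described in comment #3 was not implemented, so the following is purely a hypothetical sketch of the idea. It assumes the twelve lanes of the three four-lane vectors are laid out so that flat lane `k` belongs to scalar accumulator `k % 3`; the `vec4i` type and all helper names are made up for the example. The "expansion" is the permutation step that the fixed cases avoid.

```c
#define LANES 4                       /* lanes per vector (assumed) */
#define GROUP 3                       /* scalar accumulators (assumed) */
#define NVECS 3                       /* accumulator vectors before reduction */
#define NEXP  (NVECS * LANES / GROUP) /* expanded vectors: 12 / 3 = 4 */

/* Stand-in for a vector register; hypothetical type, not a GCC internal. */
typedef struct { int lane[LANES]; } vec4i;

/* "Expand" three four-lane vectors into four vectors that each carry one
   full group of three accumulators in their lower lanes.  On real
   hardware this step would need permutes. */
static void expand_groups(const vec4i acc[NVECS], vec4i out[NEXP])
{
    for (int v = 0; v < NEXP; ++v) {
        for (int l = 0; l < GROUP; ++l) {
            int k = v * GROUP + l;            /* flat lane index, 0..11 */
            out[v].lane[l] = acc[k / LANES].lane[k % LANES];
        }
        for (int l = GROUP; l < LANES; ++l)
            out[v].lane[l] = 0;               /* unused padding lane */
    }
}

/* Sum the expanded vectors element-wise; lanes 0..2 of the result then
   hold the three scalar accumulators. */
static vec4i reduce_three_lane(const vec4i acc[NVECS])
{
    vec4i exp[NEXP];
    vec4i sum = {{0, 0, 0, 0}};
    expand_groups(acc, exp);
    for (int v = 0; v < NEXP; ++v)
        for (int l = 0; l < LANES; ++l)
            sum.lane[l] += exp[v].lane[l];
    return sum;
}
```

For inputs {1,2,3,4}, {5,6,7,8}, {9,10,11,12} the lower three lanes of the result are 22, 26 and 30, which matches summing every third element of 1..12 starting at 1, 2 and 3.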
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107247

Andrew Pinski changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |13.0