https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108271
Robin Dapp <rdapp at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |rdapp at gcc dot gnu.org --- Comment #3 from Robin Dapp <rdapp at gcc dot gnu.org> --- This is basically the same problem as PR108412. As long as loads/stores have a high(ish) latency and we mostly do load/store, they will tend to lump together at the end of the function. Setting vector load/store to a latency of <= 2 helps here and we might want to do this in order to avoid excessive spilling. I had to deal with this before, e.g. in SPEC2006's calculix. In the end insn scheduling wouldn't buy us anything and rather caused more spilling causing performance degradationl