https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113597
--- Comment #8 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
(In reply to Alex Coplan from comment #7)
> I expect the store pairs come from memcpy lowering/expansion in the aarch64
> backend, that is the only way we get store pairs so early in the RTL
> pipeline IIRC.

In this case, memset is more likely.
Either:
  for (int i = 0; i < j; i++)
    m[i] = vdupq_n_f32(0.F);

Or
  for (int i = 0; i < l; i++)
    n[i] = vdupq_n_f32(0.F);
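
To illustrate why loops of this shape are memset candidates, here is a minimal portable sketch. It is not taken from the testcase: `vec4` is a hypothetical stand-in for `float32x4_t`, and `fill_loop`, `fill_memset`, and `fills_match` are illustrative names. Assuming IEEE 754 floats (where 0.0f is all-zero bits), it only checks that the zeroing loop and a `memset` leave identical bytes; it says nothing about which pass in GCC performs the transformation here.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical portable stand-in for float32x4_t (assumption, not from the report). */
typedef struct { float lane[4]; } vec4;

enum { MAX_ELEMS = 64 };

/* Same shape as the loops quoted above: store an all-zero vector to each element. */
static void fill_loop(vec4 *m, int n)
{
    const vec4 zero = { { 0.0f, 0.0f, 0.0f, 0.0f } };
    for (int i = 0; i < n; i++)
        m[i] = zero;
}

/* The single call such a loop could be replaced with. */
static void fill_memset(vec4 *m, int n)
{
    memset(m, 0, (size_t)n * sizeof *m);
}

/* Returns 1 if both approaches leave the buffers byte-for-byte identical. */
int fills_match(int n)
{
    vec4 a[MAX_ELEMS], b[MAX_ELEMS];
    assert(n >= 0 && n <= MAX_ELEMS);
    /* Pre-fill with a non-zero pattern so untouched elements are also compared. */
    memset(a, 0xFF, sizeof a);
    memset(b, 0xFF, sizeof b);
    fill_loop(a, n);
    fill_memset(b, n);
    return memcmp(a, b, sizeof a) == 0;
}
```

Because the stored value is all-zero bytes and the elements are contiguous, the loop writes exactly what `memset(m, 0, n * sizeof *m)` would, which is consistent with the store pairs coming from memset expansion rather than memcpy.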