https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82139
Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Last reconfirmed|                            |2021-09-04
           Keywords|                            |ra
             Status|UNCONFIRMED                 |NEW
     Ever confirmed|0                           |1

--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
It is worse on the trunk:
.L2:
        movdqu  (%rdi), %xmm1
        movdqu  (%rdi), %xmm0
        addq    $16, %rdi
        paddd   %xmm3, %xmm1
        paddd   %xmm2, %xmm0
        blendpd $2, %xmm0, %xmm1
        movups  %xmm1, -16(%rdi)
        cmpq    %rdi, %rax
        jne     .L2

Why two loads from %rdi here?
This is done during RA as far as I can tell.