https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109667

            Bug ID: 109667
           Summary: [12/13/14 Regression] Unnecessary temporary storage
                    used for 32-byte struct
           Product: gcc
           Version: 12.3.1
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: chfast at gmail dot com
  Target Milestone: ---

Reduced reproducer:

struct i256 {
    long v[4];
};
void assign(struct i256 *v, long z) {
    struct i256 r = {};
    for (int i = 0; i < 1; ++i) 
        r.v[i] = z;
    *v = r;
}

https://godbolt.org/z/avM74o3r6

The compiler allocates temporary storage on stack for `r`:

assign:
        pxor    xmm0, xmm0
        mov     QWORD PTR [rsp-40], rsi
        movups  XMMWORD PTR [rsp-32], xmm0
        movdqa  xmm1, XMMWORD PTR [rsp-40]
        mov     QWORD PTR [rsp-16], 0
        movdqa  xmm2, XMMWORD PTR [rsp-24]
        movups  XMMWORD PTR [rdi], xmm1
        movups  XMMWORD PTR [rdi+16], xmm2
        ret

Regression since 12. The 11 compiles nicely to:

assign:
        mov     QWORD PTR [rdi], rsi
        mov     QWORD PTR [rdi+8], 0
        mov     QWORD PTR [rdi+16], 0
        mov     QWORD PTR [rdi+24], 0
        ret

Reply via email to