https://gcc.gnu.org/bugzilla/show_bug.cgi?id=123190

            Bug ID: 123190
           Summary: [16 Regression] 8% slowdown of 433.milc on AMD zen4
                    since r16-5275-ga645e903e8c394
           Product: gcc
           Version: 16.0
            Status: UNCONFIRMED
          Keywords: missed-optimization
          Severity: normal
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: pheeck at gcc dot gnu.org
                CC: rguenth at gcc dot gnu.org
            Blocks: 26163
  Target Milestone: ---
              Host: x86_64-linux
            Target: x86_64-linux

As seen here

https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=1109.70.0

there was an 8% exec time slowdown of the 433.milc SPEC 2006
benchmark when compiled with -O2 -march=x86-64-v3 -flto -fprofile-use on an AMD
Zen4
machine.  In pr122585, comment1 Andrew Pinski guessed that this was caused by
r16-5275-ga645e903e8c394 and I verified that this is indeed the case. 

Author:     Richard Biener <[email protected]>
AuthorDate: Thu Nov 6 11:49:31 2025 +0100
Commit:     Richard Biener <[email protected]>
CommitDate: Fri Nov 14 12:06:02 2025 +0100

    tree-optimization/122573 - enhance SLP of invariant loads

    Currently SLP of invariant loads is only supported for the case of
    a single load that is splat, as side-effect of supporting this case
    even for non-invariant loads.  The following extends this to any
    set of invariant loads.  The way we have load permutations for
    these makes it a bit awkward, thus adjustments in that area.


Referenced Bugs:

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163
[Bug 26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

Reply via email to