https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103592
Bug ID: 103592 Summary: fatigue2 benchmarks on zen runs 43% faster with -fno-tree-vectorize -fno-tree-slp-vectorize Product: gcc Version: 12.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- While looking into -fno-inline-functions-called-once difference I noticed that on zen hardware I get: - 0m33s runtime for fatigue2 benchmark (from phoronix) when built with -Ofast -march=native -fno-slp-vectorize -fno-tree-vectorize - 0m57s for -Ofast -march=native binary