http://gcc.gnu.org/bugzilla/show_bug.cgi?id=47298
Joost VandeVondele <Joost.VandeVondele at mat dot ethz.ch> changed: What |Removed |Added ---------------------------------------------------------------------------- Last reconfirmed| |2012-06-29 --- Comment #4 from Joost VandeVondele <Joost.VandeVondele at mat dot ethz.ch> 2012-06-29 14:44:05 UTC --- on 4.8 this still is not handled optimally. I get 4.3s for: gfortran -O2 -funroll-loops -ftree-vectorize -ffast-math -march=native 6.7s for: gfortran -O3 -funroll-loops -ftree-vectorize -ffast-math -march=native so more than 50% slowdown going from -O2 to -O3 on -march=corei7 -mcx16 -msahf -mno-movbe -mno-aes -mno-pclmul -mpopcnt -mno-abm -mno-lwp -mno-fma -mno-fma4 -mno-xop -mno-bmi -mno-bmi2 -mno-tbm -mno-avx -mno-avx2 -msse4.2 -msse4.1 -mno-lzcnt -mno-rtm -mno-hle -mno-rdrnd -mno-f16c -mno-fsgsbase --param l1-cache-size=32 --param l1-cache-line-size=64