[Bug middle-end/47298] -O3 destroys beautifully vectorized code obtained at -O2

Joost.VandeVondele at pci dot uzh.ch Fri, 14 Jan 2011 12:53:06 -0800

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=47298


--- Comment #2 from Joost VandeVondele <Joost.VandeVondele at pci dot uzh.ch> 
2011-01-14 20:52:54 UTC ---
(In reply to comment #1)
> It's faster for me with -O3 (Athlon64, using -march=native).

well not on 
model name      : Intel(R) Xeon(R) CPU           X5550  @ 2.67GHz
stepping        : 5

I have 8Gflops with -O2 and somewhat more than 4 with -O3

BTW, the proper test program is
> cat test_compare.f90 
      REAL(KIND=8), DIMENSION(12,12) :: A,B,C
      A=0 ; B=0 ; C=0
      DO I=1,10000000
         CALL HARD_NN_12_12_12(C,A,B)
      ENDDO
      END

[Bug middle-end/47298] -O3 destroys beautifully vectorized code obtained at -O2

Reply via email to