[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2011-06-27 Thread rguenth at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31897 Richard Guenther changed: What|Removed |Added Status|NEW |RESOLVED Resolution|

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2010-05-22 Thread rguenth at gcc dot gnu dot org
--- Comment #18 from rguenth at gcc dot gnu dot org 2010-05-22 18:11 --- GCC 4.3.5 is being released, adjusting target milestone. -- rguenth at gcc dot gnu dot org changed: What|Removed |Added --

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2009-08-04 Thread rguenth at gcc dot gnu dot org
--- Comment #17 from rguenth at gcc dot gnu dot org 2009-08-04 12:28 --- GCC 4.3.4 is being released, adjusting target milestone. -- rguenth at gcc dot gnu dot org changed: What|Removed |Added --

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2009-01-24 Thread rguenth at gcc dot gnu dot org
--- Comment #16 from rguenth at gcc dot gnu dot org 2009-01-24 10:19 --- GCC 4.3.3 is being released, adjusting target milestone. -- rguenth at gcc dot gnu dot org changed: What|Removed |Added --

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-08-27 Thread jsm28 at gcc dot gnu dot org
--- Comment #15 from jsm28 at gcc dot gnu dot org 2008-08-27 22:01 --- 4.3.2 is released, changing milestones to 4.3.3. -- jsm28 at gcc dot gnu dot org changed: What|Removed |Added ---

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-08-23 Thread ubizjak at gmail dot com
--- Comment #14 from ubizjak at gmail dot com 2008-08-23 20:07 --- Looking at http://users.physik.fu-berlin.de/~tburnus/gcc-trunk/benchmark/, it looks that the problem was mysteriously fixed in recent mainline. -- ubizjak at gmail dot com changed: What|Removed

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-30 16:04 --- If you have nothing against, I would probably go for -mfpmath=sse implied by -ftree-vectorize route then. Since there is now i386 ABI mailing list I hope if we can move in direction of having -mfpmath=sse by defau

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-01-19 Thread ubizjak at gmail dot com
--- Comment #10 from ubizjak at gmail dot com 2008-01-19 16:31 --- (In reply to comment #9) > Does the regression on C2 duo show even without vectorizing? It looks like > generic SSE fpmath performance issue. There should be no reason why SSE math > in combination with SSE vectorizatio

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-01-19 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2008-01-19 12:03 --- Does the regression on C2 duo show even without vectorizing? It looks like generic SSE fpmath performance issue. There should be no reason why SSE math in combination with SSE vectorization should result in regress

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-12-21 Thread ubizjak at gmail dot com
--- Comment #8 from ubizjak at gmail dot com 2007-12-21 18:57 --- Confirmed on K8 at http://gcc.gnu.org/ml/gcc-patches/2007-12/msg01042.html -- ubizjak at gmail dot com changed: What|Removed |Added --

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-10-19 Thread ubizjak at gmail dot com
--- Comment #7 from ubizjak at gmail dot com 2007-10-19 17:53 --- (In reply to comment #6) > It is little bit sick, but what about implying -mfpmath=sse when > -ftree-vectorize is used and SSE is available? Then you will hit Core2 Duo, that shows the opposite in 32bit and 64bit mode: -

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-10-09 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-10-09 14:54 --- It is little bit sick, but what about implying -mfpmath=sse when -ftree-vectorize is used and SSE is available? The reason why we don't default to fpmath=sse is because the extra precision is told to be part of i386

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-08-09 Thread mmitchel at gcc dot gnu dot org
-- mmitchel at gcc dot gnu dot org changed: What|Removed |Added Priority|P3 |P2 http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31897

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-07-18 Thread ubizjak at gmail dot com
--- Comment #5 from ubizjak at gmail dot com 2007-07-18 12:05 --- The problem is in cptrf2 function when both -mfpmath=387 and -ftree-vectorize are used. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31897

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-07-04 Thread ubizjak at gmail dot com
--- Comment #4 from ubizjak at gmail dot com 2007-07-04 12:32 --- (In reply to comment #1) > gfortran -ffast-math -funroll-loops -O3 -msse3 -mfpmath=387 rnflow.f90 > > time ./a.out > user0m37.982s > gfortran -ffast-math -funroll-loops -O3 -msse3 -mfpmath=387 -ftree-vectorize > rnfl

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-07-04 Thread ubizjak at gmail dot com
--- Comment #3 from ubizjak at gmail dot com 2007-07-04 12:29 --- (In reply to comment #2) > Can't reproduce this, gcc 4.3 actually seems to be faster (tests done on Intel > quadcore Core2): On core2 the bug doesn't trigger, but it shows on FC4 with: vendor_id : GenuineIntel cpu

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-07-04 Thread jakub at gcc dot gnu dot org
--- Comment #2 from jakub at gcc dot gnu dot org 2007-07-04 11:57 --- Can't reproduce this, gcc 4.3 actually seems to be faster (tests done on Intel quadcore Core2): /usr/src/gcc-4.2/obj/gcc/gfortran -B /usr/src/gcc-4.2/obj/gcc/ -L /usr/src/gcc-4.2/obj/x86_64-unknown-linux-gnu/32/libgfor

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-06-30 Thread pinskia at gcc dot gnu dot org
-- pinskia at gcc dot gnu dot org changed: What|Removed |Added CC||pinskia at gcc dot gnu dot |