[Bug target/40644] Optimizing for pentium-m gives worse code than optimizing for i486

2009-07-03 Thread aanisimov at inbox dot ru


--- Comment #3 from aanisimov at inbox dot ru  2009-07-03 19:12 ---
> 
> Try disabling prefetching.
> 

Indeed, removing -fprefetch-loop-arrays made the program run in 37.534 seconds,
exactly like one compiled for i686.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40644



[Bug target/40644] Optimizing for pentium-m gives worse code than optimizing for i486

2009-07-03 Thread rguenth at gcc dot gnu dot org


--- Comment #2 from rguenth at gcc dot gnu dot org  2009-07-03 18:55 ---
Try -march=pentium-m -mtune=generic.  Pentium-M never received any special
tuning (it is the same as for pentium-pro).  So is -march=i686 btw, but
i686 does not have SSE, so it is likely vectorization and/or prefetching
that slows your case 3. down.

Try disabling prefetching.


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40644



[Bug target/40644] Optimizing for pentium-m gives worse code than optimizing for i486

2009-07-03 Thread aanisimov at inbox dot ru


--- Comment #1 from aanisimov at inbox dot ru  2009-07-03 18:28 ---
Created an attachment (id=18137)
 --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=18137&action=view)
Sample program


-- 


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=40644