> > L.S., > > > > On our weather forecasting code (compiled with -O3 -flto and linked with > > -O3 -flto -fwhole-program) I get a speedup of 65 seconds per time step > > in the model integration vs. 75 seconds with -O3 alone. > > There is bug making -fwhole-program disabled with LTO compilations. > I hope to get this fixed in mainline tomorrow. > > It will be interesting to see how much difference -fwhole-program makes > for you. Also ipa-sra was finally enabled at -O2 and I would be greatly > interested if it makes any difference (in general it should help to > fortran codebases by eliminating need to pass stuff around by reference) and just for non-scientific comparsion, this is with the patches I sent tonight.
-rwxr-xr-x 1 jh jh 57000 2009-10-06 23:53 gzip-O3 -rwxr-xr-x 1 jh jh 73296 2009-10-06 23:53 gzip-O3-flto -rwxr-xr-x 1 jh jh 56368 2009-10-06 23:53 gzip-O3-flto-fwhole-program -rwxr-xr-x 1 jh jh 76496 2009-10-06 23:56 gzip-O3-combine -rwxr-xr-x 1 jh jh 57136 2009-10-06 23:55 gzip-O3-combine-fwhole-program So things seems to work now plus minus as expected. I.e. LTO builds seems similar to combined builds and whole-programs improves code size quite noticeably. Runtime results for gzip are pretty much unchanged, but that is expected. I am quite curoius about full SPEC run. Honza > > Honza