> Sebastian, > > Here are (attached) results for testing HIRLAM with and without > -ftree-loop-linear. > > As you can see, the results are neutral: 4 loops fewer vectorized, but > about 50 fewer recognized. >
any chance you kept the dumps and can report which loops were not vectorized/recognized with -ftree-loop-linear (so we could see if these represent missed vectorization opportunities?) thanks, dorit > Now I like to redo that test with -ftree-loop-distribution. Can you > send me a patch against the trunk (otherwise it won't be a fair comparison). > > Kind regards, > > -- > Toon Moene - e-mail: [EMAIL PROTECTED] - phone: +31 346 214290 > Saturnushof 14, 3738 XG Maartensdijk, The Netherlands > At home: http://moene.indiv.nluug.nl/~toon/ > GNU Fortran's path to Fortran 2003: http://gcc.gnu.org/wiki/Fortran2003 > Baseline, no source changes: > > Mon Dec 10 17:45:19 UTC 2007 (revision 130746) > > Compilation flags: > > CCFLAGS := -g -O3 $(MACHINECPP) -ffast-math -fno-associative-math - > march=native -mtune=native -ftree-vectorizer-verbose=2 > FCFLAGS := -g -O3 -fbacktrace -ffpe-trap=invalid,zero,overflow - > ffast-math -fno-associative-math -march=native -mtune=native -ftree- > vectorizer-verbose=2 > > Loops vectorized: > 5675 > Loops not vectorized: > 13705 > > Timings: > 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK 12.7488 SECONDS > 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK 2445.9609 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 259.3362 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 12.4408 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 305.9351 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 262.1124 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 12.7448 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 2323.3733 SECONDS > 20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK 412.7058 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 264.5685 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 12.6648 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 306.7352 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 261.5164 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 12.7688 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 2325.3774 SECONDS > 20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK 413.8739 SECONDS > > Baseline, no source changes, with -ftree-loop-linear: > > Mon Dec 10 17:45:19 UTC 2007 (revision 130746) > > Compilation flags: > > CCFLAGS := -g -O3 $(MACHINECPP) -ftree-loop-linear -ffast-math -fno- > associative-math -march=native -mtune=native -ftree-vectorizer-verbose=2 > FCFLAGS := -g -O3 -ftree-loop-linear -fbacktrace -ffpe-trap=invalid, > zero,overflow -ffast-math -fno-associative-math -march=native - > mtune=native -ftree-vectorizer-verbose=2 > > This compilation got one ICE: > > rttov_aitosu.f90: In function 'rttov_aitosu': > rttov_aitosu.f90:4: error: definition in block 262 does not dominate > use in block 134 > for SSA_NAME: pretmp.240_59 in statement: > prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)> > PHI argument > pretmp.240_59 > for PHI node > prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)> > rttov_aitosu.f90:4: internal compiler error: verify_ssa failed > Please submit a full bug report, > with preprocessed source if appropriate. > See <http://gcc.gnu.org/bugs.html> for instructions. > > Worked around by compiling this file without -ftree-loop-linear > > Loops vectorized: > 5671 > Loops not vectorized: > 13655 > > Timings: > 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK 12.5648 SECONDS > 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK 2444.1208 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 259.3402 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 12.4728 SECONDS > 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK 307.8672 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 260.0323 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 12.8608 SECONDS > 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK 2310.2485 SECONDS > 20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK 411.3977 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 261.1283 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 12.7248 SECONDS > 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK 308.1313 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 262.7564 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 12.6528 SECONDS > 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK 2336.5620 SECONDS > 20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK 410.6577 SECONDS