> Sebastian,
>
> Here are (attached) results for testing HIRLAM with and without
> -ftree-loop-linear.
>
> As you can see, the results are neutral:  4 loops fewer vectorized, but
> about 50 fewer recognized.
>

any chance you kept the dumps and can report which loops were not
vectorized/recognized with -ftree-loop-linear (so we could see if these
represent missed vectorization opportunities?)

thanks,
dorit

> Now I like to redo that test with -ftree-loop-distribution.  Can you
> send me a patch against the trunk (otherwise it won't be a fair
comparison).
>
> Kind regards,
>
> --
> Toon Moene - e-mail: [EMAIL PROTECTED] - phone: +31 346 214290
> Saturnushof 14, 3738 XG  Maartensdijk, The Netherlands
> At home: http://moene.indiv.nluug.nl/~toon/
> GNU Fortran's path to Fortran 2003: http://gcc.gnu.org/wiki/Fortran2003
> Baseline, no source changes:
>
> Mon Dec 10 17:45:19 UTC 2007 (revision 130746)
>
> Compilation flags:
>
> CCFLAGS := -g -O3 $(MACHINECPP) -ffast-math -fno-associative-math -
> march=native -mtune=native -ftree-vectorizer-verbose=2
> FCFLAGS := -g -O3 -fbacktrace -ffpe-trap=invalid,zero,overflow -
> ffast-math -fno-associative-math -march=native -mtune=native -ftree-
> vectorizer-verbose=2
>
> Loops vectorized:
> 5675
> Loops not vectorized:
> 13705
>
> Timings:
> 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK    12.7488 SECONDS
> 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK  2445.9609 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   259.3362 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK    12.4408 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   305.9351 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK   262.1124 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK    12.7448 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK  2323.3733 SECONDS
> 20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK   412.7058 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   264.5685 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK    12.6648 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   306.7352 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK   261.5164 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK    12.7688 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK  2325.3774 SECONDS
> 20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK   413.8739 SECONDS
>
> Baseline, no source changes, with -ftree-loop-linear:
>
> Mon Dec 10 17:45:19 UTC 2007 (revision 130746)
>
> Compilation flags:
>
> CCFLAGS := -g -O3 $(MACHINECPP) -ftree-loop-linear -ffast-math -fno-
> associative-math -march=native -mtune=native -ftree-vectorizer-verbose=2
> FCFLAGS := -g -O3 -ftree-loop-linear -fbacktrace -ffpe-trap=invalid,
> zero,overflow -ffast-math -fno-associative-math -march=native -
> mtune=native -ftree-vectorizer-verbose=2
>
> This compilation got one ICE:
>
> rttov_aitosu.f90: In function 'rttov_aitosu':
> rttov_aitosu.f90:4: error: definition in block 262 does not dominate
> use in block 134
> for SSA_NAME: pretmp.240_59 in statement:
> prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)>
> PHI argument
> pretmp.240_59
> for PHI node
> prephitmp.220_58 = PHI <pretmp.240_59(134), D.1480_1373(138)>
> rttov_aitosu.f90:4: internal compiler error: verify_ssa failed
> Please submit a full bug report,
> with preprocessed source if appropriate.
> See <http://gcc.gnu.org/bugs.html> for instructions.
>
> Worked around by compiling this file without -ftree-loop-linear
>
> Loops vectorized:
> 5671
> Loops not vectorized:
> 13655
>
> Timings:
> 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK    12.5648 SECONDS
> 20061201_00/HL_Cycle_2006120100.html: FORECAST TOOK  2444.1208 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   259.3402 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK    12.4728 SECONDS
> 20061201_06/HL_Cycle_2006120106.html: FORECAST TOOK   307.8672 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK   260.0323 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK    12.8608 SECONDS
> 20061201_12/HL_Cycle_2006120112.html: FORECAST TOOK  2310.2485 SECONDS
> 20061201_12r/HL_Cycle_2006120112r.html: FORECAST TOOK   411.3977 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   261.1283 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK    12.7248 SECONDS
> 20061201_18/HL_Cycle_2006120118.html: FORECAST TOOK   308.1313 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK   262.7564 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK    12.6528 SECONDS
> 20061202_00/HL_Cycle_2006120200.html: FORECAST TOOK  2336.5620 SECONDS
> 20061202_00r/HL_Cycle_2006120200r.html: FORECAST TOOK   410.6577 SECONDS

Reply via email to