https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90364
--- Comment #5 from Martin Liška <marxin at gcc dot gnu.org> --- (In reply to Richard Biener from comment #4) > (In reply to Martin Liška from comment #3) > > So the problem is that without a profile tree-vectorizer does a > > vectorization in 1162 functions, while with PGO only 49 functions are > > vectorized. > > Can you please Richi take a look? I can provide vectorizer dump files. > > optimize_loop_nest_for_speed_p returning false? > > Does the train profile match the ref profile or is there a clear mismatch > so we guess a ref hot loop as cold? Apparently the coverage looks very close to each other: diff -u train-report2.txt ref-report.txt --- train-report2.txt 2019-05-09 12:07:29.499603444 +0200 +++ ref-report.txt 2019-05-09 11:50:16.526575333 +0200 @@ -4,7 +4,7 @@ ESMF_Alarm.fppized.f90 : 53.26% of 92 ESMF_BaseTime.fppized.f90 : 49.28% of 69 ESMF_Calendar.fppized.f90 : 57.14% of 21 -ESMF_Clock.fppized.f90 : 55.43% of 175 +ESMF_Clock.fppized.f90 : 57.71% of 175 ESMF_Stubs.fppized.f90 : 69.57% of 23 ESMF_Time.fppized.f90 : 77.62% of 143 ESMF_TimeInterval.fppized.f90 : 45.56% of 169 @@ -13,7 +13,7 @@ io_int.fppized.f90 : 2.14% of 515 libmassv.fppized.f90 : 7.43% of 202 Meat.fppized.f90 : 58.94% of 302 -mediation_integrate.fppized.f90 : 19.40% of 701 +mediation_integrate.fppized.f90 : 19.83% of 701 mediation_wrfmain.fppized.f90 : 93.66% of 2113 module_advect_em.fppized.f90 : 17.44% of 5172 module_alloc_space_0.fppized.f90 : 43.48% of 21444 @@ -33,7 +33,7 @@ module_comm_dm_3.fppized.f90 : 2.41% of 748 module_comm_dm_4.fppized.f90 : 7.31% of 1738 module_configure.fppized.f90 : 49.57% of 24568 -module_cu_kfeta.fppized.f90 : 82.90% of 1439 +module_cu_kfeta.fppized.f90 : 83.53% of 1439 module_cumulus_driver.fppized.f90 : 54.88% of 164 module_date_time.fppized.f90 : 6.58% of 395 module_diag_misc.fppized.f90 : 10.88% of 294 @@ -48,7 +48,7 @@ module_force_scm.fppized.f90 : 18.38% of 272 module_integrate.fppized.f90 : 58.67% of 75 module_io_domain.fppized.f90 : 12.06% of 564 -module_io.fppized.f90 : 19.82% of 2609 +module_io.fppized.f90 : 20.01% of 2609 module_io_quilt.fppized.f90 : 5.37% of 149 module_io_wrf.fppized.f90 : 38.46% of 13 module_lightning_driver.fppized.f90 : 4.27% of 117 @@ -56,7 +56,7 @@ module_microphysics_driver.fppized.f90 : 40.96% of 166 module_microphysics_zero_out.fppized.f90 : 16.33% of 49 module_mp_radar.fppized.f90 : 45.28% of 265 -module_mp_wsm5.fppized.f90 : 88.01% of 784 +module_mp_wsm5.fppized.f90 : 88.52% of 784 module_nesting.fppized.f90 : 32.26% of 31 module_pbl_driver.fppized.f90 : 33.40% of 491 module_physics_addtendc.fppized.f90 : 30.54% of 537 @@ -112,6 +112,6 @@ track_driver.fppized.f90 : 1.49% of 335 wrf_bdyin.fppized.f90 : 73.55% of 121 wrf_ext_write_field.fppized.f90 : 85.11% of 47 -wrf_io.fppized.f90 : 20.39% of 4051 +wrf_io.fppized.f90 : 20.69% of 4051 wrf_timeseries.fppized.f90 : 3.58% of 307 wrf_tsin.fppized.f90 : 34.88% of 43