https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90364

--- Comment #5 from Martin Liška <marxin at gcc dot gnu.org> ---
(In reply to Richard Biener from comment #4)
> (In reply to Martin Liška from comment #3)
> > So the problem is that without a profile tree-vectorizer does a
> > vectorization in 1162 functions, while with PGO only 49 functions are
> > vectorized.
> > Can you please Richi take a look? I can provide vectorizer dump files.
> 
> optimize_loop_nest_for_speed_p returning false?
> 
> Does the train profile match the ref profile or is there a clear mismatch
> so we guess a ref hot loop as cold?

Apparently the coverage looks very close to each other:

diff -u train-report2.txt ref-report.txt
--- train-report2.txt   2019-05-09 12:07:29.499603444 +0200
+++ ref-report.txt      2019-05-09 11:50:16.526575333 +0200
@@ -4,7 +4,7 @@
 ESMF_Alarm.fppized.f90                                      : 53.26% of 92
 ESMF_BaseTime.fppized.f90                                   : 49.28% of 69
 ESMF_Calendar.fppized.f90                                   : 57.14% of 21
-ESMF_Clock.fppized.f90                                      : 55.43% of 175
+ESMF_Clock.fppized.f90                                      : 57.71% of 175
 ESMF_Stubs.fppized.f90                                      : 69.57% of 23
 ESMF_Time.fppized.f90                                       : 77.62% of 143
 ESMF_TimeInterval.fppized.f90                               : 45.56% of 169
@@ -13,7 +13,7 @@
 io_int.fppized.f90                                          : 2.14% of 515
 libmassv.fppized.f90                                        : 7.43% of 202
 Meat.fppized.f90                                            : 58.94% of 302
-mediation_integrate.fppized.f90                             : 19.40% of 701
+mediation_integrate.fppized.f90                             : 19.83% of 701
 mediation_wrfmain.fppized.f90                               : 93.66% of 2113
 module_advect_em.fppized.f90                                : 17.44% of 5172
 module_alloc_space_0.fppized.f90                            : 43.48% of 21444
@@ -33,7 +33,7 @@
 module_comm_dm_3.fppized.f90                                : 2.41% of 748
 module_comm_dm_4.fppized.f90                                : 7.31% of 1738
 module_configure.fppized.f90                                : 49.57% of 24568
-module_cu_kfeta.fppized.f90                                 : 82.90% of 1439
+module_cu_kfeta.fppized.f90                                 : 83.53% of 1439
 module_cumulus_driver.fppized.f90                           : 54.88% of 164
 module_date_time.fppized.f90                                : 6.58% of 395
 module_diag_misc.fppized.f90                                : 10.88% of 294
@@ -48,7 +48,7 @@
 module_force_scm.fppized.f90                                : 18.38% of 272
 module_integrate.fppized.f90                                : 58.67% of 75
 module_io_domain.fppized.f90                                : 12.06% of 564
-module_io.fppized.f90                                       : 19.82% of 2609
+module_io.fppized.f90                                       : 20.01% of 2609
 module_io_quilt.fppized.f90                                 : 5.37% of 149
 module_io_wrf.fppized.f90                                   : 38.46% of 13
 module_lightning_driver.fppized.f90                         : 4.27% of 117
@@ -56,7 +56,7 @@
 module_microphysics_driver.fppized.f90                      : 40.96% of 166
 module_microphysics_zero_out.fppized.f90                    : 16.33% of 49
 module_mp_radar.fppized.f90                                 : 45.28% of 265
-module_mp_wsm5.fppized.f90                                  : 88.01% of 784
+module_mp_wsm5.fppized.f90                                  : 88.52% of 784
 module_nesting.fppized.f90                                  : 32.26% of 31
 module_pbl_driver.fppized.f90                               : 33.40% of 491
 module_physics_addtendc.fppized.f90                         : 30.54% of 537
@@ -112,6 +112,6 @@
 track_driver.fppized.f90                                    : 1.49% of 335
 wrf_bdyin.fppized.f90                                       : 73.55% of 121
 wrf_ext_write_field.fppized.f90                             : 85.11% of 47
-wrf_io.fppized.f90                                          : 20.39% of 4051
+wrf_io.fppized.f90                                          : 20.69% of 4051
 wrf_timeseries.fppized.f90                                  : 3.58% of 307
 wrf_tsin.fppized.f90                                        : 34.88% of 43

Reply via email to