ICX, 9% regression on znver3

rguenther at suse dot de via Gcc-bugs Wed, 28 Apr 2021 23:57:38 -0700

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100173


--- Comment #3 from rguenther at suse dot de <rguenther at suse dot de> ---
On Thu, 29 Apr 2021, crazylht at gmail dot com wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100173
> 
> --- Comment #2 from Hongtao.liu <crazylht at gmail dot com> ---
> 
> > but yes, cselim will also sink the first store, moving it across the
> > scalar compute in the block.  I might note that ideally we'd sink
> > all the compute as well and end up with just a conditional load of
> > either pIn1->m_esState or pIn2_89->m_esState.  That might then allow
> > scheduling to recover the original performance.
> > 
> 
> I want to clasify this regression is not related to 2 sinked stores, it just
> trigger some micro-architecture bound.
> 
> Also w/o -fvect-cost-model=very-cheap, it can be 2-3x faster, the tripper 
> count
> is constant, so i wonder why very-cheap cost model doesn't vectorize this 
> loop?

Because it needs versioning for alias

[Bug tree-optimization/100173] telecom/viterb00data_1 has 16.92% regression compared O2 -ftree-vectorize -fvect-cost-model=very-cheap to O2 on CLX/ICX, 9% regression on znver3

Reply via email to