[Bug tree-optimization/96208] New: non-power-of-2 group size can be vectorized for 2-element vectors case

2020-07-15 Thread dpochepk at gmail dot com
: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: dpochepk at gmail dot com Target Milestone: --- Created attachment 48879 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48879&action=edit

[Bug tree-optimization/94532] [10 Regression] ICE while compiling speccpu2017 blender

2020-04-09 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94532 --- Comment #4 from Dmitrij Pochepko --- Yes. It'a a diplicate of 94443

[Bug tree-optimization/94532] [10 Regression] ICE while compiling speccpu2017 blender

2020-04-09 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94532 Dmitrij Pochepko changed: What|Removed |Added Status|WAITING |RESOLVED Resolution|---

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-09 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 Dmitrij Pochepko changed: What|Removed |Added CC||dpochepk at gmail dot com

[Bug c++/94532] New: ICE while compiling speccpu2017 blender

2020-04-08 Thread dpochepk at gmail dot com
++ Assignee: unassigned at gcc dot gnu.org Reporter: dpochepk at gmail dot com Target Milestone: --- Failed with ToT revision. At least on aarch64. Passed with 27 march build version. Log: blender/source/blender/blenkernel/intern/curve.c:1063:6: error: missing definition 1063

[Bug target/93720] [10 Regression] vector creation from two parts of two vectors produces TBL rather than ins

2020-03-31 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93720 Dmitrij Pochepko changed: What|Removed |Added CC||dpochepk at gmail dot com

[Bug tree-optimization/94212] [8/9/10 Regression] Incorrect vectorization of loop with FP calculations

2020-03-18 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94212 --- Comment #8 from Dmitrij Pochepko --- (In reply to Richard Biener from comment #7) > (In reply to Dmitrij Pochepko from comment #6) > > Just checked: non-vectorized assembly for aarch64 (O2) is using fmadd and > > fmsub intensively. > > Try w

[Bug tree-optimization/94212] [8/9/10 Regression] Incorrect vectorization of loop with FP calculations

2020-03-18 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94212 --- Comment #6 from Dmitrij Pochepko --- Just checked: non-vectorized assembly for aarch64 (O2) is using fmadd and fmsub intensively.

[Bug c/94212] New: [AARCH64] [Regression] Incorrect vectorization of loop with FP calculations

2020-03-18 Thread dpochepk at gmail dot com
: normal Priority: P3 Component: c Assignee: unassigned at gcc dot gnu.org Reporter: dpochepk at gmail dot com Target Milestone: --- Created attachment 48054 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48054&action=edit example application re

[Bug target/93720] [10 Regression] vector creation from two parts of two vectors produces TBL rather than ins

2020-02-16 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93720 --- Comment #6 from Dmitrij Pochepko --- Created attachment 47851 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47851&action=edit current patch version

[Bug target/93720] [10 Regression] vector creation from two parts of two vectors produces TBL rather than ins

2020-02-14 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93720 --- Comment #2 from Dmitrij Pochepko --- I have a patch, which recognize such pattern and adds ins instructions. Example in this issue description is compiled fine and produce this assembly: : 0: 6e184420mov v0.

[Bug target/93390] New: AARCH64: FP move costs needs improvements for ThunderX2

2020-01-22 Thread dpochepk at gmail dot com
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: dpochepk at gmail dot com Target Milestone: --- Target: aarch64-thunderx2t99 Current cpu_regmove_cost for thunderx2t99 seems to be not optimal. Preliminary experiments and

[Bug rtl-optimization/92892] New: [AARCH64] TBL-based permutations can be implemented more efficiently for 2-element vectors

2019-12-10 Thread dpochepk at gmail dot com
Severity: normal Priority: P3 Component: rtl-optimization Assignee: unassigned at gcc dot gnu.org Reporter: dpochepk at gmail dot com Target Milestone: --- Current vector elements permutation implementation generates different instructions depending on

[Bug tree-optimization/90839] Detect lsb ones counting loop (final value replacement?)

2019-10-07 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90839 --- Comment #4 from Dmitrij Pochepko --- (In reply to Andrew Pinski from comment #3) > ... I haven't tracked deepsjeng data passed for logL function specifically. I only measured totals. It might be not directly related to logL code execution ti

[Bug tree-optimization/90839] Detect lsb ones counting loop (final value replacement?)

2019-10-02 Thread dpochepk at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90839 Dmitrij Pochepko changed: What|Removed |Added CC||dpochepk at gmail dot com