[Bug target/96789] x264: sub4x4_dct() improves when vectorization is disabled

2020-08-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96789 --- Comment #3 from Kewen Lin --- Bisection shows it started to fail from r11-205.

[Bug target/96789] x264: sub4x4_dct() improves when vectorization is disabled

2020-08-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96789 --- Comment #2 from Kewen Lin --- Created attachment 49124 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=49124&action=edit sub4x4_dct SLP dumping

[Bug tree-optimization/96789] New: x264: sub4x4_dct() improves when vectorization is disabled

2020-08-25 Thread linkw at gcc dot gnu.org
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: linkw at gcc dot gnu.org Target Milestone: --- One of my workmates found that if we disable vectorization for SPEC2017 525.x264_r function sub4x4_dct in source file x264_src

[Bug testsuite/94077] gcc.dg/gomp/pr82374.c fails on power 7

2020-08-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94077 Kewen Lin changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/94077] gcc.dg/gomp/pr82374.c fails on power 7

2020-08-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94077 --- Comment #6 from Kewen Lin --- (In reply to Jakub Jelinek from comment #5) > I mean -fno-common, sorry. Good idea, that works! I'll send a patch by adding -fno-common into dg-options. Thanks for your suggestion!

[Bug target/94077] gcc.dg/gomp/pr82374.c fails on power 7

2020-08-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94077 --- Comment #3 from Kewen Lin --- > > I can observe this case fail if with explicit -fcommon. I mean even with gcc10 or trunk.

[Bug target/94077] gcc.dg/gomp/pr82374.c fails on power 7

2020-08-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94077 --- Comment #2 from Kewen Lin --- To be more specific, alignment forcing becomes available because of the default -fcommon setting: we made -fno-common the default from GCC10, which makes decl_binds_to_current_def_p return true. I can
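
A minimal C sketch of the kind of declaration the -fcommon discussion above concerns. It is not taken from pr82374.c: the array name `samples`, the size `N`, and the function `sum_samples` are hypothetical. An uninitialized file-scope array is a tentative definition; with -fcommon GCC may emit it as a common symbol, whereas with -fno-common (the default since GCC 10) it is an ordinary definition in this translation unit, which, per the comment above, is the case where decl_binds_to_current_def_p returns true.

```c
#include <stddef.h>

#define N 1024

/* Tentative definition: no initializer, so under -fcommon it may be
   emitted as a common symbol instead of an ordinary definition.
   (Hypothetical name, for illustration only.) */
double samples[N];

/* A simple reduction loop over the global array, the sort of loop the
   vectorizer would consider. */
double sum_samples(void)
{
    double total = 0.0;
    for (size_t i = 0; i < N; i++)
        total += samples[i];
    return total;
}
```

Compiling this with and without -fcommon (for example at -O3 with -fdump-tree-vect-details) would be one way to compare the vectorizer's alignment decisions; the dump contents depend on the target and GCC version, so none are shown here.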

[Bug target/94077] gcc.dg/gomp/pr82374.c fails on power 7

2020-08-12 Thread linkw at gcc dot gnu.org
|1 CC||linkw at gcc dot gnu.org Status|UNCONFIRMED |ASSIGNED --- Comment #1 from Kewen Lin --- This issue only exists on gcc8 and gcc9, it's gone with gcc10 and trunk. The main difference is listed below

[Bug tree-optimization/96451] [11 Regression] gcc.dg/pr68766.c ICE since r11-2453

2020-08-06 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96451 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/96451] [11 Regression] gcc.dg/pr68766.c ICE since r11-2453

2020-08-05 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96451 --- Comment #6 from Kewen Lin --- (In reply to rguent...@suse.de from comment #4) > On Wed, 5 Aug 2020, linkw at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96451 > > > > --- C

[Bug tree-optimization/96451] [11 Regression] gcc.dg/pr68766.c ICE since r11-2453

2020-08-05 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96451 --- Comment #5 from Kewen Lin --- Created attachment 49000 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=49000&action=edit untested patch Just noticed the dbgcnt supports several intervals, if we want to count epilogue loop, we probably need to

[Bug tree-optimization/96451] [11 Regression] gcc.dg/pr68766.c ICE since r11-2453

2020-08-04 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96451 --- Comment #3 from Kewen Lin --- (In reply to Richard Biener from comment #2) > possibly a latent issue since the patch is supposed to be cost-only Yes, this case will hit ICE too with -fno-vect-cost-model even without the culprit commit.

[Bug tree-optimization/96451] [11 Regression] gcc.dg/pr68766.c ICE since r11-2453

2020-08-04 Thread linkw at gcc dot gnu.org
gnu.org |linkw at gcc dot gnu.org Last reconfirmed||2020-08-04 Status|UNCONFIRMED |ASSIGNED --- Comment #1 from Kewen Lin --- Thanks for reporting! I will have a look at it.

[Bug tree-optimization/94043] [9 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-04-17 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug testsuite/94079] gfortran.dg/vect/pr83232.f90 fails on power 7

2020-04-08 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94079 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|UNCONFIRMED

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2020-04-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 94443, which changed state. Bug 94443 Summary: [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 --- Comment #15 from Kewen Lin --- *** Bug 94451 has been marked as a duplicate of this bug. ***

[Bug tree-optimization/94451] [10 Regression] April 1st 2020 GCC does not compile spec 2017 gcc_r benchmark with -O3

2020-04-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94451 Kewen Lin changed: What|Removed |Added Resolution|FIXED |DUPLICATE --- Comment #7 from Kewen Lin

[Bug tree-optimization/94401] [10 Regression] pr92420.c fails on aarch64 since r10-7415

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94401 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 --- Comment #13 from Kewen Lin --- (In reply to Khem Raj from comment #11) > this patch seems to be causing gcc ICE on ARM when compiling lz4 sources in > kernel, lz4, vlc almost identical ICE is seen > > attached is the test case please

[Bug tree-optimization/94456] ICE in aarch64/sve/pr87815.c since r10-7491

2020-04-02 Thread linkw at gcc dot gnu.org
||linkw at gcc dot gnu.org Resolution|--- |DUPLICATE --- Comment #1 from Kewen Lin --- Thanks for reporting; judging by the symptom, this should be a duplicate. *** This bug has been marked as a duplicate of bug 94443 ***

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 Kewen Lin changed: What|Removed |Added CC||clyon at gcc dot gnu.org --- Comment #10

[Bug tree-optimization/94451] [10 Regression] April 1st 2020 GCC does not compile spec 2017 gcc_r benchmark with -O3

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94451 Kewen Lin changed: What|Removed |Added Resolution|DUPLICATE |FIXED --- Comment #6 from Kewen Lin ---

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 --- Comment #8 from Kewen Lin --- > > > + remove_phi_node (, false); > > I prefer to have the PHI removed before you re-use its LHS. > Oops, missed this, will move it back when posting to email list.

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 --- Comment #7 from Kewen Lin --- Yes, thanks Richi! I had the same update locally but didn't update here. The latest whole patch is diff --git a/gcc/testsuite/gcc.dg/vect/pr94443.c b/gcc/testsuite/gcc.dg/vect/pr94443.c new file mode 100644

[Bug middle-end/94449] [10 Regression] FAIL: gcc.c-torture/execute/pr92904.c gcc.dg/torture/pr48731.c

2020-04-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94449 --- Comment #12 from Kewen Lin --- Sorry, correction: corei7-avx is from system gcc. With my built gcc, it's sandybridge. But no difference for the pass/fail result.

[Bug middle-end/94449] [10 Regression] FAIL: gcc.c-torture/execute/pr92904.c gcc.dg/torture/pr48731.c

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94449 Kewen Lin changed: What|Removed |Added Resolution|--- |DUPLICATE Status|ASSIGNED

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 Kewen Lin changed: What|Removed |Added CC||hjl.tools at gmail dot com --- Comment #5

[Bug middle-end/94449] [10 Regression] FAIL: gcc.c-torture/execute/pr92904.c gcc.dg/torture/pr48731.c

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94449 --- Comment #10 from Kewen Lin --- (In reply to H.J. Lu from comment #9) > (In reply to Kewen Lin from comment #8) > > May I ask for the configuration option? > > > > I used x86_64 machine in CFarm with cpuinfo > > > > I used > >

[Bug middle-end/94449] [10 Regression] FAIL: gcc.c-torture/execute/pr92904.c gcc.dg/torture/pr48731.c

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94449 Kewen Lin changed: What|Removed |Added Status|NEW |ASSIGNED --- Comment #8 from Kewen Lin ---

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94443 --- Comment #4 from Kewen Lin --- This case has one conversion insn generated after bit_field_ref; the patch mistakenly used gsi_insert_before instead of gsi_insert_seq_before, so the conversion insn was missed. The

[Bug tree-optimization/94451] [10 Regression] April 1st 2020 GCC does not compile spec 2017 gcc_r benchmark with -O3

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94451 Kewen Lin changed: What|Removed |Added CC||linkw at gcc dot gnu.org Status

[Bug middle-end/94449] [10 Regression] FAIL: gcc.c-torture/execute/pr92904.c gcc.dg/torture/pr48731.c

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94449 Kewen Lin changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org --- Comment

[Bug tree-optimization/94443] [10 Regression] 510.parest_r and 526.blender_r ICE: verify_ssa failed since r10-7491-gbd0f22a8d5caea8905f38ff1fafce31c1b7d33ad

2020-04-01 Thread linkw at gcc dot gnu.org
at gcc dot gnu.org |linkw at gcc dot gnu.org --- Comment #3 from Kewen Lin --- Thanks for reporting this, confirmed.

[Bug tree-optimization/94043] [9 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #21 from Kewen Lin --- (In reply to Richard Biener from comment #20) > Re-open. It's marked as broken in GCC 9 so a backport is in order (if the > issue really reproduces there). Thanks for pointing it out. I'll backport it two

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-04-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 Kewen Lin changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/94401] [10 Regression] pr92420.c fails on aarch64 since r10-7415

2020-03-30 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94401 --- Comment #5 from Kewen Lin --- Created attachment 48150 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48150&action=edit untested patch This can fix the REG failures on aarch64.

[Bug tree-optimization/94401] [10 Regression] pr92420.c fails on aarch64 since r10-7415

2020-03-30 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94401 Kewen Lin changed: What|Removed |Added CC||segher at gcc dot gnu.org,

[Bug tree-optimization/94401] pr92420.c fails on aarch64 since r10-7415

2020-03-30 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94401 Kewen Lin changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org

[Bug tree-optimization/90332] New test case gcc.dg/vect/slp-reduc-sad-2.c in r270847 fails

2020-03-27 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90332 Kewen Lin changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 Kewen Lin changed: What|Removed |Added Attachment #48122|0 |1 is obsolete|

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #14 from Kewen Lin --- (In reply to Richard Biener from comment #13) > > + /* Find all SSA NAMEs in stmts which is defined in current loop, > create > +PHIs for them, and replace them with phi results accordingly. */

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-25 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #12 from Kewen Lin --- Created attachment 48122 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=48122&action=edit ppc64le tested patch Thanks Richi! A patch draft attached to ensure on the right track, also

[Bug testsuite/93935] [9/10 regression] gcc.dg/vect/bb-slp-over-widen-2.c fails starting with r262371 (r10-6856)

2020-03-24 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93935 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-23 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #10 from Kewen Lin --- (In reply to Richard Biener from comment #9) > OK, so it's indeed vectorizable_live_operation not paying attention to > loop-closed SSA form. > > What it should do before building the lane extract is create a

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-22 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #8 from Kewen Lin --- > It's most likely either SCEV or expand_simple_operations looking through > the single-arg PHI (which we should avoid for LC PHI nodes) Thanks Richi, I found the loop-closed PHI form was broken after we

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-20 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #6 from Kewen Lin --- (In reply to rguent...@suse.de from comment #5) > On Fri, 20 Mar 2020, linkw at gcc dot gnu.org wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 > > > > --- C

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-19 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 --- Comment #4 from Kewen Lin --- This was merely exposed by my commit; it can also be reproduced without my commit by using -fno-vect-cost-model. Some loops we have for this case: ;; Loop 1 ;; header 3, latch 10 ;; depth 1, outer 0 ;;

[Bug tree-optimization/94043] [9/10 Regression] ICE in superloop_at_depth, at cfgloop.c:78

2020-03-17 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94043 Kewen Lin changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org

[Bug testsuite/94019] [9 regression] gcc.dg/vect/vect-over-widen-17.c fails starting with g:370c2ebe8fa20e0812cd2d533d4ed38ee2d37c85, r9-1590

2020-03-16 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94019 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug testsuite/94023] [9 regression] gcc.dg/vect/slp-perm-12.c fails starting with r9-5008

2020-03-16 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94023 Kewen Lin changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug tree-optimization/90332] New test case gcc.dg/vect/slp-reduc-sad-2.c in r270847 fails

2020-03-11 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90332 Kewen Lin changed: What|Removed |Added CC||linkw at gcc dot gnu.org --- Comment #7

[Bug target/94019] [9 regression] gcc.dg/vect/vect-over-widen-17.c fails starting with g:370c2ebe8fa20e0812cd2d533d4ed38ee2d37c85, r9-1590

2020-03-03 Thread linkw at gcc dot gnu.org
||2020-03-04 CC||linkw at gcc dot gnu.org Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Kewen Lin --- It's a power7 specific test case

[Bug target/94023] [9 regression] gcc.dg/vect/slp-perm-12.c fails starting with r9-5008

2020-03-03 Thread linkw at gcc dot gnu.org
||2020-03-04 CC||linkw at gcc dot gnu.org Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Kewen Lin --- Yes, this is another one which

[Bug testsuite/93935] [9/10 regression] gcc.dg/vect/bb-slp-over-widen-2.c fails starting with r262371 (r10-6856)

2020-02-25 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93935 Kewen Lin changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Last reconfirmed|

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-02-11 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 Kewen Lin changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-02-04 Thread linkw at gcc dot gnu.org
, ||segher at gcc dot gnu.org, ||wschmidt at gcc dot gnu.org Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org --- Comment #15 from Kewen Lin --- Thanks for your comments Alan! I've updated

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-02-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 --- Comment #13 from Kewen Lin --- > “The newly generated doesn't look incorrect since some semantic changes as > below.” Sorry, typo, it should be "The newly generated insn doesn't look correct since some semantic changes as below."

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-02-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 Kewen Lin changed: What|Removed |Added CC||amodra at gcc dot gnu.org --- Comment #12

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-02-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 --- Comment #11 from Kewen Lin --- Thanks a lot to Martin for the help with reproduction. I can reproduce this ICE and have confirmed that it passes if my patch r272731 in loop-iv.c is commented out. I can also reproduce it on powerpc64

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-01-17 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 --- Comment #9 from Kewen Lin --- I can't reproduce this on either powerpc64le-linux-gnu (edabbec31e3bfc9a9757f80c8610706ed00e5a1a) or ppc64-redhat-linux (r278916). IIUC I need the powerpc-e300c3 environment header/library as sysroot for

[Bug target/91052] [10 Regression] ICE in fix_reg_equiv_init, at ira.c:2705

2020-01-17 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91052 --- Comment #8 from Kewen Lin --- Sorry, I just saw this bug was starting to fail with my commit. Thanks for @ing me! My commit is just to pass the finiteness information down to RTL phase. The loops in that case are simple and have only an exit

[Bug target/62147] missed loop counter based optimization

2020-01-01 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62147 Kewen Lin changed: What|Removed |Added CC||meissner at gcc dot gnu.org --- Comment #9

[Bug tree-optimization/84042] IVOPTS doesn't optimize int indexes on some PowerPC code starting with svn id r250482

2020-01-01 Thread linkw at gcc dot gnu.org
|--- |DUPLICATE Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org --- Comment #2 from Kewen Lin --- With the fix of PR62147, we can generate bdnz for this loop now. With -O2 -mcpu=power7 -mno-vsx: 3c: 00 00 42 60 ori r2,r2,0 40: 08 00 04 cc

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-12-10 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 --- Comment #15 from Kewen Lin --- Author: linkw Date: Tue Dec 10 12:54:21 2019 New Revision: 279166 URL: https://gcc.gnu.org/viewcvs?rev=279166&root=gcc&view=rev Log: [PATCH] Fix PR91790 by considering different first_stmt_info for realign As PR91790

[Bug target/92760] [10 regression] several vector test cases fail on power 7 after r278800

2019-12-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92760 Kewen Lin changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/92760] [10 regression] several vector test cases fail on power 7 after r278800

2019-12-03 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92760 --- Comment #2 from Kewen Lin --- Author: linkw Date: Wed Dec 4 05:10:46 2019 New Revision: 278955 URL: https://gcc.gnu.org/viewcvs?rev=278955&root=gcc&view=rev Log: [rs6000] Fix PR92760 by checking VECTOR_MEM_NONE_P instead PR92760 exposed one issue

[Bug target/92760] [10 regression] several vector test cases fail on power 7 after r278800

2019-12-02 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92760 Kewen Lin changed: What|Removed |Added CC||segher at gcc dot gnu.org --- Comment #1

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-11-28 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 --- Comment #14 from Kewen Lin --- Yes, I'd like to wait for two weeks to ensure it's safe enough and then backport to gcc9. Does it sound good?

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-27 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 Kewen Lin changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-27 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 --- Comment #13 from Kewen Lin --- Author: linkw Date: Thu Nov 28 06:34:31 2019 New Revision: 278800 URL: https://gcc.gnu.org/viewcvs?rev=278800&root=gcc&view=rev Log: [rs6000] Fix PR92566 by checking VECTOR_UNIT_NONE_P As Segher pointed out in PR92566,

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-11-27 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 --- Comment #12 from Kewen Lin --- Author: linkw Date: Wed Nov 27 09:08:20 2019 New Revision: 278760 URL: https://gcc.gnu.org/viewcvs?rev=278760&root=gcc&view=rev Log: [PATCH] Fix PR91790 by considering different first_stmt_info for realign As PR91790

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 Kewen Lin changed: What|Removed |Added Attachment #47357|0 |1 is obsolete|

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 --- Comment #10 from Kewen Lin --- Created attachment 47357 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47357&action=edit Defer vect_setup_realignment for different first_stmt_info Verified the fix on P7 BE, still bootstrapping.

[Bug tree-optimization/91790] ICE: verify_ssa failed (error: definition in block 2 follows the use)

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91790 Kewen Lin changed: What|Removed |Added Status|REOPENED|ASSIGNED --- Comment #9 from Kewen Lin ---

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 Kewen Lin changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 --- Comment #9 from Kewen Lin --- Oops, sorry, the above comment should be for PR91790.

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-26 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 --- Comment #8 from Kewen Lin --- This issue can't be reproduced with the latest trunk, but I can still reproduce it with the originally reported revision. I checked the dumps: they differ starting from inlining, the bg() doesn't have the loop of

[Bug tree-optimization/92618] [10 Regression] error: type mismatch in binary expression in reassoc since r273490

2019-11-23 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92618 --- Comment #7 from Kewen Lin --- Sorry that I didn't catch the bugzilla mails timely, thanks for fixing this!

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 --- Comment #12 from Kewen Lin --- FWIW, I did some statistics collection with regression testing on P8 machine, the #hits on early return is 516, while the other # is 1147412. So the conclusion is that early return is useless (at least for

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 Kewen Lin changed: What|Removed |Added Attachment #47325|0 |1 is obsolete|

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 --- Comment #10 from Kewen Lin --- Yes, you are right, it's fine to drop it. Since the previous code will early return if it's under (!TARGET_ALTIVEC && !TARGET_VSX), I was thinking it may be good to put an early return there. I'm fine to remove

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 Kewen Lin changed: What|Removed |Added Attachment #47306|0 |1 is obsolete|

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 --- Comment #6 from Kewen Lin --- Great! I was wondering whether there was some array mapping from mode to vector, but missed this one. Good to know we have this kind of function!

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-21 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 --- Comment #7 from Kewen Lin --- Thanks for your confirmation and notes! Yes, the realignment code doesn't take effect from Power8 on, which supports unaligned vector load/store. I'll learn the code, follow your suggestion and cook some patches

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-20 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 Kewen Lin changed: What|Removed |Added CC||rguenth at gcc dot gnu.org --- Comment #5

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-20 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 Kewen Lin changed: What|Removed |Added Status|NEW |ASSIGNED --- Comment #4 from Kewen Lin ---

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-19 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 Kewen Lin changed: What|Removed |Added Attachment #47295|0 |1 is obsolete|

[Bug target/92534] [10 regression] gcc.dg/vect/bb-slp-42.c fails after r278262

2019-11-19 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92534 Kewen Lin changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org --- Comment

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-18 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92566 --- Comment #2 from Kewen Lin --- Created attachment 47295 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47295&action=edit Guard V2DImode and V1TImode under VSX and P8VECTOR

[Bug target/92566] rs6000_preferred_simd_mode isn't very good

2019-11-18 Thread linkw at gcc dot gnu.org
||2019-11-19 CC||linkw at gcc dot gnu.org Assignee|unassigned at gcc dot gnu.org |linkw at gcc dot gnu.org Ever confirmed|0 |1 --- Comment #1 from Kewen Lin --- Currently we guard V2DImode under

[Bug testsuite/92464] [10 regression] r278033 breaks gcc.dg/vect/costmodel/ppc/costmodel-vect-76b.c

2019-11-13 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92464 Kewen Lin changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug testsuite/92464] [10 regression] r278033 breaks gcc.dg/vect/costmodel/ppc/costmodel-vect-76b.c

2019-11-13 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92464 --- Comment #5 from Kewen Lin --- Author: linkw Date: Thu Nov 14 05:57:12 2019 New Revision: 278195 URL: https://gcc.gnu.org/viewcvs?rev=278195&root=gcc&view=rev Log: [testsuite] Fix PR92464 by adjust test case loop bound The recent vectorization

[Bug testsuite/92464] [10 regression] r278033 breaks gcc.dg/vect/costmodel/ppc/costmodel-vect-76b.c

2019-11-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92464 --- Comment #4 from Kewen Lin --- By the way, if I remove the check_vect and result verification code, the vectorized version performs very slightly better than the non-vectorized version. And yes, I think it was a bit off before.

[Bug testsuite/92464] [10 regression] r278033 breaks gcc.dg/vect/costmodel/ppc/costmodel-vect-76b.c

2019-11-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92464 --- Comment #3 from Kewen Lin --- (In reply to Segher Boessenkool from comment #2) > What is the testcase testing? Whether we can properly vectorize this > code, right? And for p7 we now do it correctly, but thought it was > too expensive

[Bug testsuite/92464] [10 regression] r278033 breaks gcc.dg/vect/costmodel/ppc/costmodel-vect-76b.c

2019-11-12 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92464 Kewen Lin changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Last reconfirmed|

[Bug target/92132] new test case gcc.dg/vect/vect-cond-reduc-4.c fails with its introduction in r277067

2019-11-07 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92132 Kewen Lin changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/92132] new test case gcc.dg/vect/vect-cond-reduc-4.c fails with its introduction in r277067

2019-11-07 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92132 --- Comment #4 from Kewen Lin --- Author: linkw Date: Fri Nov 8 07:37:07 2019 New Revision: 277947 URL: https://gcc.gnu.org/viewcvs?rev=277947&root=gcc&view=rev Log: [rs6000]Fix PR92132 by adding vec_cmp and vcond_mask supports To support full

[Bug testsuite/92127] [10 regression] gcc.dg/vect/costmodel/ppc/costmodel-fast-math-vect-pr29925.c fails after r276645 on power7

2019-11-04 Thread linkw at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92127 --- Comment #6 from Kewen Lin --- Author: linkw Revision: 277704 Modified property: svn:log Modified: svn:log at Tue Nov 5 02:36:58 2019 -- --- svn:log (original) +++
