[Bug ipa/92077] Multiple independent functions degrades optimizations

2020-01-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92077 --- Comment #4 from Jan Hubicka --- We have --param comdat-sharing-probability, which says that the average comdat function has only a 20% chance of being shared with another copy of the same comdat in another unit. This was introduced because of Firefox develope
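The 20% figure in the comment above can be read as an expected-value discount on code size. The following is a minimal sketch of that arithmetic only; the function name and the rounding choice are assumptions made for illustration and are not taken from GCC's sources.

```cpp
#include <cassert>

// Hypothetical helper, not GCC's code: given the size of a comdat function
// and the percentage chance that another unit carries a copy the linker
// will merge with it, estimate how much of that size is expected to remain
// unique to this unit. The result is rounded to the nearest integer.
int expected_unshared_size(int size, int sharing_probability_percent)
{
    assert(size >= 0);
    assert(sharing_probability_percent >= 0
           && sharing_probability_percent <= 100);
    return (size * (100 - sharing_probability_percent) + 50) / 100;
}
```

With a sharing probability of 20, a function of size 100 would be counted as 80 units of unshared code under this model.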

[Bug tree-optimization/93271] [8/9/10 regression] SRA producing wrong code on denormals

2020-01-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93271 --- Comment #5 from Jan Hubicka --- Also I think it is violation of C++ memory model since we introduce load+store pair where there was none before?

[Bug tree-optimization/93271] [8/9/10 regression] SRA producing wrong code on denormals

2020-01-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93271 Jan Hubicka changed: What|Removed |Added Keywords|wrong-code | Target|i?86-linux-gnu

[Bug tree-optimization/93271] SRA producing wrong code on denormals

2020-01-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93271 --- Comment #2 from Jan Hubicka --- Fails at least for gcc 4.9+, but it must be regression compared to pre-tree-ssa GCCs (which I don't have installed :)

[Bug tree-optimization/93271] New: SRA producing wrong code on denormals

2020-01-15 Thread hubicka at gcc dot gnu.org
-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- hubicka@lomikamen-jh:~$ cat t2.c #include union test {int a; float b;}; __attribute__ ((noinline)) union test set() { union test r; r.a = 0x7f842335; return r
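The testcase above stores the integer 0x7f842335 into a union that also has a float member. As a reading aid, here is a small sketch that splits a 32-bit pattern into its IEEE 754 binary32 fields; the type and function names are hypothetical. It only decodes bits and says nothing about how a particular target treats the value when it passes through a floating-point register.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper type: the three fields of an IEEE 754 binary32 value.
struct Binary32Fields {
    std::uint32_t sign;      // 1 bit
    std::uint32_t exponent;  // 8 bits, biased
    std::uint32_t fraction;  // 23 bits
};

// Split a raw 32-bit pattern into sign, exponent and fraction.
Binary32Fields decode_binary32(std::uint32_t bits)
{
    Binary32Fields f = { bits >> 31, (bits >> 23) & 0xffu, bits & 0x7fffffu };
    // Sanity check: the fields must recombine into the original pattern.
    assert(((f.sign << 31) | (f.exponent << 23) | f.fraction) == bits);
    return f;
}

// Exponent all ones with a non-zero fraction encodes a NaN.
bool is_nan_pattern(std::uint32_t bits)
{
    Binary32Fields f = decode_binary32(bits);
    return f.exponent == 0xffu && f.fraction != 0;
}

// Exponent zero with a non-zero fraction encodes a denormal (subnormal).
bool is_denormal_pattern(std::uint32_t bits)
{
    Binary32Fields f = decode_binary32(bits);
    return f.exponent == 0 && f.fraction != 0;
}
```

Both kinds of pattern are ones whose bits a plain integer copy preserves exactly, which is why a copy routed through a float access is worth checking separately.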

[Bug lto/91576] [10 Regression] error: invalid conversion in gimple call since r272749

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91576 Jan Hubicka changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug lto/92600] [9/10 Regression] ICE: symtab_node::verify failed, building 523.xalancbmk_r with -flto -fno-inline since r267359

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92600 --- Comment #6 from Jan Hubicka --- Well, because the source files differ, the comdat groups differ and the losing one has fewer symbols in it. So we end up keeping some symbols from the other comdat group that happens to have the same name. The

[Bug lto/89358] [8 Regression] Combining -std=c++14 and -std=c++17 objects gives ODR warnings

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89358 --- Comment #19 from Jan Hubicka --- I think backporting would be a good idea :) If you beat me on it even better. Now I need to set up my trees in git...

[Bug ipa/93144] [10 Regression] 459.GemsFDTD debug info size increase by 50% since r279563

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93144 --- Comment #5 from Jan Hubicka --- Well, the problem was debug info getting bigger due to more inlining? I guess in that case we could close it. That patch is expected to allow more inlines.

[Bug ipa/92749] [10 Regression] warning: inlining failed in call to ‘salsa20’: --param max-inline-insns-single limit reached after r276516

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92749 --- Comment #2 from Jan Hubicka --- This is intentional, we got less aggressive at inlining inline functions for -O2 (since we do not need to do all inlining we want for -O3 when we have independent set of attributes). Indeed -Winline -Werror is

[Bug ipa/92240] [10 regression] ICE in duplicate, at ipa-prop.c:3883

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92240 Jan Hubicka changed: What|Removed |Added CC||mjambor at suse dot cz --- Comment #5 from

[Bug lto/88081] [8/9/10 Regression] ICE in lto_varpool_replace_node, at lto/lto-symtab.c:109

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88081 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug lto/91576] [10 Regression] error: invalid conversion in gimple call since r272749

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91576 --- Comment #18 from Jan Hubicka --- OK, the other testcases do not reproduce for me. However, if they do, it seems like fallout from the change dropping type checking from call statements.

[Bug lto/91576] [10 Regression] error: invalid conversion in gimple call since r272749

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91576 --- Comment #17 from Jan Hubicka --- Created attachment 47651 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47651&action=edit proposed patch This is patch I plan to test which fixes the last testcase. It adds warning about TREE_ADDRESSAB

[Bug lto/91576] [10 Regression] error: invalid conversion in gimple call since r272749

2020-01-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91576 --- Comment #16 from Jan Hubicka --- OK, I get an ICE because the type is not compatible with its main variant. The two types are: constant 384> unit-size constant 48> align:64 warn_if_not_align:0 symtab:0 alias-set -1 canonical-type 0x7f

[Bug tree-optimization/93258] New: [10 regression] Missed constant folding from constructor

2020-01-14 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- In the testcase (reduced from folly by Mark Williams): typedef int a; template struct d { static constexpr b e = c; }; template

[Bug c++/93169] [10 regression] Variable incorrectly put into readonly section.

2020-01-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93169 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug c++/93169] New: [10 regression] Variable incorrectly put into readonly section.

2020-01-06 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- This is testcase by Mark Williams who also bisected it template class b { struct c { a h; constexpr c() {} ~c() {} } d

[Bug ipa/93115] gcc fails to emit inline function on llvm-roc project: -O1 -fPIC -fdevirtualize -fdevirtualize-speculatively -fipa-cp -fipa-cp-clone -fvisibility-inlines-hidden

2020-01-02 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93115 Jan Hubicka changed: What|Removed |Added CC||mjambor at suse dot cz --- Comment #5 from

[Bug ipa/93115] gcc fails to emit inline function on llvm-roc project: -O1 -fPIC -fdevirtualize -fdevirtualize-speculatively -fipa-cp -fipa-cp-clone -fvisibility-inlines-hidden

2020-01-02 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93115 --- Comment #4 from Jan Hubicka --- The problem here is that we produce an ipa-cp clone to devirtualize v::av, which also leads to devirtualization of m::av, but we miss this optimization. After inlining we remove m::av and while producing the ipa-cp

[Bug lto/93117] New: -g -flto -fdebug-types-section is broken for units with over 64k types

2020-01-01 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: lto Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org CC: marxin at gcc dot gnu.org Target Milestone: --- jan@skylake:~> more t.c #define def(name) struct name {int name;} name; #define d

[Bug tree-optimization/93055] accumulation loops in stepanov_vector benchmark use more instruction level parallelism

2019-12-29 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93055 --- Comment #5 from Jan Hubicka --- Created attachment 47563 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47563&action=edit fixed testcase I have verified that building with g++ -O3 -march=bdver1 -fno-prefetch-loop-arrays ~/stepanov_ve

[Bug tree-optimization/93084] Infinite loop in ipa-cp when building clang with LTO+PGO

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93084 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug tree-optimization/93084] Infinite loop in ipa-cp when building clang with LTO+PGO

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93084 Jan Hubicka changed: What|Removed |Added CC||fxue at os dot amperecomputing.com --- Co

[Bug tree-optimization/93084] Infinite loop in ipa-cp when building clang with LTO+PGO

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93084 Jan Hubicka changed: What|Removed |Added CC||mjambor at suse dot cz --- Comment #1 from

[Bug tree-optimization/93084] New: Infinite loop in ipa-cp when building clang with LTO+PGO

2019-12-27 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- I get infinite loop in (gdb) bt #0 0x012b0e4e in values_equal_for_ipcp_p (y=0x7fffaf8496c0, x=0x7fffafc26e80) at ../../gcc

[Bug c++/93033] [10 Regression] error: incorrect sharing of tree nodes

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93033 --- Comment #7 from Jan Hubicka --- This patch fixes the testcase, but I am not familiar enough with the code to say if that is correct fix :) Index: ../../gcc/cp/cp-gimplify.c ===

[Bug tree-optimization/93079] ICE in mark_operand_necessary

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93079 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug c++/93033] [10 Regression] error: incorrect sharing of tree nodes

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93033 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org --- Comment #6

[Bug tree-optimization/93079] ICE in mark_operand_necessary

2019-12-27 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93079 --- Comment #1 from Jan Hubicka --- We rewrite VAR_DECL to invalid tree code by gimple_set_plf (gdb) p ((tree)0x7fffeccfba20).base.code $22 = VAR_DECL (gdb) c Continuing. Hardware watchpoint 1: *$13 Old value = 17039395 New value = 17041443 gim

[Bug tree-optimization/93079] New: ICE in mark_operand_necessary

2019-12-27 Thread hubicka at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Created attachment 47553 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47553&action=edit unreduced testcase $ ./xgcc -B ./ -O2 ~/aa.ii In file included from ../../

[Bug tree-optimization/93055] accumulation loops in stepanov_vector benchmark use more instruction level parallelism

2019-12-24 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93055 --- Comment #3 from Jan Hubicka --- Created attachment 47546 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47546&action=edit preprocessed benchmark I am attaching preprocessed source. I used -O3 -march=native -fno-prefetch-loop-arrays (

[Bug lto/93057] New: -flto=64 -o "t#o" leads to an error

2019-12-23 Thread hubicka at gcc dot gnu.org
onent: lto Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org CC: marxin at gcc dot gnu.org Target Milestone: --- When generating the internal makefile in lto-wrapper we need to avoid or escape characters in filenames which are special for
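The report above concerns output names such as "t#o" that end up in a generated makefile. A minimal sketch of one possible escaping step follows, assuming GNU make syntax, where `#` starts a comment unless written as `\#` and a literal `$` is written as `$$`. The function name is hypothetical, this is not what lto-wrapper does, and other characters (spaces, for instance) may need separate treatment.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper, not lto-wrapper's code: escape the characters '#'
// and '$' so that a file name can be written into a GNU makefile without
// starting a comment or a variable reference.
std::string escape_for_makefile(const std::string &name)
{
    std::string out;
    out.reserve(name.size());
    for (char c : name) {
        if (c == '#')
            out += "\\#";   // backslash keeps '#' from starting a comment
        else if (c == '$')
            out += "$$";    // '$$' stands for a literal dollar sign
        else
            out += c;
    }
    return out;
}
```

For the name in the bug title, the helper turns `t#o` into `t\#o`; names without special characters pass through unchanged.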

[Bug tree-optimization/93056] New: Poor codegen for heapsort in stepanov_vector benchmark

2019-12-23 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Created attachment 47543 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47543&action=edit preprocessed benchmark wit

[Bug tree-optimization/93055] New: accumulation loops in stepanov_vector benchmark use more instruction level parallelism

2019-12-23 Thread hubicka at gcc dot gnu.org
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- stepanov_vector benchmark from https://gitlab.com/chriscox/CppPerformanceBenchmarks gets poor codegen

[Bug tree-optimization/92924] [10 regression] reproducible indirect call profile merging causes 80% slowdown in Firefox pref-reftest-singletons id-getter microbenchmarks

2019-12-13 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92924 --- Comment #4 from Jan Hubicka --- Looking into how getter variable is determined: vp_35 is function parameter _124 = MEM[(const struct Value *)vp_35(D)].asBits_; _125 = _124 ^ 18446181123756130304; _126 = (struct JSObject *) _125 ... _50 = MEM

[Bug tree-optimization/92924] [10 regression] reproducible indirect call profile merging causes 80% slowdown in Firefox pref-reftest-singletons id-getter microbenchmarks

2019-12-13 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92924 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug tree-optimization/92924] [10 regression] reproducible indirect call profile merging causes 80% slowdown in Firefox pref-reftest-singletons id-getter microbenchmarks

2019-12-13 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92924 --- Comment #2 from Jan Hubicka --- Increasing number of entries does not seem to help: Indirect call counterall: 140960933, values: [429856732:-1], [484692916:1218], [1203869319:12593], [245854587:8179], [1829590552:52], [401302964:7072], [83957

[Bug rtl-optimization/92925] New: RTL expansion throws away alignment info

2019-12-12 Thread hubicka at gcc dot gnu.org
-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Hi, this testcase originally started as attempt to produce self contained reproducer for ipa-cp bug. Problem is that RTL expansion is too limited and refuses to produce

[Bug bootstrap/92653] [10 Regression] PGO bootstrap is broken with --with-build-config=bootstrap-lto-lean

2019-12-12 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92653 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/92924] [10 regression] reproducible indirect call profile merging causes 80% slowdown in Firefox pref-reftest-singletons id-getter microbenchmarks

2019-12-12 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92924 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org

[Bug tree-optimization/92924] New: [10 regression] reproducible indirect call profile merging causes 80% slowdown in Firefox pref-reftest-singletons id-getter microbenchmarks

2019-12-12 Thread hubicka at gcc dot gnu.org
: gcc Version: 10.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- During the train run, in

[Bug tree-optimization/92860] [8/9/10 regression] Global flags affected by -O settings are clobbered by optimize attribute

2019-12-08 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92860 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |NEW Last reconfirmed|

[Bug tree-optimization/92860] [8,9,10 regression] Global flags affected by -O settings are clobbered by optimize attribute

2019-12-08 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92860 --- Comment #1 from Jan Hubicka --- Author: hubicka Date: Sun Dec 8 13:50:32 2019 New Revision: 279089 URL: https://gcc.gnu.org/viewcvs?rev=279089&root=gcc&view=rev Log: PR tree-optimization/92860 * common.opt (fprofile-reorder-

[Bug tree-optimization/92860] New: [8,9,10 regression] Global flags affected by -O settings are clobbered by optimize attribute

2019-12-08 Thread hubicka at gcc dot gnu.org
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Hi, the following testcase: void linker_error(); __attribute__ ((optimize("-O0"

[Bug c++/92831] CWG1299 extend_ref_init_temps_1 punts on COND_EXPRs

2019-12-07 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92831 Jan Hubicka changed: What|Removed |Added CC||hubicka at gcc dot gnu.org --- Comment #7

[Bug tree-optimization/92834] missed SLP vectorization in LightPixel

2019-12-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92834 --- Comment #2 from Jan Hubicka --- Created attachment 47436 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47436&action=edit Clang assembly from perf It is clang9 build https://treeherder.mozilla.org/#/jobs?repo=try&revision=7d7ee02817ab1

[Bug ipa/92809] [10 regression] error: calls_comdat_local is set outside of a comdat group

2019-12-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92809 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug tree-optimization/92825] Unnecessary stack protection in Firefox's LightPixel.

2019-12-05 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92825 Jan Hubicka changed: What|Removed |Added Summary|Unnecesary stack protection |Unnecesary stack protection

[Bug tree-optimization/92834] New: missed SLP vectorization in LightPixel

2019-12-05 Thread hubicka at gcc dot gnu.org
-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Created attachment 47431 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47431&action=edit simplified testcase Clang is able to vectorize LightPixel which l

[Bug tree-optimization/92825] New: Unnecessary stack protection and missed SLP vectorization in Firefox's LightPixel.

2019-12-05 Thread hubicka at gcc dot gnu.org
IRMED Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Created attachment 47428 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47428&acti

[Bug tree-optimization/92740] induct2 (from polyhedron) regresses 267% with -O2 -ftree-vectorize -ftree-slp-vectorize -fvect-cost-model=dynamic compared to -O2

2019-11-30 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92740 --- Comment #1 from Jan Hubicka --- There is also a 75% regression on fft2 and 5% on rnflow2. Induct2 reproduces on Kaby Lake; fft2 and rnflow seem Zen-specific.

[Bug tree-optimization/92740] New: induct2 (from polyhedron) regresses 267% with -O2 -ftree-vectorize -ftree-slp-vectorize -fvect-cost-model=dynamic compared to -O2

2019-11-30 Thread hubicka at gcc dot gnu.org
Version: 10.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- This is on zen2 hardware.

[Bug tree-optimization/92738] [10 regression] Large code size growth for -O2 binaries between 2019-05-19...2019-05-29

2019-11-30 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92738 --- Comment #2 from Jan Hubicka --- https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=10.542.4&highlight_run=7354 shows shorter range +2019-05-24 Jakub Jelinek + + * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE__CONDTEMP_. +

[Bug tree-optimization/92738] [10 regression] Large code size growth for -O2 binaries between 2019-05-19...2019-05-29

2019-11-30 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92738 --- Comment #1 from Jan Hubicka --- This is seen on https://lnt.opensuse.org/db_default/v4/SPEC/graph?highlight_run=7361&plot.574=31.574.4

[Bug tree-optimization/92738] New: [10 regression] Large code size growth for -O2 binaries between 2019-05-19...2019-05-29

2019-11-30 Thread hubicka at gcc dot gnu.org
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: ---

[Bug ipa/92737] New: cgraph_node and varpool_node needs explicit constructor

2019-11-30 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: ipa Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org CC: marxin at gcc dot gnu.org Target Milestone: --- cgraph_node and varpool_node are non-pods, but still allocated via alloc_cleared and we rely on
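The report above notes that non-POD node types are allocated from cleared memory. A minimal sketch of the alternative named in the bug title, an explicit constructor, is shown below. The `node` type and helper functions are hypothetical stand-ins rather than GCC's cgraph_node or its allocator, and the storage is deliberately filled with non-zero bytes to show that the object does not depend on pre-cleared memory.

```cpp
#include <cassert>
#include <cstdlib>
#include <cstring>
#include <new>

// Hypothetical stand-in for a symbol-table node; not GCC's cgraph_node.
// The virtual destructor makes the type non-POD.
struct node {
    int order;
    node *next;
    bool analyzed;

    // Every member is initialized explicitly, so a freshly constructed
    // node is well defined whatever the underlying storage contained.
    node() : order(0), next(nullptr), analyzed(false) {}
    virtual ~node() {}
};

// Allocate raw storage, fill it with a non-zero byte pattern, and then
// construct the node in place with placement new.
node *create_node()
{
    void *raw = std::malloc(sizeof(node));
    if (!raw)
        return nullptr;
    std::memset(raw, 0xff, sizeof(node));
    return new (raw) node();
}

// Run the destructor explicitly and release the raw storage.
void destroy_node(node *n)
{
    if (!n)
        return;
    n->~node();
    std::free(n);
}
```

A caller would pair `create_node()` with `destroy_node()`; the members read as 0, `nullptr` and `false` even though the storage was not zeroed.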

[Bug tree-optimization/92711] GCC 10 libxul.so -fprofile-generate binary is 360MB while clang needs only 163MB.

2019-11-28 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92711 --- Comment #3 from Jan Hubicka --- Proper GCC 9 -fprofile-generate build is 296MB https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/aMGsffWPQ1qzjgj4LIqcwQ/runs/0/artifacts/public/build/target.tar.bz2 So about 5% regression compared to

[Bug tree-optimization/92711] GCC 10 libxul.so -fprofile-generate binary is 360MB while clang needs only 163MB.

2019-11-28 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92711 Jan Hubicka changed: What|Removed |Added CC||mliska at suse dot cz Blocks|

[Bug tree-optimization/92711] GCC 10 libxul.so -fprofile-generate binary is 360MB while clang needs only 163MB.

2019-11-28 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92711 --- Comment #1 from Jan Hubicka --- https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/ObkoHsHHSriQdU0Twc12Wg/runs/0/artifacts/public/build/target.tar.bz2 This is GCC9 build. 310MB, so still a lot bigger than clang, but better than gcc1

[Bug tree-optimization/92711] New: GCC 10 libxul.so -fprofile-generate binary is 360MB while clang needs only 163MB.

2019-11-28 Thread hubicka at gcc dot gnu.org
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- It seems that profiling became more expensive in GCC10 compared to clang or previous GCC releases. Clang

[Bug other/92681] New: PGO bootstrap is broken with --with-build-config=bootstrap-lto-lean is not training non-C++ frontends

2019-11-26 Thread hubicka at gcc dot gnu.org
Status: UNCONFIRMED Severity: normal Priority: P3 Component: other Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- This definitely leads to suboptimal compile time experience with Ada, Fortran

[Bug bootstrap/92680] New: PGO bootstrap is broken with --with-build-config=bootstrap-lto-lean and in-tree mpfr

2019-11-26 Thread hubicka at gcc dot gnu.org
: UNCONFIRMED Severity: normal Priority: P3 Component: bootstrap Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Build with bootstrap-lto-lean with in-tree mpfr fails in profile mismatch on set_d.o. This

[Bug tree-optimization/92645] Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-24 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92645 --- Comment #4 from Jan Hubicka --- Created attachment 47343 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47343&action=edit GCC 10 output

[Bug tree-optimization/92645] Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-24 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92645 --- Comment #2 from Jan Hubicka --- Created attachment 47341 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47341&action=edit clang output with -O2 -mavx2 -mf16c -mfma

[Bug tree-optimization/92645] Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-24 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92645 --- Comment #3 from Jan Hubicka --- Created attachment 47342 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47342&action=edit GCC source

[Bug tree-optimization/92645] Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-24 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92645 --- Comment #1 from Jan Hubicka --- Created attachment 47340 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47340&action=edit Clang source

[Bug tree-optimization/92645] New: Hand written vector code is 450 times slower when compiled with GCC compared to Clang

2019-11-24 Thread hubicka at gcc dot gnu.org
Severity: normal Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- Hi, the attached are preprocessed files for Skia where the Clang ifdefs were removed so we get roughly the same

[Bug tree-optimization/92632] New: Calculix regression

2019-11-22 Thread hubicka at gcc dot gnu.org
Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org Target Milestone: --- LNT testing shows a 137% regression of calculix with LTO and PGO https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=288.170.0 The range is between Revision: fbbadf0018292a93 (2019-11-15

[Bug ipa/60243] IPA is slow on large cgraph tree

2019-11-21 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=60243 --- Comment #27 from Jan Hubicka --- The profile_estimate issue is still here; the inliner and early inliner issues seem solved. It seems that ipa_profile just orders the nodes for propagation in the wrong way - we propagate from callers to callees while topos

[Bug ipa/44563] GCC uses a lot of RAM when compiling a large numbers of functions

2019-11-21 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=44563 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|NEW Assignee|hubicka at gcc dot

[Bug c++/55135] Segfault of gcc on a big file

2019-11-21 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=55135 --- Comment #30 from Jan Hubicka --- Reconfirmed that we still take ages to build the testcase (early inliner is still running for me) The early inliner issue here is caused by tree-inline removing individual clones one by one. Each time a clon

[Bug ipa/92476] [10 regression] SEGV in cgraph_edge_brings_value_p

2019-11-18 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92476 Jan Hubicka changed: What|Removed |Added Assignee|hubicka at gcc dot gnu.org |mjambor at suse dot cz --- Comment

[Bug ipa/92508] [10 Regression] ICE in do_estimate_edge_time, at ipa-inline-analysis.c:223 since r278159

2019-11-18 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92508 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug ipa/92508] [10 Regression] ICE in do_estimate_edge_time, at ipa-inline-analysis.c:223 since r278159

2019-11-18 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92508 --- Comment #15 from Jan Hubicka --- Author: hubicka Date: Mon Nov 18 19:28:53 2019 New Revision: 278419 URL: https://gcc.gnu.org/viewcvs?rev=278419&root=gcc&view=rev Log: PR ipa/92508 * ipa-inline.c (inline_small_functions): Ad

[Bug ipa/92535] [10 regression] ICF is relatively expensive and became less effective

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92535 --- Comment #4 from Jan Hubicka --- Forgot bloaty report for GCC9 and disabling ICF $ bloaty libxul.so -- libxul.so.old VM SIZE FILE SIZE ++ GROWING++ +2.3% +1.87Mi .text

[Bug ipa/92535] [10 regression] ICF is relatively expensive and became less effective

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92535 --- Comment #2 from Jan Hubicka --- Created attachment 47276 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47276&action=edit Memory use of gcc9

[Bug ipa/92535] [10 regression] ICF is relatively expensive and became less effective

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92535 --- Comment #3 from Jan Hubicka --- Created attachment 47277 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47277&action=edit Memory use of gcc9 with ICF disabled

[Bug ipa/92535] [10 regression] ICF is relatively expensive and became less effective

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92535 --- Comment #1 from Jan Hubicka --- Created attachment 47275 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47275&action=edit memory use of GCC10 with icf disabled

[Bug ipa/92535] New: [10 regression] ICF is relatively expensive and became less effective

2019-11-15 Thread hubicka at gcc dot gnu.org
Priority: P3 Component: ipa Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org CC: marxin at gcc dot gnu.org Target Milestone: --- Created attachment 47274 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47

[Bug ipa/92508] [10 Regression] ICE in do_estimate_edge_time, at ipa-inline-analysis.c:223 since r278159

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92508 --- Comment #8 from Jan Hubicka --- Aha, that makes sense: for sreal it is not guaranteed that a == a * 1 / 1, and the code was inconsistent about guarding the noop scales. Thanks for tracking this down! I suppose it would also make sense to pre
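The comment above points out that multiplying and dividing by the same factor need not return the original value, so a no-op scale has to be guarded consistently. A minimal sketch of such a guard follows, using `double` as a stand-in for GCC's sreal type; the function name is hypothetical and this is not the code from the patch.

```cpp
#include <cassert>

// Hypothetical helper: scale `value` by num/den. When the scale is the
// identity (num == den) the value is returned untouched, so rounding in
// the multiply and divide steps cannot perturb it.
double apply_scale(double value, double num, double den)
{
    assert(den != 0.0);
    if (num == den)
        return value;
    return value * num / den;
}
```

Routing every scaling site through one guarded helper like this is one way to keep all callers consistent about what counts as a no-op.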

[Bug ipa/92528] [10 Regression] ICE in ipa_get_parm_lattices since r278219

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92528 Jan Hubicka changed: What|Removed |Added Assignee|hubicka at gcc dot gnu.org |fxue at os dot amperecomputing.com

[Bug lto/48200] Implement function attribute for symbol versioning (.symver)

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48200 --- Comment #40 from Jan Hubicka --- I posted initial patch here https://gcc.gnu.org/ml/gcc-patches/2019-11/msg01334.html

[Bug testsuite/92520] [10 Regression] new test case gcc/testsuite/gcc.dg/ipa/inline-9.c in r278220 is unresolved

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92520 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug testsuite/92520] [10 Regression] new test case gcc/testsuite/gcc.dg/ipa/inline-9.c in r278220 is unresolved

2019-11-15 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92520 --- Comment #1 from Jan Hubicka --- Author: hubicka Date: Fri Nov 15 08:19:16 2019 New Revision: 278279 URL: https://gcc.gnu.org/viewcvs?rev=278279&root=gcc&view=rev Log: PR testsuite/92520 * gcc.dg/ipa/inline-9.c: Fix template.

[Bug lto/48200] Implement function attribute for symbol versioning (.symver)

2019-11-14 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=48200 Jan Hubicka changed: What|Removed |Added CC||carlos at redhat dot com --- Comment #37 f

[Bug c/66825] RFE: Add attributes for symbol versioning.

2019-11-14 Thread hubicka at gcc dot gnu.org
||hubicka at gcc dot gnu.org Resolution|--- |DUPLICATE --- Comment #2 from Jan Hubicka --- We have earlier bug on this. I am going to attach WIP patch there. *** This bug has been marked as a duplicate of bug 48200 ***

[Bug ipa/92421] [10 Regression] ICE in inline_small_functions, at ipa-inline.c:2001 since r277759

2019-11-13 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92421 --- Comment #6 from Jan Hubicka --- Author: hubicka Date: Wed Nov 13 21:02:11 2019 New Revision: 278159 URL: https://gcc.gnu.org/viewcvs?rev=278159&root=gcc&view=rev Log: PR c++/92421 * ipa-prop.c (update_indirect_edges_after_in

[Bug ipa/92498] [10 regression] gcc.dg/tree-prof/crossmodule-indircall-1.c fails starting with r278100

2019-11-13 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92498 --- Comment #1 from Jan Hubicka --- Author: hubicka Date: Wed Nov 13 19:44:35 2019 New Revision: 278157 URL: https://gcc.gnu.org/viewcvs?rev=278157&root=gcc&view=rev Log: PR ipa/92498 * ipa-profile.c (check_argument_count): Do no

[Bug ipa/92471] [ICE] lto1 segmentation fault: ipa-profile.c ipa_get_cs_argument_count (args=0x0)

2019-11-12 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92471 --- Comment #5 from Jan Hubicka --- Author: hubicka Date: Tue Nov 12 19:31:04 2019 New Revision: 278100 URL: https://gcc.gnu.org/viewcvs?rev=278100&root=gcc&view=rev Log: PR ipa/92471 * ipa-profile.c (check_argument_count): Break

[Bug ipa/92471] [ICE] lto1 segmentation fault: ipa-profile.c ipa_get_cs_argument_count (args=0x0)

2019-11-12 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92471 Jan Hubicka changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug lto/92406] [10 Regression] ICE in ipa_call_summary at ipa-fnsummary.h:253 with lto and pgo

2019-11-07 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92406 --- Comment #7 from Jan Hubicka --- Author: hubicka Date: Thu Nov 7 17:08:11 2019 New Revision: 277927 URL: https://gcc.gnu.org/viewcvs?rev=277927&root=gcc&view=rev Log: PR ipa/92406 * ipa-fnsummary.c (analyze_function_body): U

[Bug lto/92406] [10 Regression] ICE in ipa_call_summary at ipa-fnsummary.h:253 with lto and pgo

2019-11-07 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92406 --- Comment #4 from Jan Hubicka --- Created attachment 47193 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=47193&action=edit Proposed patch Hi, does this patch fix the problem? Honza

[Bug ipa/92394] operand_equal_p should compare as base+offset when comparing addresses

2019-11-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92394 --- Comment #3 from Jan Hubicka --- These are the corresponding stats from gcc 9, so we definitely load a lot more bodies now 13228 false returned: 'memory operands are different' (compare_gimple_call:785) 14011 false returned: 'decl_or_type fla

[Bug ipa/92394] operand_equal_p should compare as base+offset when comparing addresses

2019-11-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92394 Jan Hubicka changed: What|Removed |Added Status|NEW |UNCONFIRMED Last reconfirmed|2019-11-06

[Bug ipa/92394] operand_equal_p should compare as base+offset when comparing addresses

2019-11-06 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92394 --- Comment #1 from Jan Hubicka --- Following testcase is mergeable: struct a {int a; int b;}; struct b {int c; short d;}; void * retadr1(struct a *a) { return &a->b; } void * retadr2(struct b *a) { return &a->d; }
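
A minimal sketch of why the testcase above is mergeable, assuming a typical ABI where `sizeof(int)` is a multiple of the alignment of `short`: both functions return the base pointer plus the same byte offset, even though the field names and types differ. The helpers `same_field_offset`, `same_address`, `check_mergeable` and the union `storage` are hypothetical names added for illustration; they are not part of the bug report or of GCC.

```c
#include <assert.h>
#include <stddef.h>

struct a { int a; int b; };
struct b { int c; short d; };

/* The two functions from the testcase. */
void *retadr1(struct a *a) { return &a->b; }
void *retadr2(struct b *a) { return &a->d; }

/* Hypothetical helper: 1 when both fields sit at the same byte offset. */
int same_field_offset(void)
{
    return offsetof(struct a, b) == offsetof(struct b, d);
}

/* Hypothetical helper: 1 when both functions yield the same address
   for the same base. A union gives both struct types a common,
   validly typed starting address. */
union storage { struct a as_a; struct b as_b; };

int same_address(void)
{
    union storage u = { { 0, 0 } };
    return retadr1(&u.as_a) == retadr2(&u.as_b);
}

/* Comparing as base+offset and comparing the returned addresses
   must always agree with each other. */
void check_mergeable(void)
{
    assert(same_field_offset() == same_address());
}
```

Calling `check_mergeable()` from a small `main` exercises both views. The point of the sketch is only that an address comparison phrased as base+offset treats `&a->b` and `&a->d` as equal here, whereas a field-by-field comparison does not.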

[Bug ipa/92394] New: operand_equal_p should compare as base+offset when comparing addresses

2019-11-06 Thread hubicka at gcc dot gnu.org
: normal Priority: P3 Component: ipa Assignee: unassigned at gcc dot gnu.org Reporter: hubicka at gcc dot gnu.org CC: marxin at gcc dot gnu.org Target Milestone: --- Compiling firefox one gets many of: false returned: '' in operand_

[Bug ipa/92254] [10 regression] ICE LTO in inline_small_functions, at ipa-inline.c:2000

2019-10-30 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92254 Jan Hubicka changed: What|Removed |Added CC||mjambor at suse dot cz --- Comment #3 from

[Bug ipa/92278] [10 regression] LTO ICE ipa_get_ith_polymorhic_call_context ipa-prop.h:616

2019-10-30 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92278 Jan Hubicka changed: What|Removed |Added CC||mjambor at suse dot cz --- Comment #3 from

[Bug ipa/92242] [10 regression] LTO ICE in ipa_get_cs_argument_count ipa-prop.h:598

2019-10-28 Thread hubicka at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92242 Jan Hubicka changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---
