[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #19 from hubicka at gcc dot gnu dot org 2008-02-06 16:56 --- Created an attachment (id=15108) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15108&action=view) Complete continue heuristic patch Hi, this is the complete patch. With this patch we produce profi

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #18 from hubicka at gcc dot gnu dot org 2008-02-06 16:44 --- Created an attachment (id=15107) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15107&action=view) Path to predict_paths_leading_to Hi, I've revived the continue heuristic patch. By itself it d

[Bug target/23305] [4.0/4.1/4.2/4.3 Regression] x87 load hoisting problem

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2008-02-06 15:10 --- Looks like last remaining problem is the missed loop invariant motion due to STACK_REGS hack as in the case of pr23322 [EMAIL PROTECTED]:/aux/hubicka/trunk-write/buidl2/gcc$ time ./a.out-nostackregs-hack real

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #28 from hubicka at gcc dot gnu dot org 2008-02-06 15:10 --- *** Bug 23305 has been marked as a duplicate of this bug. *** -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=23322

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #17 from hubicka at gcc dot gnu dot org 2008-02-06 13:28 --- One problem is the following: do { ; match = window + cur_match; if (match[best_len] != scan_end || match[best_len-1] != scan_end1 || *match != *scan

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #27 from hubicka at gcc dot gnu dot org 2008-02-06 11:55 --- I noticed those posts. Part of the problem might be that hoisting triggers the partial memory stall bug I fixed. Partial memory stalls are quite expensive, so this might improve scores without the hack in some

[Bug objc/5587] Objc problems with user_label_prefix

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #5 from hubicka at gcc dot gnu dot org 2008-02-06 11:34 --- Subject: Bug 5587 Author: hubicka Date: Wed Feb 6 11:34:00 2008 New Revision: 132145 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=132145 Log: PR target/5587 * i386.md (moddf_inte

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #25 from hubicka at gcc dot gnu dot org 2008-02-06 11:30 --- Hmm, sorry. IS_STACK_MODE obviously return false for SSE math, per Paolo's commit. Since it is not used except for the hacks to avoid constant hoisting, I guess it is OK. I will commit only the patch for pe

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #24 from hubicka at gcc dot gnu dot org 2008-02-06 11:26 --- IS_STACK_MODE returns true for MODEs that *might* go to the stack registers. When we do SSE math, SFmode/DFmode will most likely go into SSE register, but still they are valid for stack register and might go

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #21 from hubicka at gcc dot gnu dot org 2008-02-05 23:54 --- Looking at the -O2 and -O2 -fno-inline-small-functions, I believe last remaining problem is our inability to hoist load of 0 out of loop: The fill loop without inlining is taking the value as argument: .L7

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #20 from hubicka at gcc dot gnu dot org 2008-02-05 22:50 --- Last rumors I heard about LR splitting was that it didn't really helped and worked and used LOOP notes, so it would need complete rewrite anyway. This problem wasn't really LR splitting issue, just wr

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #18 from hubicka at gcc dot gnu dot org 2008-02-05 16:24 --- RA still don't split live ranges, but works sanely here: .L21: faddl (%ebx,%eax,8) addl$1, %eax cmpl%edx, %eax jl .L21 Honza -- hubicka at gcc dot gnu do

[Bug target/23322] [4.1/4.2/4.3 regression] performance regression: global regalloc doesn't split live ranges

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #17 from hubicka at gcc dot gnu dot org 2008-02-05 16:18 --- The simplified testcase is dealt with the call crossed frequency patch. I now get: .L2: faddl (%edx,%eax,8) addl$1, %eax cmpl$2000, %eax jne .L2 fstpl -24

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #16 from hubicka at gcc dot gnu dot org 2008-02-05 13:55 --- Thanks, looks comparable to K8 scores, except that -O3 is not actually that worse there. So it looks there is more than just random effect of code layout involved, I will try to look into the assembly produced

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2008-02-05 13:36 --- Thanks, looks comparable to K8 scores, except that -O3 is not actually that worse there. So it looks there is more than just random effect of code layout involved, I will try to look into the assembly produced

[Bug tree-optimization/23305] [4.0/4.1/4.2/4.3 Regression] Inlining related regression for gcc-4.x

2008-02-05 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-02-05 13:31 --- This testcase is still slower, 4.4s with -O2 and 3.6s with -O2 -fno-inline-small-functions (on i386). I wondered if the patch counting frequency of calls crossed helped here. My slowdown is smaller than what

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-03 Thread hubicka at gcc dot gnu dot org
--- Comment #13 from hubicka at gcc dot gnu dot org 2008-02-03 13:39 --- Tonight runs on haydn with patch in shows regression on gzip: 950->901 in 32bit. FDO 64bit runs are not affected. This is same score as we had in December, we improved a bit since then but not enough to ma

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-02-02 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2008-02-02 16:22 --- Created an attachment (id=15079) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=15079&action=view) address accumulation patch While working on PR17863 I wrote the attached patch to make fwprop to

[Bug tree-optimization/17863] [4.0/4.1/4.2/4.3 Regression] performance loss (TER register presure and inlining limits problems)

2008-02-01 Thread hubicka at gcc dot gnu dot org
--- Comment #42 from hubicka at gcc dot gnu dot org 2008-02-01 22:33 --- So for summary. With TER disabled, I get 6.2s, so we are still worse than 3.4 that is 5.6s. With call-cost inline parameter increased and TER disabled, I get 5.3s. With forwprop fix, ter disabled and inline

[Bug tree-optimization/17863] [4.0/4.1/4.2/4.3 Regression] performance loss (not inlining as much??)

2008-02-01 Thread hubicka at gcc dot gnu dot org
--- Comment #40 from hubicka at gcc dot gnu dot org 2008-02-01 16:47 --- Well, I still meant that simplifying the cascaded addition into accumulator into direct addition from base makes the code to simplify. I implemented experimentally the trick in fwprop and will attach later, but

[Bug tree-optimization/17863] [4.0/4.1/4.2/4.3 Regression] performance loss (not inlining as much??)

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #36 from hubicka at gcc dot gnu dot org 2008-01-30 18:14 --- Looking at the .optimized dump, one obvious problem is that we keep a lot of pointer arithmetic that should be forward propagated: :; D.184420 = *pz; p1 = pz + 8; D.184422 = *p1; p1 = p1 + 8; D.184424

[Bug tree-optimization/17863] [4.0/4.1/4.2/4.3 Regression] performance loss (not inlining as much??)

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #35 from hubicka at gcc dot gnu dot org 2008-01-30 17:58 --- So for more proper analysis. The testcase is quite challenging for inlining heuristics and by introducing early inlining and reducing call cost we now inline less that we used to at a time I claimed that we inline

[Bug tree-optimization/17863] [4.0/4.1/4.2/4.3 Regression] performance loss (not inlining as much??)

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #34 from hubicka at gcc dot gnu dot org 2008-01-30 17:13 --- [EMAIL PROTECTED]:/aux/hubicka/trunk-write/buidl2$ time ./test-3.4 real0m5.692s user0m5.588s sys 0m0.012s [EMAIL PROTECTED]:/aux/hubicka/trunk-write/buidl2$ time ./test-3.4 real0m5.536s user

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-30 16:04 --- If you have nothing against, I would probably go for -mfpmath=sse implied by -ftree-vectorize route then. Since there is now i386 ABI mailing list I hope if we can move in direction of having -mfpmath=sse by

[Bug target/34982] [4.3 regression] calling a function with undefined parameters causes segmentation fault at -O1 or higher

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #16 from hubicka at gcc dot gnu dot org 2008-01-30 15:55 --- Subject: Bug 34982 Author: hubicka Date: Wed Jan 30 15:54:14 2008 New Revision: 131966 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131966 Log: PR target/34982

[Bug target/34982] [4.3 regression] calling a function with undefined parameters causes segmentation fault at -O1 or higher

2008-01-30 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2008-01-30 15:54 --- Fixed at mainline. I am really surprises this is 4.3 only regression since the code didn't see much changes in last few releases. -- hubicka at gcc dot gnu dot org changed: What|Re

[Bug middle-end/34852] [4.3 Regression] Revision 131576 miscompiled 178.galgel

2008-01-20 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2008-01-20 11:56 --- Double checking patch, I don't see obvious mistakes. Since the patch should only affect register allocation decisions, either we see a latent bug, or another example of x86 extra precision causing FP progr

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2008-01-19 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2008-01-19 12:03 --- Does the regression on C2 duo show even without vectorizing? It looks like generic SSE fpmath performance issue. There should be no reason why SSE math in combination with SSE vectorization should result in

[Bug tree-optimization/33761] non-optimal inlining heuristics pessimizes gzip SPEC score at -O3

2008-01-16 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-16 16:46 --- Last time I looked into it, it was code alignment affected by inlining in the string matching loop (longest_match). This code is very atypical, since the internal loop

[Bug rtl-optimization/31396] Inline code performance much worse than out-of-line

2008-01-16 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-16 16:33 --- Fixed on mainline. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added

[Bug rtl-optimization/31396] Inline code performance much worse than out-of-line

2008-01-16 Thread hubicka at gcc dot gnu dot org
--- Comment #10 from hubicka at gcc dot gnu dot org 2008-01-16 16:32 --- Subject: Bug 31396 Author: hubicka Date: Wed Jan 16 16:32:05 2008 New Revision: 131576 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131576 Log: PR rtl-optimization/31396 * r

[Bug middle-end/18071] [4.0/4.1/4.2/4.3 Regression] -Winline does not respect -fno-default-inline

2008-01-15 Thread hubicka at gcc dot gnu dot org
--- Comment #32 from hubicka at gcc dot gnu dot org 2008-01-15 15:47 --- I am bit confused by logic of code here. Middle end now ingore DECL_INLINE consistently, it is still arround since it affect some instantiation decisions in C++ FE. Does fno-default-inline work? I think only way

[Bug middle-end/32135] [4.3 Regression] bogus array-ref fold triggering array overflow warning

2008-01-13 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2008-01-13 11:26 --- Fixed. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added Status|ASSIGNED

[Bug middle-end/32135] [4.3 Regression] bogus array-ref fold triggering array overflow warning

2008-01-13 Thread hubicka at gcc dot gnu dot org
--- Comment #5 from hubicka at gcc dot gnu dot org 2008-01-13 11:18 --- Subject: Bug 32135 Author: hubicka Date: Sun Jan 13 11:18:08 2008 New Revision: 131502 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131502 Log: PR middle-end/32135 * tree-s

[Bug rtl-optimization/31396] Inline code performance much worse than out-of-line

2008-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2008-01-12 19:00 --- Created an attachment (id=14930) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=14930&action=view) tentative fix I am testing the attached patch. It is obvious that we should use profile here. The PR

[Bug other/28023] documentation error max-inline-recursive-depth

2008-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2008-01-12 14:04 --- Fixed in mainline. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added

[Bug other/28023] documentation error max-inline-recursive-depth

2008-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #2 from hubicka at gcc dot gnu dot org 2008-01-12 14:03 --- Subject: Bug 28023 Author: hubicka Date: Sat Jan 12 14:02:06 2008 New Revision: 131492 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131492 Log: PR other/28023 * invoke.texi (ma

[Bug tree-optimization/33763] [4.1/4.2/4.3 Regression] Bogus inlining failed in call to `xxx': redefined extern inline functions are not considered for inlining

2008-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #8 from hubicka at gcc dot gnu dot org 2008-01-12 13:55 --- I can definitely commit the patch to silence the (IMO valid) diagnostics. However, why programs are using always_inline and extern inline combination at all? Just extern inline should be enough. /* For GNU C

[Bug tree-optimization/34708] Inlining heuristics issue

2008-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2008-01-12 13:34 --- Fixed on mainline (at least checking number of references from cross compiler). -- hubicka at gcc dot gnu dot org changed: What|Removed |Added

[Bug middle-end/31529] [4.3 Regression] ICE in cgraph_expand_function with IMA

2008-01-11 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-11 15:39 --- The problem here is really how we handle extern inline functions redefining non-extern inline functions. What forntend does is to handle all extern inline or extern functions of same name in all units as single

[Bug middle-end/32135] [4.3 Regression] bogus array-ref fold triggering array overflow warning

2008-01-11 Thread hubicka at gcc dot gnu dot org
--- Comment #4 from hubicka at gcc dot gnu dot org 2008-01-11 15:25 --- I am testing the attached patch. It disables the transformation and produce: in_cols.0 = (char *) in_cols; D.1180 = in_cols.0 + 500; perhaps more canonical way would be in_cols.0 = in_cols + 500; d.1180

[Bug middle-end/32135] [4.3 Regression] bogus array-ref fold triggering array overflow warning

2008-01-11 Thread hubicka at gcc dot gnu dot org
-- hubicka at gcc dot gnu dot org changed: What|Removed |Added AssignedTo|unassigned at gcc dot gnu |hubicka at gcc dot gnu dot |dot org

[Bug tree-optimization/34708] Inlining heuristics issue

2008-01-09 Thread hubicka at gcc dot gnu dot org
--- Comment #11 from hubicka at gcc dot gnu dot org 2008-01-09 19:20 --- Subject: Bug 34708 Author: hubicka Date: Wed Jan 9 19:19:40 2008 New Revision: 131433 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131433 Log: PR tree-optimization/34708

[Bug tree-optimization/34708] Inlining heuristics issue

2008-01-07 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2008-01-07 22:09 --- mine. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added AssignedTo

[Bug tree-optimization/31081] [4.3 Regression] Inliner messes up SSA for abnormals

2008-01-03 Thread hubicka at gcc dot gnu dot org
--- Comment #16 from hubicka at gcc dot gnu dot org 2008-01-03 21:25 --- Subject: Bug 31081 Author: hubicka Date: Thu Jan 3 21:23:26 2008 New Revision: 131306 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=131306 Log: PR tree-optimization/31081

[Bug tree-optimization/31081] [4.3 Regression] Inliner messes up SSA for abnormals

2007-12-29 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2007-12-29 22:31 --- Well, I would personally not worry that much about pesimization. The current behaviour of increasing register pressure is worse. We definitly can insert the initialization to common dominator of all uses of the

[Bug tree-optimization/31081] [4.3 Regression] Inliner messes up SSA for abnormals

2007-12-18 Thread hubicka at gcc dot gnu dot org
--- Comment #13 from hubicka at gcc dot gnu dot org 2007-12-18 18:54 --- Sorry, my last comment was about different inliner issue that seems to be gone now. I guess easiest way around would be to initialize to 0 at the beggining of inlined function body? This happens only for

[Bug tree-optimization/31081] [4.3 Regression] Inliner messes up SSA for abnormals

2007-12-18 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2007-12-18 11:41 --- > But I wonder what would be the best way to add the PHI nodes. We really > shouldn't do mark_sym_for_renaming on underlying decl, perhaps create a dummy > decl, let intossa create PHIs etc. for i

[Bug tree-optimization/33434] [4.3 Regression] inlining miscompilation

2007-11-08 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2007-11-08 12:42 --- The problem here is that code assumes that variable without DEFAULT_DEF is not in SSA. Arguments that are unused also do have DEFAULT_DEF=NULL, this patch should fix it. I am testing it now. Honza Index: tree

[Bug tree-optimization/33434] [4.3 Regression] inlining miscompilation

2007-10-28 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2007-10-28 19:17 --- Mine -- hubicka at gcc dot gnu dot org changed: What|Removed |Added AssignedTo|unassigned

[Bug tree-optimization/31081] [4.3 Regression] Inliner messes up SSA for abnormals

2007-10-28 Thread hubicka at gcc dot gnu dot org
--- Comment #5 from hubicka at gcc dot gnu dot org 2007-10-28 19:09 --- Mine, it is another example of EH updating problem. I will try to add "may" EH edges pre-inlniing to avoid need to call mark_sym_for_renaming in inliner completely. -- hubicka at gcc dot gnu dot o

[Bug tree-optimization/33763] [4.1/4.2/4.3 Regression] Bogus inlining failed in call to `xxx': redefined extern inline functions are not considered for inlining

2007-10-14 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-10-14 12:50 --- This patch should disable the sorry message in this case, but I would argue that outputting sorry here is right - we fail to inline alway_inline function because of implementation limitation. So I would claim 3.2.0

[Bug tree-optimization/33763] [4.1/4.2/4.3 Regression] Bogus inlining failed in call to `xxx': redefined extern inline functions are not considered for inlining

2007-10-14 Thread hubicka at gcc dot gnu dot org
--- Comment #2 from hubicka at gcc dot gnu dot org 2007-10-14 12:10 --- This is result of ugly hack handling redefined extern inline functions. The most sane semantics (not what 3.2.x did) would be that function if inlined should get the extern inline body, but the body from redefined

[Bug tree-optimization/18687] [4.0/4.1/4.2/4.3 Regression] ~50% compile time regression

2007-10-09 Thread hubicka at gcc dot gnu dot org
--- Comment #36 from hubicka at gcc dot gnu dot org 2007-10-09 18:57 --- Startup times was optimized a bit in 4.3. hashes.c, 100 compilations: GCC-3.3 GCC-3.4 mainline mainline&profiledbootstrap -O0 3.2s3.8s 3.6s3.3s -O1

[Bug target/31897] [4.3 Regression] 30% speed regression with -m32 on Opteron with rnflow

2007-10-09 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-10-09 14:54 --- It is little bit sick, but what about implying -mfpmath=sse when -ftree-vectorize is used and SSE is available? The reason why we don't default to fpmath=sse is because the extra precision is told to be pa

[Bug target/33406] [4.3 Regression] At revision 128385 Bootstraping PPC Darwin still fail in Java

2007-09-17 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2007-09-17 15:12 --- Subject: Bug 33406 Author: hubicka Date: Mon Sep 17 15:12:10 2007 New Revision: 128547 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=128547 Log: PR middle-end/33348 PR targ

[Bug middle-end/33348] [4.3 Regression] gfortran.dg/g77/19990826-3.f fails at -O1

2007-09-17 Thread hubicka at gcc dot gnu dot org
--- Comment #7 from hubicka at gcc dot gnu dot org 2007-09-17 15:12 --- Subject: Bug 33348 Author: hubicka Date: Mon Sep 17 15:12:10 2007 New Revision: 128547 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=128547 Log: PR middle-end/33348 PR targ

[Bug middle-end/33348] [4.3 Regression] gfortran.dg/g77/19990826-3.f fails at -O1

2007-09-16 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-09-16 21:52 --- What happens in this testcase is (mis)behaviour of invariant code motion. On instruction: (set (mem/c/i:SI (plus:DI (reg/f:DI 113 sfp) (const_int 80 [0x50])) [3 S4 A32]) (fix:SI (reg:DF 135))) the

[Bug tree-optimization/33389] [4.3 Regression] Revision 128239 causes libgomp failure

2007-09-13 Thread hubicka at gcc dot gnu dot org
--- Comment #25 from hubicka at gcc dot gnu dot org 2007-09-13 11:27 --- Thanks a lot for tracking down this issue. The patch looks correct to me, can you please send it into gcc-patches? Honza -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=33389

[Bug target/33393] floatsisf2_sse_vector_nointernunit doesn't work on 32bit

2007-09-12 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2007-09-12 07:03 --- Subject: Bug 33393 Author: hubicka Date: Wed Sep 12 07:02:31 2007 New Revision: 128414 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=128414 Log: PR target/33393 *

[Bug tree-optimization/33383] [4.3 Regression] Revision 128092 miscompiles 400.perlbench

2007-09-11 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-09-11 14:22 --- Sure, it will be neccessary to do some binary to figure out what inlining breaks the benchmark. Not having the CPU2006 sources, I will be able to do so until Monday. But if I remember right, it reproduced down to

[Bug tree-optimization/33383] [4.3 Regression] Revision 128092 miscompiles 400.perlbench

2007-09-11 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2007-09-11 14:10 --- This seems to be latent problem unconvered by -finline-small-functions patch. Perlbmk was failing at -O3 on older SVN snapshots too. -finline-small-functions should work as an workaround until the issue is

[Bug other/33335] FAIL: 26_numerics/complex/inserters_extractors/wchar_t/1.cc

2007-09-07 Thread hubicka at gcc dot gnu dot org
--- Comment #2 from hubicka at gcc dot gnu dot org 2007-09-07 16:05 --- Apparently it is, it should get out as _U_Qfeq. I was fixing similar problem yesterday. Perhaps it works now on updated tree? Honza -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=5

[Bug middle-end/33318] [4.3 Regression] fortran/expr.c:305: internal compiler error: internal consistency failure

2007-09-06 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-09-06 15:10 --- Subject: Bug 33318 Author: hubicka Date: Thu Sep 6 15:10:28 2007 New Revision: 128187 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=128187 Log: PR target/33318 * cse.c (fold_rtx

[Bug bootstrap/33318] fortran/expr.c:305: internal compiler error: internal consistency failure

2007-09-05 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2007-09-05 22:30 --- Created an attachment (id=14162) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=14162&action=view) fix does the attached patch fix the bootstrap problem? -- http://gcc.gnu.org/bugzilla/show_bug

[Bug rtl-optimization/32504] [4.3 Regression] internal compiler error: in df_refs_verify, at df-scan.c:4058

2007-06-29 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-06-29 20:19 --- The testcase works for me on current mainline with sharing checking patch on x86_64-linux, but since the target triplet is not specified, can you double check, please, that the bug is gone? Honza -- http

[Bug rtl-optimization/32372] [4.3 Regression] ICE in df_refs_verify, at df-scan.c:4065

2007-06-29 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-06-29 20:15 --- Paolo's rumor is correct, Indeed. I somehow lost track in patches that was or wasn't approved or tested. I've comited the fix now. -- hubicka at gcc dot gnu dot org changed: W

[Bug rtl-optimization/32372] [4.3 Regression] ICE in df_refs_verify, at df-scan.c:4065

2007-06-29 Thread hubicka at gcc dot gnu dot org
--- Comment #5 from hubicka at gcc dot gnu dot org 2007-06-29 20:13 --- Subject: Bug 32372 Author: hubicka Date: Fri Jun 29 20:13:41 2007 New Revision: 126122 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=126122 Log: PR middle-end/32372 * cse.c (

[Bug tree-optimization/30563] [4.3 Regression] ice for legal code with flags -O2 -fno-unit-at-a-time

2007-06-24 Thread hubicka at gcc dot gnu dot org
--- Comment #13 from hubicka at gcc dot gnu dot org 2007-06-24 23:24 --- Subject: Bug 30563 Author: hubicka Date: Sun Jun 24 23:24:10 2007 New Revision: 125991 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=125991 Log: PR middle-end/30563 * cgra

[Bug middle-end/31541] [4.3 Regression] cannot take address of bit field

2007-06-23 Thread hubicka at gcc dot gnu dot org
--- Comment #18 from hubicka at gcc dot gnu dot org 2007-06-23 11:58 --- Subject: Bug 31541 Author: hubicka Date: Sat Jun 23 11:58:18 2007 New Revision: 125971 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=125971 Log: PR middle-end/31541 * gi

[Bug c++/12076] gcov misreports coverage of return statement [NRV]

2007-05-06 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2007-05-06 15:31 --- The problem with NRV transformation is that it removes the return statement from default path and keeps it in the catch path making GCOV to properly output it as unexecuted. I tried to look into the NRV

[Bug middle-end/28071] [4.1 regression] A file that can not be compiled in reasonable time/space

2007-04-17 Thread hubicka at gcc dot gnu dot org
--- Comment #65 from hubicka at gcc dot gnu dot org 2007-04-17 19:16 --- I can confirm that at -O2, memory consumption dropped from 0.5GB to 0.28GB, that is indeed good improvement. To summarize http://www.suse.de/~gcctest/memory/results/200704171438/pr28071-O2.rep Compile time wise

[Bug middle-end/30700] [4.2 Regression] YA bogus undefined reference error to static func with -g and -O

2007-04-17 Thread hubicka at gcc dot gnu dot org
--- Comment #8 from hubicka at gcc dot gnu dot org 2007-04-17 19:04 --- Fixed. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added Status|NEW

[Bug middle-end/30700] [4.2 Regression] YA bogus undefined reference error to static func with -g and -O

2007-04-17 Thread hubicka at gcc dot gnu dot org
--- Comment #7 from hubicka at gcc dot gnu dot org 2007-04-17 19:02 --- Subject: Bug 30700 Author: hubicka Date: Tue Apr 17 19:02:09 2007 New Revision: 123922 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=123922 Log: PR middle-end/30700 * dwa

[Bug target/27869] "-O -fregmove" handles SSE scalar instructions incorrectly

2007-04-16 Thread hubicka at gcc dot gnu dot org
--- Comment #16 from hubicka at gcc dot gnu dot org 2007-04-16 17:07 --- Subject: Bug 27869 Author: hubicka Date: Mon Apr 16 17:07:19 2007 New Revision: 123876 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=123876 Log: PR target/27869 * config/i38

[Bug target/27869] "-O -fregmove" handles SSE scalar instructions incorrectly

2007-04-09 Thread hubicka at gcc dot gnu dot org
--- Comment #12 from hubicka at gcc dot gnu dot org 2007-04-10 00:06 --- Subject: Bug 27869 Author: hubicka Date: Tue Apr 10 00:06:16 2007 New Revision: 123682 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=123682 Log: PR target/27869 * config/i38

[Bug gcov-profile/28441] Need atomic increment of gcov counters for MP programs

2007-03-14 Thread hubicka at gcc dot gnu dot org
--- Comment #13 from hubicka at gcc dot gnu dot org 2007-03-14 23:03 --- Note that Michael Matz commited a patch to solve the problem independently. http://gcc.gnu.org/ml/gcc-patches/2007-03/msg00950.html I hope it will get into mainline in few days. Honza -- http://gcc.gnu.org

[Bug middle-end/30509] [4.3 Regression] ice for legal code with -O3

2007-02-25 Thread hubicka at gcc dot gnu dot org
--- Comment #10 from hubicka at gcc dot gnu dot org 2007-02-25 18:11 --- Subject: Bug 30509 Author: hubicka Date: Sun Feb 25 18:11:20 2007 New Revision: 122314 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=122314 Log: PR middle-end/30509 * tree-

[Bug target/30778] [4.3 Regression] invalid code generation for memset() with -mtune=k8

2007-02-25 Thread hubicka at gcc dot gnu dot org
--- Comment #8 from hubicka at gcc dot gnu dot org 2007-02-25 17:37 --- Subject: Bug 30778 Author: hubicka Date: Sun Feb 25 17:37:37 2007 New Revision: 122313 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=122313 Log: PR target/30778 * i386.c (counter_mo

[Bug target/30778] [4.3 Regression] invalid code generation for memset() with -mtune=k8

2007-02-24 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-02-24 23:47 --- Hi, this is version I ended up with. In general I don't think that code should handle with such a small constant sizes, we probably should bump move_by_pieces constants since it can do better job using proper

[Bug target/30778] [4.3 Regression] invalid code generation for memset() with -mtune=k8

2007-02-24 Thread hubicka at gcc dot gnu dot org
--- Comment #4 from hubicka at gcc dot gnu dot org 2007-02-24 23:09 --- Hi, testing for expected_size is wrong here - with profile feedback, expected_size is average size of the block and thus can be smaller than actual size of the block being copied. In general we should in this case

[Bug middle-end/30509] [4.3 Regression] ice for legal code with -O3

2007-02-24 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-02-24 22:58 --- I am testing a fix: Index: tree-inline.c === *** tree-inline.c (revision 122292) --- tree-inline.c (working copy) *** copy_bb

[Bug middle-end/30908] -Os inlines functions that are called more than once (optimization regression)

2007-02-21 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2007-02-21 17:26 --- The main problem here is that inliner really don't have enough of detailed information. In general inlining improves optimization and often leads to smaller code when inlining such a trivial function ARM o

[Bug gcov-profile/30650] [4.3 Regression] ICE with -fprofile-use

2007-02-13 Thread hubicka at gcc dot gnu dot org
--- Comment #9 from hubicka at gcc dot gnu dot org 2007-02-14 01:24 --- Hi, the reason is -msse3 that is supposed to be at both invocations. -msse3 enables the SSE builtins that do change declarations UIDs that in turn changes function names: < ;; Function ddx (ddx.1

[Bug middle-end/28071] [4.1 regression] A file that can not be compiled in reasonable time/space

2007-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #61 from hubicka at gcc dot gnu dot org 2007-02-06 22:14 --- Also forgot to mention, integration is slow because of the split_block quadraticness. For -O2: We need 531MB of ram, GGC memory is peaking at 100MB, large portion of the non-GGC memory are definitly the scheduler

[Bug middle-end/28071] [4.1 regression] A file that can not be compiled in reasonable time/space

2007-02-06 Thread hubicka at gcc dot gnu dot org
--- Comment #60 from hubicka at gcc dot gnu dot org 2007-02-06 22:05 --- Hi, small update on status. At -O3 -fno-tree-fre -fno-tree-pre we are now doing 1.1GB footprint, 800MB of this out of gimple. We still explode in FRE/PRE but majority of other problems was fixed: Execution times

[Bug middle-end/30696] [4.3 regression] Linker failure with OpenMP and inline function

2007-02-04 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-02-04 23:41 --- Subject: Bug 30696 Author: hubicka Date: Sun Feb 4 23:40:58 2007 New Revision: 121582 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=121582 Log: PR middle-end/30696 * ipa-

[Bug gcov-profile/30650] [4.3 Regression] ICE with -fprofile-use

2007-02-03 Thread hubicka at gcc dot gnu dot org
--- Comment #7 from hubicka at gcc dot gnu dot org 2007-02-03 23:35 --- Subject: Bug 30650 Author: hubicka Date: Sat Feb 3 23:35:42 2007 New Revision: 121554 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=121554 Log: PR gcov-profile/30650 * valu

[Bug gcov-profile/30650] [4.3 Regression] ICE with -fprofile-use

2007-02-03 Thread hubicka at gcc dot gnu dot org
--- Comment #4 from hubicka at gcc dot gnu dot org 2007-02-03 21:49 --- Created an attachment (id=13002) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=13002&action=view) Fix -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30650

[Bug gcov-profile/30650] [4.3 Regression] ICE with -fprofile-use

2007-02-03 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-02-03 21:48 --- Testing fix. -- hubicka at gcc dot gnu dot org changed: What|Removed |Added AssignedTo

[Bug c/4076] -Wunused doesn't warn about static function only called by itself.

2007-01-27 Thread hubicka at gcc dot gnu dot org
--- Comment #6 from hubicka at gcc dot gnu dot org 2007-01-27 15:57 --- update_inlined_to_pointers is obviously no longer needed and can be safely removed now. Thanks for noticing it ;) Honza -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=4076

[Bug middle-end/30537] [4.3 regression] ICE with -fno-unit-at-a-time an inlining

2007-01-21 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2007-01-22 00:40 --- Mine -- hubicka at gcc dot gnu dot org changed: What|Removed |Added AssignedTo|unassigned

[Bug bootstrap/30467] [4.3 regression] Bootstrap failure on i386

2007-01-14 Thread hubicka at gcc dot gnu dot org
--- Comment #1 from hubicka at gcc dot gnu dot org 2007-01-15 01:56 --- since the bug does not reproduce on Linux setup, would be possible to have preprocessed testcase and possibly also backtrace of the crash? Thank you, Honza -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30467

[Bug middle-end/30443] [4.3 Regression] 4.3 internal compiler error: verify_cgraph_node failed

2007-01-12 Thread hubicka at gcc dot gnu dot org
--- Comment #4 from hubicka at gcc dot gnu dot org 2007-01-12 18:08 --- Subject: Bug 30443 Author: hubicka Date: Fri Jan 12 18:07:57 2007 New Revision: 120730 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=120730 Log: PR tree-optimization/30443 * tree-

[Bug middle-end/30443] [4.3 Regression] 4.3 internal compiler error: verify_cgraph_node failed

2007-01-11 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-01-12 01:43 --- Patch in testing. It is related to function cloning update needed by Razya's IPCP code. Honza Index: tree-inline.c === *** tree-inl

[Bug tree-optimization/1046] gcc less efficient than jdk for recursion with -finline-functions

2007-01-11 Thread hubicka at gcc dot gnu dot org
--- Comment #16 from hubicka at gcc dot gnu dot org 2007-01-11 16:50 --- Subject: Bug 1046 Author: hubicka Date: Thu Jan 11 16:50:32 2007 New Revision: 120681 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=120681 Log: PR tree-optimization/1046 * tree-ta

[Bug tree-optimization/1046] gcc less efficient than jdk for recursion with -finline-functions

2007-01-11 Thread hubicka at gcc dot gnu dot org
--- Comment #15 from hubicka at gcc dot gnu dot org 2007-01-11 16:45 --- Subject: Bug 1046 Author: hubicka Date: Thu Jan 11 16:44:56 2007 New Revision: 120680 URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=120680 Log: PR tree-optimization/1046 * tree-ta

[Bug target/30383] [4.3 Regression] ice for legal code on x86_64 in ix86_expand_movmem

2007-01-06 Thread hubicka at gcc dot gnu dot org
--- Comment #3 from hubicka at gcc dot gnu dot org 2007-01-07 03:23 --- Created an attachment (id=12868) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=12868&action=view) fix in testing testing a fix. -- hubicka at gcc dot gnu dot org changed: What|

[Bug tree-optimization/30392] [4.3 Regression] ice for legal kernel code with -Os

2007-01-06 Thread hubicka at gcc dot gnu dot org
--- Comment #2 from hubicka at gcc dot gnu dot org 2007-01-06 18:10 --- Created an attachment (id=12866) --> (http://gcc.gnu.org/bugzilla/attachment.cgi?id=12866&action=view) patch in testing Mine, testing the attached fix. Honza -- hubicka at gcc dot gnu dot org

[Bug tree-optimization/30089] Compiling FreeFem3d uses unreasonable amount of time and memory

2006-12-23 Thread hubicka at gcc dot gnu dot org
--- Comment #14 from hubicka at gcc dot gnu dot org 2006-12-23 14:27 --- Well, actually the testcase now runs out of memory and ICE... -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30089

<    1   2   3   4   5   6   7   >