https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #24 from edison ---
(In reply to Hongtao Liu from comment #23)
> (In reply to edison from comment #22)
> > for 607.cactuBSSN_s, if preENV_GOMP_CPU_AFFINITY = 0-23 is used in the CPU2017 .cfg,
> > all P-core (i9-13900K) usage drops to
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #23 from Hongtao Liu ---
(In reply to edison from comment #22)
> for 607.cactuBSSN_s, if preENV_GOMP_CPU_AFFINITY = 0-23 is used in the CPU2017 .cfg,
> all P-core (i9-13900K) usage drops to 15% (the E-cores are at almost 100%); if
> comment out
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
edison changed:
What|Removed |Added
CC||edison_chan_gz at hotmail dot
com
---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
liuhongt at gcc dot gnu.org changed:
What|Removed |Added
CC||liuhongt at gcc dot
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #20 from Jan Hubicka ---
On zen4 hardware I now get
GCC13 with -O3 -flto -march=native -fopenmp
2163
2161
2153
Average: 2159 Iterations Per Minute
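As a quick check of the arithmetic, the reported average follows from the three runs listed above. This is a minimal sketch only; the helper name `mean_rounded` is hypothetical and is not part of the benchmark harness:

```c
#include <assert.h>

/* Hypothetical helper: mean of n positive samples, rounded to the
   nearest integer. Not taken from the benchmark itself. */
static long mean_rounded(const long *v, int n)
{
    assert(n > 0);
    long sum = 0;
    for (int i = 0; i < n; i++)
        sum += v[i];
    /* Adding n/2 before dividing rounds to nearest for positive sums. */
    return (sum + n / 2) / n;
}
```

Calling it on the three GCC 13 results (2163, 2161, 2153) gives 6477 / 3 = 2159, matching the reported figure.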
clang 17 with -O3 -flto -march=native -fopenmp
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #19 from CVS Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:e1e127de18dbee47b88fa0ce74a1c7f4d658dc68
commit r14-4571-ge1e127de18dbee47b88fa0ce74a1c7f4d658dc68
Author: Zhang, Jun
Date: Fri
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #18 from Uroš Bizjak ---
One interesting observation:
clang is able to do this:
0.09 │ │ vmovddup -0x8(%rdx,%rsi,1),%xmm3 ▒
...
0.11 │ │ vfmadd231sd %xmm2,%xmm3,%xmm1▒
...
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #17 from Jan Hubicka ---
I was also thinking of DCE. It looks like a plausible idea. It may lead to a
surprise where you store the same undefined variable to two places and later
compare them for equality, but that is undefined anyway.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
Jakub Jelinek changed:
What|Removed |Added
CC||jakub at gcc dot gnu.org
--- Comment
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #15 from Martin Jambor ---
Oh, because I missed the -DOPACITY in the second command line. The reason for
SRA creating the replacement is total scalarization :-/
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #14 from Martin Jambor ---
(In reply to Jan Hubicka from comment #13)
> The only difference between slp vectorization is:
>
> - # _68 = PHI <_5(3)>
> - # _67 = PHI <_11(3)>
> - # _66 = PHI <_16(3)>
> - .r = _68;
> - .g = _67;
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
Jan Hubicka changed:
What|Removed |Added
CC||rguenther at suse dot de
See
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #12 from Jan Hubicka ---
> /home/sdp/jun/btl0/install/bin/ld: /tmp/ccnX75zI.ltrans0.ltrans.o: in
> function `main':
> :(.text.startup+0x1): undefined reference to `GMCommand'
I wonder if your plugin is configured correctly. Can
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #11 from jun zhang ---
Hello, Hubicka and Artem,
I am trying to reproduce this issue on Raptor Lake.
With -fopenmp -O3 -flto I hit the following error,
but with -fopenmp -O3 and no -flto the build succeeds.
Could you help me?
libtool: link:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #10 from Jan Hubicka ---
This is a benchmarkable version of the simplified testcase:
jan@localhost:/tmp> cat t.c
#define N 1000
struct rgb {unsigned char r,g,b;} rgbs[N];
int *addr;
struct drgb {double r,g,b;
#ifdef OPACITY
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #9 from Jan Hubicka ---
Oddly enough, a simplified version of the loop SLP-vectorizes for me:
struct rgb {unsigned char r,g,b;} *rgbs;
int *addr;
double *weights;
struct drgb {double r,g,b;};
struct drgb sum()
{
struct drgb r;
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
--- Comment #8 from Jan Hubicka ---
Created attachment 55178
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55178&action=edit
Preprocessed source of VerticalFiller and HorisontalFiller
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109812
Jan Hubicka changed:
What|Removed |Added
Summary|GraphicsMagick resize is a |GraphicsMagick resize is a