Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread Xi Ruoyao
On Mon, 2024-01-15 at 15:10 +0800, chenxiaolong wrote: > At 14:42 +0800 on the first day of 2024-01-15, Xi Ruoyao wrote: > > On Mon, 2024-01-15 at 14:32 +0800, YunQiang Su wrote: > > > Xi Ruoyao wrote at 12:11pm on Monday, January > > > 15, 2024: > > > > On Mon, 2024-01-15 at 09:29 +0800,

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread chenglulu
在 2024/1/15 下午2:42, Xi Ruoyao 写道: On Mon, 2024-01-15 at 14:32 +0800, YunQiang Su wrote: Xi Ruoyao 于2024年1月15日周一 12:11写道: On Mon, 2024-01-15 at 09:29 +0800, chenxiaolong wrote: At 21:13 +0800 on Saturday, 2024-01-13, Xi Ruoyao wrote: At 15:28 +0800 on Saturday 2024-01-13, chenxiaolong

Re: [Patch, rs6000] Eliminate unnecessary byte swaps for block clear on P8 LE [PR113325]

2024-01-14 Thread Kewen.Lin
Hi Haochen, on 2024/1/11 16:28, HAO CHEN GUI wrote: > Hi, > This patch eliminates unnecessary byte swaps for block clear on P8 > LE. For block clear, all the bytes are set to zero. The byte order > doesn't make sense. So the alignment of destination could be set to > the store mode size in

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread chenxiaolong
At 14:42 +0800 on the first day of 2024-01-15, Xi Ruoyao wrote: > On Mon, 2024-01-15 at 14:32 +0800, YunQiang Su wrote: > > Xi Ruoyao wrote at 12:11pm on Monday, January > > 15, 2024: > > > On Mon, 2024-01-15 at 09:29 +0800, chenxiaolong wrote: > > > > At 21:13 +0800 on Saturday, 2024-01-13, Xi

[Committed] RISC-V: Fix attributes bug configuration of ternary instructions

2024-01-14 Thread Juzhe-Zhong
This patch fixes the following FAILs: Running target riscv-sim/-march=rv64gcv/-mabi=lp64d/-mcmodel=medlow/--param=riscv-autovec-preference=fixed-vlmax FAIL: gcc.c-torture/execute/pr68532.c -O0 execution test FAIL: gcc.c-torture/execute/pr68532.c -O1 execution test FAIL:

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread Xi Ruoyao
On Mon, 2024-01-15 at 14:32 +0800, YunQiang Su wrote: > Xi Ruoyao 于2024年1月15日周一 12:11写道: > > > > On Mon, 2024-01-15 at 09:29 +0800, chenxiaolong wrote: > > > At 21:13 +0800 on Saturday, 2024-01-13, Xi Ruoyao wrote: > > > > At 15:28 +0800 on Saturday 2024-01-13, chenxiaolong wrote: > > > > >

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread YunQiang Su
Xi Ruoyao 于2024年1月15日周一 12:11写道: > > On Mon, 2024-01-15 at 09:29 +0800, chenxiaolong wrote: > > At 21:13 +0800 on Saturday, 2024-01-13, Xi Ruoyao wrote: > > > At 15:28 +0800 on Saturday 2024-01-13, chenxiaolong wrote: > > > > gcc/testsuite/ChangeLog: > > > > > > > > * gcc.dg/pr104992.c: Added

Re: [PATCH, rs6000] Enable block compare expand on P9 with m32 and mpowerpc64

2024-01-14 Thread Kewen.Lin
Hi Haochen, on 2024/1/12 14:48, HAO CHEN GUI wrote: > Hi, > On P9 "setb" is used to set the result of block compare. So it works > with m32 and mpowerpc64. On P8, carry bit is used. So it can't work > with m32 and mpowerpc64. This patch enables block compare expand for > m32 and mpowerpc64 on

Re: [PATCH 1/2] RISC-V: delete all the vector psabi checking.

2024-01-14 Thread juzhe.zh...@rivai.ai
I think you should also remove riscv_vector_abi since vector ABI is ratified and we should by default enable vector calling convention by default. juzhe.zh...@rivai.ai From: yanzhang.wang Date: 2024-01-15 14:00 To: gcc-patches CC: juzhe.zhong; kito.cheng; pan2.li; lehua.ding; yanzhang.wang

Re: [PATCH, rs6000] Refactor expand_compare_loop and split it to two functions

2024-01-14 Thread Kewen.Lin
Hi Haochen, on 2024/1/10 09:35, HAO CHEN GUI wrote: > Hi, > This patch refactors function expand_compare_loop and split it to two > functions. One is for fixed length and another is for variable length. > These two functions share some low level common help functions. I'm expecting refactoring

Re: [PATCH] strub: Only unbias stack point for SPARC_STACK_BOUNDARY_HACK [PR113100]

2024-01-14 Thread Kewen.Lin
on 2024/1/12 19:03, Alexandre Oliva wrote: > On Jan 12, 2024, "Kewen.Lin" wrote: > By checking PR112917, IMHO we should keep this unbiasing guarded under SPARC_STACK_BOUNDARY_HACK (TARGET_ARCH64 && TARGET_STACK_BIAS), similar to some existing code special treating SPARC stack

[PATCH 2/2] RISC-V: delete vector abi checking in all relevant tests.

2024-01-14 Thread yanzhang . wang
From: Yanzhang Wang gcc/testsuite/ChangeLog: * gcc.target/riscv/rvv/base/abi-call-args-1-run.c: Delete the -Wno-psabi. * gcc.target/riscv/rvv/base/abi-call-args-1.c: Ditto. * gcc.target/riscv/rvv/base/abi-call-args-2-run.c: Ditto. *

[PATCH 1/2] RISC-V: delete all the vector psabi checking.

2024-01-14 Thread yanzhang . wang
From: Yanzhang Wang Thanks the https://hub.fgit.cf/riscv-non-isa/riscv-elf-psabi-doc/pull/389, we need not to maintain the psabi checking any more. gcc/ChangeLog: * config/riscv/riscv.cc (riscv_arg_has_vector): Delete. (riscv_pass_in_vector_p): Delete.

Re: MIPS: the method of getting GOT address for PIC code

2024-01-14 Thread YunQiang Su
YunQiang Su 于2023年8月25日周五 15:16写道: > > When working on LLVM, I found this problem > https://github.com/llvm/llvm-project/issues/64974. > Maybe it's time for us to reconsider the way of getting GOT address > for PIC code. > I have my draft patch pushed to GitHub:

[PATCH] MIPS: avoid $gp store if global_pointer is not $gp

2024-01-14 Thread YunQiang Su
$GP is used for expanding GOT load, and in the afterward passes, we will try to use a temporary register instead. If sucess, we have no need to store and reload $gp. The example of failure is that the function calls a preemtive function. We shouldn't use $GP for any other purpose in the code we

Re: [Patch] libgomp.texi: Document omp_pause_resource{,_all} and omp_target_memcpy* (was: [Patch] libgomp.texi: Document omp_pause_resource{,_all})

2024-01-14 Thread Sandra Loosemore
On 1/14/24 16:15, Tobias Burnus wrote: +@node omp_target_memcpy +@subsection @code{omp_target_memcpy} -- Copy data between devices +@table @asis +@item @emph{Description}: +This routine tests copies @var{length} of bytes of data from the device +identified by device number @var{src_device_num}

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread Xi Ruoyao
On Mon, 2024-01-15 at 09:29 +0800, chenxiaolong wrote: > At 21:13 +0800 on Saturday, 2024-01-13, Xi Ruoyao wrote: > > At 15:28 +0800 on Saturday 2024-01-13, chenxiaolong wrote: > > > gcc/testsuite/ChangeLog: > > > > > > * gcc.dg/pr104992.c: Added additional "-mlsx" compilation > > > options. >

Re: [PATCH] libstdc++: atomic: Add missing clear_padding in __atomic_float constructor

2024-01-14 Thread H.J. Lu
On Sun, Jan 7, 2024, 5:02 PM xndcn wrote: > Hi, I found __atomic_float constructor does not clear padding, > while __compare_exchange assumes it as zeroed padding. So it is easy to > reproducing a infinite loop in X86-64 with long double type like: > --- > -O0 -std=c++23 -mlong-double-80 >

Re: [PATCH] libsupc++: Fix UB terminating on foreign exception

2024-01-14 Thread Julia DeMille
On 2024-01-14 18:51, Julia DeMille wrote: I'm unsure if my patch actually fixes it with this demo -- I need to work out how to use a patched GCC without installing it on my system, but without it breaking from not having things it expects to exist on the system. I've gotten this to work, and

Re: [PATCH] Pass GUILE down to subdirectories

2024-01-14 Thread Eric Gallager
On Sat, Jan 13, 2024 at 6:36 AM Andrew Burgess wrote: > > Tom Tromey writes: > > > When I enable cgen rebuilding in the binutils-gdb tree, the default is > > to run cgen using 'guile'. However, on my host, guile is guile 2.2, > > which doesn't work for me -- I have to use guile3.0. > > > > This

[PATCH] RISC-V: Fix regression (GCC-14 compare with GCC-13.2) of SHA256 from coremark-pro

2024-01-14 Thread Juzhe-Zhong
This patch fixes -70% performance drop from GCC-13.2 to GCC-14 with -march=rv64gcv in real hardware. The root cause is incorrect cost model cause inefficient vectorization which makes us performance drop significantly. So this patch does: 1. Adjust vector to scalar cost by introducing v to

Re: [PATCH v2] LoongArch: testsuite:Added additional vectorization "-mlsx" option.

2024-01-14 Thread chenxiaolong
At 21:13 +0800 on Saturday, 2024-01-13, Xi Ruoyao wrote: > At 15:28 +0800 on Saturday 2024-01-13, chenxiaolong wrote: > > gcc/testsuite/ChangeLog: > > > > * gcc.dg/pr104992.c: Added additional "-mlsx" compilation > > options. > > * gcc.dg/signbit-2.c: Dito. > > *

[PATCH] RISC-V: Adjust loop len by costing 1 when NITER < VF

2024-01-14 Thread Juzhe-Zhong
Update in v2: Add dynmaic lmul test. This patch fixes the regression between GCC 13.2.0 and trunk GCC (GCC-14) GCC 13.2.0: lui a5,%hi(a) li a4,19 sb a4,%lo(a)(a5) li a0,0 ret Trunk GCC: vsetvli a5,zero,e8,mf2,ta,ma li

Re: [PATCH] libsupc++: Fix UB terminating on foreign exception

2024-01-14 Thread Julia DeMille
On 2024-01-14 01:52, Jonathan Wakely wrote: The reason for this is that the ChangeLog files are auto-generated from the git commit messages, not edited by hand. Patches to those files rarely apply cleanly anyway, because they change so frequently that patches are stale almost immediately.

[PING][PATCH] libstdc++: atomic: Add missing clear_padding in __atomic_float constructor

2024-01-14 Thread xndcn
Ping. Thanks. xndcn 于2024年1月8日周一 09:01写道: > Hi, I found __atomic_float constructor does not clear padding, > while __compare_exchange assumes it as zeroed padding. So it is easy to > reproducing a infinite loop in X86-64 with long double type like: > --- > -O0 -std=c++23 -mlong-double-80 >

[Patch] libgomp.texi: Document omp_pause_resource{,_all} and omp_target_memcpy* (was: [Patch] libgomp.texi: Document omp_pause_resource{,_all})

2024-01-14 Thread Tobias Burnus
Hi Sandra, hi all, Sandra Loosemore: On 1/14/24 07:26, Tobias Burnus wrote: I have some minor nits about typos and copy-editing. Thanks. That's the downside of doing editing while being sleepy on a train. Updated and extended version (documenting also omp_target_memcpy) is attached.

[committed] Disable tests for strdup/strndup on __hpux__ in various builtin-object-size tests

2024-01-14 Thread John David Anglin
Tested on hppa64-hp-hpux11.11. Committed to trunk. Dave --- Disable tests for strdup/strndup on __hpux__ hppa*-*-hpux* doesn't have strdup or strndup. 2024-01-14 John David Anglin gcc/testsuite/ChangeLog: * gcc.dg/builtin-object-size-1.c: Disable tests for strdup/strndup

[committed] Skip several gcc.dg/builtin-dynamic-object-size tests on hppa*-*-hpux*

2024-01-14 Thread John David Anglin
Tested on hppa64-hp-hpux11.11. Committed to trunk. Dave --- Skip several gcc.dg/builtin-dynamic-object-size tests on hppa*-*-hpux* hppa*-*-hpux* doesn't have strdup or strndup. 2024-01-14 John David Anglin gcc/testsuite/ChangeLog: * gcc.dg/builtin-dynamic-object-size-0.c: Skip on

[PATCH/RFC] Add --with-dwarf4 configure option.

2024-01-14 Thread Roger Sayle
This patch fixes three of the four unexpected failures that I'm seeing in the gcc testsuite on x86_64-pc-linux-gnu. The three FAILs are: FAIL: gcc.c-torture/execute/fprintf-2.c -O3 -g (test for excess errors) FAIL: gcc.c-torture/execute/printf-2.c -O3 -g (test for excess errors) FAIL:

[committed] Fix dg-warning on hppa*64*-*-*

2024-01-14 Thread John David Anglin
Tested on hppa64-hp-hpux11.11. Committed to trunk. Dave --- Fix dg-warning on hppa*64*-*-* 2024-01-14 John David Anglin gcc/testsuite/ChangeLog: * gcc.dg/Wattributes-6.c: Fix dg-warning on hppa*64*-*-*. diff --git a/gcc/testsuite/gcc.dg/Wattributes-6.c

[patch,wwwdocs,avr,applied] Add AVR news for v14.

2024-01-14 Thread Georg-Johann Lay
https://gcc.gnu.org/gcc-14/changes.html#avr Johann -- diff --git a/htdocs/gcc-14/changes.html b/htdocs/gcc-14/changes.html index 9c9dfa44..8c738683 100644 --- a/htdocs/gcc-14/changes.html +++ b/htdocs/gcc-14/changes.html @@ -342,7 +342,55 @@ a work-in-progress. - +AVR + + On AVR64* and

[committed] Skip several analyzer socket tests on hppa*-*-hpux*

2024-01-14 Thread John David Anglin
Tested on hppa64-hp-hpux11.11. Committed to trunk. Dave --- Skip several analyzer socket tests on hppa*-*-hpux* 2024-01-14 John David Anglin gcc/testsuite/ChangeLog: PR analyzer/113150 * c-c++-common/analyzer/fd-glibc-byte-stream-socket.c: Skip on hppa*-*-hpux*.

Re: [Patch] libgomp.texi: Document omp_pause_resource{,_all}

2024-01-14 Thread Sandra Loosemore
On 1/14/24 07:26, Tobias Burnus wrote: This documents two more OpenMP (5.0) routines, omp_pause_resource and omp_pause_resource_all. Comments, remarks, suggestions - to the patch or the documentation in general? I have some minor nits about typos and copy-editing. I assume the formatting

[PATCH V1] rs6000: New pass for replacement of adjacent (load) lxv with lxvp

2024-01-14 Thread Ajit Agarwal
Hello All: This patch add the vecload pass to replace adjacent memory accesses lxv with lxvp instructions. This pass is added before ira pass. vecload pass removes one of the defined adjacent lxv (load) and replace with lxvp. Due to removal of one of the defined loads the allocno is has only

Re: [patch, avr, ping #3] PR target/112944: Support .rodata in RAM for AVR64* and AVR128* devices

2024-01-14 Thread Jeff Law
On 1/14/24 06:05, Georg-Johann Lay wrote: Ping #3 RFA: https://gcc.gnu.org/pipermail/gcc-patches/2023-December/640140.html Ping #1 https://gcc.gnu.org/pipermail/gcc-patches/2023-December/640981.html Ping #2 https://gcc.gnu.org/pipermail/gcc-patches/2024-January/641912.html This is a patch

[committed] Fix MIPS bootstrap

2024-01-14 Thread Jeff Law
mips bootstraps have been broken for a while. They've been triggering an error about mutually exclusive equal-tests always being false when building gencondmd. This was ultimately tracked down to the ior3_mips16_asmacro pattern. The pattern uses the GPR mode iterator which looks like this:

[Patch] libgomp.texi: Document omp_pause_resource{,_all}

2024-01-14 Thread Tobias Burnus
This documents two more OpenMP (5.0) routines, omp_pause_resource and omp_pause_resource_all. Comments, remarks, suggestions - to the patch or the documentation in general? Tobias PS: When looking at it, I found an issue in the spec with regards to a new constant (post TR12, hence, not

[patch,avr,ping #3] PR target/112944: Support .rodata in RAM for AVR64* and AVR128* devices

2024-01-14 Thread Georg-Johann Lay
Ping #3 RFA: https://gcc.gnu.org/pipermail/gcc-patches/2023-December/640140.html Ping #1 https://gcc.gnu.org/pipermail/gcc-patches/2023-December/640981.html Ping #2 https://gcc.gnu.org/pipermail/gcc-patches/2024-January/641912.html This is a patch that locates .rodata in flash for some AVR