[PATCH v3, rs6000] Add V1TI into vector comparison expand [PR103316]

2022-03-20 Thread HAO CHEN GUI via Gcc-patches
Hi, This patch adds V1TI mode into a new mode iterator used in vector comparison expands. Without the patch, the comparisons between two vector __int128 are converted to scalar comparisons with branches. The code is suboptimal. The patch fixes the issue. Now all comparisons between two vector __in

Re: [PATCH] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch [PR 104978]

2022-03-20 Thread Hongyu Wang via Gcc-patches
> > > Would it be better to use vmovss under avx512vl without & 1 for mask. > > > > vmovss clears the upper bits, but the intrinsic requires src1. We > > still need either a mask move or blend for the high part. > not for __m128 _mm_mask_move_ss (__m128 src, __mmask8 k, __m128 a, __m128 b) > https:

[PATCH] libstdc++: Work around clang misdesign in time_get<>::get [PR104990]

2022-03-20 Thread Jakub Jelinek via Gcc-patches
Hi! Apparently clang has a -fgnuc-version= option which allows it to pretend it is any GCC version the user likes. It is already bad that it claims to be GCC 4.2 compatible by default when it is not (various unimplemented extensions at least), but this option is a horrible idea. Anyway, this pat

Re: [PATCH] libgompd: add OMPD support, libgompd initialization and global ICVs functions

2022-03-20 Thread Mohamed Atef via Gcc-patches
Hello, I know it's too much. We renamed the functions that are not part of the standard from the ompd_* prefix to gompd_. Thanks. On Wed, Mar 16, 2022 at 5:48 AM Mohamed Atef wrote: > Hi, > we found some typos in the ChangeLog and some wrong spaces (nightmare) > in the files. > So here's

[PATCH] fortran: use fpu-glibc on powerpc*-unknown-freebsd

2022-03-20 Thread Piotr Kubaj
The fpu-glibc name is a bit of a misnomer, since it also works on other systems. I have verified that it builds and can also correctly build other packages (that were failing before). From 0feb83fae30070cf250781449670638d7addd947 Mon Sep 17 00:00:00 2001 From: Piotr Kubaj Date: Sun, 20 Mar 2022 13:03:13 +

New Swedish PO file for 'gcc' (version 12.1-b20220213)

2022-03-20 Thread Translation Project Robot
Hello, gentle maintainer. This is a message from the Translation Project robot. A revised PO file for textual domain 'gcc' has been submitted by the Swedish team of translators. The file is available at: https://translationproject.org/latest/gcc/sv.po (This file, 'gcc-12.1-b20220213.sv.po'

Re: [PATCH] fortran: use fpu-glibc on powerpc*-unknown-freebsd

2022-03-20 Thread Segher Boessenkool
> From: Piotr Kubaj > Date: Sun, 20 Mar 2022 13:03:13 +0100 > Subject: [PATCH] fortran: on powerpc*-unknown-freebsd*, also use fpu-glibc > > It builds fine and correctly builds packages on FreeBSD. > It looks like the "fpu-glibc" name is a bit misleading since it also works > on FreeBSD. > > Signed-o

Re: [PATCH] fortran: use fpu-glibc on powerpc*-unknown-freebsd

2022-03-20 Thread FX via Gcc-patches
Hi, (Please send all Fortran (front-end and libgfortran) patches in CC to the Fortran list.) Please hold from pushing the patch as is, I have some questions: - If FreeBSD has feenableexcept() and related functions, it should already use the fpu-glibc code, because of this: if test "x${have_fe

[committed] Fix testsuite fallout from pr104960 change

2022-03-20 Thread Jeff Law via Gcc-patches
Recent changes twiddled the output for s390/arch13/sel-1.c, causing testsuite failures. As far as I can tell both sequences are equivalent from a performance standpoint. This patch changes the test to accept both forms. Installed on the trunk, Jeff commit 6f30c4cd38b8385d62358065d795df590e7

Re: [PATCH v9 04/12] LoongArch Port: Machine description files.

2022-03-20 Thread Richard Sandiford via Gcc-patches
Thanks, this addresses most of my comments from the v8 review. There were a couple left over though: chenglulu writes: > +(define_attr "compression" "none,all" > + (const_string "none")) I still don't understand the purpose of keeping this for LoongArch. > +(define_insn "truncdisi2_extended" >

[PATCH] rtl-ssa: Fix prev/next_def confusion [PR104869]

2022-03-20 Thread Richard Sandiford via Gcc-patches
rtl-ssa chains definitions into an RPO list. It also groups sequences of clobbers together into a single node, so that it's possible to skip over the clobbers in constant time in order to get the next or previous set. When adding a clobber to an insn, the main DF barriers for that clobber are the

Re: [PATCH] AVX512FP16: Fix masm=intel output for vfc?(madd|mul)csh [PR 104977]

2022-03-20 Thread Hongtao Liu via Gcc-patches
On Sat, Mar 19, 2022 at 8:06 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > This patch fixes a typo in subst for the scalar complex mask_round operand. > > Bootstrapped/regtested on x86_64-pc-linux-gnu{-m32,} and sde. > > Ok for master? > Ok. > gcc/ChangeLog: > > PR target/104977 > * c

Re: [PATCH] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch [PR 104978]

2022-03-20 Thread Hongtao Liu via Gcc-patches
On Sat, Mar 19, 2022 at 8:09 AM Hongyu Wang via Gcc-patches wrote: > > Hi, > > For complex scalar intrinsics like _mm_mask_fcmadd_sch, the > mask should be ANDed with 1 to ensure the mask is bound to the lowest byte. > > Bootstrapped/regtested on x86_64-pc-linux-gnu{-m32,} and sde. > > Ok for master? > > gcc

Re: [PATCH] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch [PR 104978]

2022-03-20 Thread Hongyu Wang via Gcc-patches
> Would it be better to use vmovss under avx512vl without & 1 for mask. vmovss clears the upper bits, but the intrinsic requires src1. We still need either a mask move or blend for the high part. LLVM generates mask & 1 for these intrinsics. Hongtao Liu via Gcc-patches wrote on Mon, Mar 21, 2022 at 09:08: >

Re: [PATCH] AVX512FP16: Fix wrong code for _mm_mask_f[c]madd.*sch [PR 104978]

2022-03-20 Thread Hongtao Liu via Gcc-patches
On Mon, Mar 21, 2022 at 9:22 AM Hongyu Wang wrote: > > > Would it be better to use vmovss under avx512vl without & 1 for mask. > > vmovss clears the upper bits, but the intrinsic requires src1. We > still need either a mask move or blend for the high part. not for __m128 _mm_mask_move_ss (__m128 s
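
For readers following the PR 104978 thread, here is a rough lane-level model of the _mm_mask_move_ss behaviour being discussed. It is a sketch in plain Python, not GCC or intrinsic code: the function name mask_move_ss and the list-of-four-floats representation are assumptions made for illustration. It models only that the low element is chosen by bit 0 of the mask and that the upper three elements are copied from a.

```python
def mask_move_ss(src, k, a, b):
    """Hypothetical model of _mm_mask_move_ss on 4-lane lists (lane 0 is the low element)."""
    # Only bit 0 of the mask is consulted for the scalar (low) element.
    low = b[0] if (k & 1) else src[0]
    # The upper three lanes are always copied from a.
    return [low] + list(a[1:4])


src = [10.0, 11.0, 12.0, 13.0]
a = [20.0, 21.0, 22.0, 23.0]
b = [30.0, 31.0, 32.0, 33.0]

selected = mask_move_ss(src, 0b1, a, b)                  # bit 0 set: low lane from b
kept = mask_move_ss(src, 0b0, a, b)                      # bit 0 clear: low lane from src
ignored_high_bits = mask_move_ss(src, 0b11111110, a, b)  # upper mask bits have no effect
```

In this model the explicit `k & 1` plays the same role as the "mask & 1" mentioned in the thread: any upper bits set in the 8-bit mask must not influence the scalar result.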