date:20210802

[PATCH] Add cond_add/sub/mul for vector integer modes.

2021-08-02 Thread liuhongt via Gcc-patches

Hi: This is a follow up of [1]. Bootstrapped and regtested on x86_64-linux-gnu{-m32,}. Pushed to trunk. [1] https://gcc.gnu.org/pipermail/gcc-patches/2021-August/576514.html gcc/ChangeLog: * config/i386/sse.md (cond_): New expander. (cond_mul): Ditto. gcc/testsuite/ChangeLo

Re: Re: [PATCH] Fix ICE when mixing VLAs and statement expressions [PR91038]

2021-08-02 Thread Martin Uecker

(resending from a different account, as emails seem to do not go out from my other account at this time) Am Montag, den 02.08.2021, 16:05 +0200 schrieb Martin Uecker: > > On Sun, Aug 1, 2021 at 7:37 PM Uecker, Martin > > wrote: > > > > > > > > > Here is an attempt to fix some old and annoyin

[PATCH] x86: Add testcases for PR target/80566

2021-08-02 Thread H.J. Lu via Gcc-patches

PR target/80566 * g++.target/i386/pr80566-1.C: New test. * g++.target/i386/pr80566-2.C: Likewise. --- gcc/testsuite/g++.target/i386/pr80566-1.C | 15 +++ gcc/testsuite/g++.target/i386/pr80566-2.C | 14 ++ 2 files changed, 29 insertions(+) create mod

Re: [PATCH 4/6] Support -fexcess-precision=16 which will enable FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when backend supports _Float16.

2021-08-02 Thread Hongtao Liu via Gcc-patches

On Tue, Aug 3, 2021 at 3:34 AM Joseph Myers wrote: > > On Mon, 2 Aug 2021, liuhongt via Gcc-patches wrote: > > > diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c > > index 7979e240426..dc673c89bc8 100644 > > --- a/gcc/config/i386/i386.c > > +++ b/gcc/config/i386/i386.c > > @@ -23352,6

Re: [PATCH] x86: Use XMM31 for scratch SSE register

2021-08-02 Thread Hongtao Liu via Gcc-patches

On Tue, Aug 3, 2021 at 1:48 AM H.J. Lu via Gcc-patches wrote: > > In 64-bit mode, use XMM31 for scratch SSE register to avoid vzeroupper > if possible. > > gcc/ > > * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode, > try XMM31 to avoid vzeroupper. LGTM. > > gcc/tests

Re: [PATCH] Support cond_add/sub/mul/div for vector float/double.

2021-08-02 Thread Hongtao Liu via Gcc-patches

On Mon, Aug 2, 2021 at 6:20 PM Richard Biener via Gcc-patches wrote: > > On Mon, Aug 2, 2021 at 6:33 AM liuhongt wrote: > > > > Hi: > > This patch supports cond_add/sub/mul/div expanders for vector > > float/double. > > There're still cond_fma/fms/fnms/fma/max/min/xor/ior/and left which I >

Re: [PATCH 42/55] rs6000: Handle gimple folding of target built-ins

2021-08-02 Thread Segher Boessenkool

On Mon, Aug 02, 2021 at 08:31:43AM -0500, Bill Schmidt wrote: > Interestingly, when the quadword compares are expanded at GIMPLE time, > we generate worse code involving individual 64-bit compares. For the > time being, I will not expand these at GIMPLE time; independently, this > bears looking

Re: [PATCH v3 2/2] rs6000: Add test for _mm_minpos_epu16

2021-08-02 Thread Segher Boessenkool

Hi! On Thu, Jul 15, 2021 at 06:29:18PM -0500, Paul A. Clarke wrote: > Copy the test for _mm_minpos_epu16 from > gcc/testsuite/gcc.target/i386/sse4_1-phminposuw.c, with > a few adjustments: > --- /dev/null > +++ b/gcc/testsuite/gcc.target/powerpc/sse4_1-phminposuw.c > @@ -0,0 +1,68 @@ > +/* { dg-d

Go patch committed: Allow converting from slice to pointer-to-array

2021-08-02 Thread Ian Lance Taylor via Gcc-patches

The upcoming Go 1.17 release has a new language feature: it permits conversions from slice types to pointer-to-array types. If the slice is too short, the conversion panics. This patch implements this new feature in gccgo. Bootstrapped and ran Go testsuite on x86_64-pc-linux-gnu. Committed to m

Re: [PATCH v3 1/2] rs6000: Add support for _mm_minpos_epu16

2021-08-02 Thread Segher Boessenkool

Hi! On Thu, Jul 15, 2021 at 06:29:17PM -0500, Paul A. Clarke wrote: > Add a naive implementation of the subject x86 intrinsic to > ease porting. > --- a/gcc/config/rs6000/smmintrin.h > +++ b/gcc/config/rs6000/smmintrin.h > @@ -172,4 +172,31 @@ _mm_test_mix_ones_zeros (__m128i __A, __m128i __mask)

Go patch committed: Support unsafe.Add and unsafe.Slice

2021-08-02 Thread Ian Lance Taylor via Gcc-patches

The upcoming Go 1.17 release adds two new functions to the unsafe package: unsafe.Add and unsafe.Slice. These functions must be implemented in the compiler. This patch implements them for gccgo. Bootstrapped and ran Go testsuite on x86_64-pc-linux-gnu. Committed to mainline. Ian 06d0437d4a5faca

Warn for reads from write-only arguments [PR101734]

2021-08-02 Thread Martin Sebor via Gcc-patches

The write_only mode to attribute access specifies that the pointer applies to is used to write to the referenced object but not read from it. A function that uses the pointer to read the referenced object might rely on the contents of uninitialized memory and so such attempts should be diagnose

Re: [PATCH] Objective-C: don't require redundant -fno-objc-sjlj-exceptions for the NeXT v2 ABI

2021-08-02 Thread Matt Jacobson via Gcc-patches

> On Aug 2, 2021, at 5:09 PM, Eric Gallager wrote: > > On Wed, Jul 28, 2021 at 11:36 PM Matt Jacobson via Gcc-patches > wrote: >> >> As is, an invocation of GCC with -fnext-runtime -fobjc-abi-version=2 crashes, >> unless target-specific code adds an implicit -fno-objc-sjlj-exceptions (which

Re: [PATCH] Objective-C: don't require redundant -fno-objc-sjlj-exceptions for the NeXT v2 ABI

2021-08-02 Thread Eric Gallager via Gcc-patches

On Wed, Jul 28, 2021 at 11:36 PM Matt Jacobson via Gcc-patches wrote: > > As is, an invocation of GCC with -fnext-runtime -fobjc-abi-version=2 crashes, > unless target-specific code adds an implicit -fno-objc-sjlj-exceptions (which > Darwin does). > > This patch makes the general case not crash. >

[PATCH, rs6000] Add store fusion support for Power10

2021-08-02 Thread Pat Haugen via Gcc-patches

Enable store fusion on Power10. Use the SCHED_REORDER hook to implement Power10 specific ready list reordering. As of now, pairing stores for store fusion is the only function being performed. Bootstrap/regtest on powerpc64le(Power10) with no new regressions. Ok for master? -Pat 2021-08-02 Pa

Re: [PATCH 4/6] Support -fexcess-precision=16 which will enable FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when backend supports _Float16.

2021-08-02 Thread Joseph Myers

On Mon, 2 Aug 2021, liuhongt via Gcc-patches wrote: > diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c > index 7979e240426..dc673c89bc8 100644 > --- a/gcc/config/i386/i386.c > +++ b/gcc/config/i386/i386.c > @@ -23352,6 +23352,8 @@ ix86_get_excess_precision (enum excess_precision_type

[PATCH] x86: Use XMM31 for scratch SSE register

2021-08-02 Thread H.J. Lu via Gcc-patches

In 64-bit mode, use XMM31 for scratch SSE register to avoid vzeroupper if possible. gcc/ * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode, try XMM31 to avoid vzeroupper. gcc/testsuite/ * gcc.target/i386/avx-vzeroupper-14.c: Pass -mno-avx512f to dis

Re: [PATCH 2/2] Ada: Remove debug line number for DECL_IGNORED_P functions

2021-08-02 Thread Bernd Edlinger

On 8/2/21 3:07 PM, Eric Botcazou wrote: >> It was pointed out in PR101598 to be inappropriate, that >> ignored Ada decls receive the source line number which was >> recorded in the function decl's DECL_SOURCE_LOCATION. >> Therefore set all front-end-generated Ada decls with >> DECL_IGNORED_P to UNK

Re: [PATCH v7 03/10] x86: Update piecewise move and store

2021-08-02 Thread Uros Bizjak via Gcc-patches

On Mon, Aug 2, 2021 at 4:57 PM H.J. Lu wrote: > > On Mon, Aug 2, 2021 at 4:20 AM Uros Bizjak wrote: > > > > On Fri, Jul 30, 2021 at 11:32 PM H.J. Lu wrote: > > > > > > We can use TImode/OImode/XImode integers for piecewise move and store. > > > > > > 1. Define MAX_MOVE_MAX to 64, which is the co

[committed] libstdc++: Fix filesystem::temp_directory_path [PR101709]

2021-08-02 Thread Jonathan Wakely via Gcc-patches

On 30/07/21 18:13 +0100, Jonathan Wakely wrote: This adds a configure check for the GNU extension secure_getenv and then uses it for looking up TMPDIR and similar variables. Signed-off-by: Jonathan Wakely libstdc++-v3/ChangeLog: PR libstdc++/65018 * configure.ac: Check for sec

[committed] libstc++: Add dg-error for additional error in C++11 mode

2021-08-02 Thread Jonathan Wakely via Gcc-patches

When the comparison with a nullptr_t is ill-formed, there is an additional error for C++11 mode due to the constexpr function body being invalid. Signed-off-by: Jonathan Wakely libstdc++-v3/ChangeLog: * testsuite/20_util/tuple/comparison_operators/overloaded2.cc: Add dg-error fo

[PUSHED] Remove --param=threader-iterative.

2021-08-02 Thread Aldy Hernandez via Gcc-patches

This was meant to be an internal construct, but I see folks are using it and submitting PRs against it. Let's just remove this to avoid further confusion. Tested on x86-64 Linux. Pushed to keep PRs down and Jeff happy :). gcc/ChangeLog: PR tree-optimization/101724 * params.opt:

[PATCH v7 03/10] x86: Update piecewise move and store

2021-08-02 Thread H.J. Lu via Gcc-patches

On Mon, Aug 2, 2021 at 4:20 AM Uros Bizjak wrote: > > On Fri, Jul 30, 2021 at 11:32 PM H.J. Lu wrote: > > > > We can use TImode/OImode/XImode integers for piecewise move and store. > > > > 1. Define MAX_MOVE_MAX to 64, which is the constant maximum number of > > bytes that a single instruction ca

[PATCH 3/3] arm: reorder assembler architecture directives [PR101723]

2021-08-02 Thread Richard Earnshaw via Gcc-patches

A change to the way gas interprets the .fpu directive in binutils-2.34 means that issuing .fpu will clear any features set by .arch_extension that apply to the floating point or simd units. This unfortunately causes problems for more recent versions of the architecture because we currently emit .

[PATCH 2/3] arm: Don't reconfigure globals in arm_configure_build_target

2021-08-02 Thread Richard Earnshaw via Gcc-patches

arm_configure_build_target is usually used to reconfigure the arm_active_target structure, which is then used to reconfigure a number of other global variables describing the current target. Occasionally, however, we need to use arm_configure_build_target to construct a temporary target structure

[PATCH 1/3] arm: ensure the arch_name is always set for the build target

2021-08-02 Thread Richard Earnshaw via Gcc-patches

This should never happen now if GCC is invoked by the driver, but in the unusual case of calling cc1 (or its ilk) directly from the command line the build target's arch_name string can remain NULL. This can complicate later processing meaning that we need to check for this case explicitly in some

[PATCH 0/3] arm: fix problems when targetting extended FPUs [PR101723]

2021-08-02 Thread Richard Earnshaw via Gcc-patches

This patch series addresses an issue that has come to light due to a change in the way GAS handles .fpu directives in the assembler. A fix to the assembler made in binutils 2.34 to clear out all features realated to the FPU when .fpu is emitted has started causing problems for GCC because of the o

[PATCH 2/2] Rewrite more vector loads to scalar loads

2021-08-02 Thread Richard Biener

This teaches forwprop to rewrite more vector loads that are only used in BIT_FIELD_REFs as scalar loads. This provides the remaining uplift to SPEC CPU 2017 510.parest_r on Zen 2 which has CPU gathers disabled. In particular vector load + vec_unpack + bit-field-ref is turned into (extending) scal

[PATCH 1/2] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Biener

This adds a gather vectorization capability to the vectorizer without target support by decomposing the offset vector, doing sclar loads and then building a vector from the result. This is aimed mainly at cases where vectorizing the rest of the loop offsets the cost of vectorizing the gather. Not

Re: [PATCH 42/55] rs6000: Handle gimple folding of target built-ins

2021-08-02 Thread Bill Schmidt via Gcc-patches

Hi Will, On 7/29/21 7:42 AM, Bill Schmidt wrote: On 7/28/21 4:21 PM, will schmidt wrote: On Thu, 2021-06-17 at 10:19 -0500, Bill Schmidt via Gcc-patches wrote: +/* Vector compares; EQ, NE, GE, GT, LE. */ +case RS6000_BIF_VCMPEQUB: +case RS6000_BIF_VCMPEQUH: +case RS6000_BIF_V

Re: [PATCH 1/2] Fix debug info for ignored decls at start of assembly

2021-08-02 Thread Richard Biener

On Fri, 30 Jul 2021, Bernd Edlinger wrote: > > > On 7/29/21 9:23 AM, Richard Biener wrote: > > On Wed, 28 Jul 2021, Bernd Edlinger wrote: > > > >> On 7/28/21 2:51 PM, Richard Biener wrote: > >>> On Mon, 26 Jul 2021, Bernd Edlinger wrote: > >>> > Ignored functions decls that are compiled at

Re: [PATCH] libstdc++: Skip atomic instructions in _Sp_counted_base::_M_release when both counts are 1

2021-08-02 Thread Maged Michael via Gcc-patches

This is the right patch. The previous one is missing noexcept. Sorry. On Mon, Aug 2, 2021 at 9:23 AM Maged Michael wrote: > Please find attached an updated patch after incorporating Jonathan's > suggestions. > > Changes from the last patch include: > - Add a TSAN macro to bits/c++config. > - Us

Re: [PATCH] libstdc++: Skip atomic instructions in _Sp_counted_base::_M_release when both counts are 1

2021-08-02 Thread Maged Michael via Gcc-patches

Please find attached an updated patch after incorporating Jonathan's suggestions. Changes from the last patch include: - Add a TSAN macro to bits/c++config. - Use separate constexpr bool-s for the conditions for lock-freedom, double-width and alignment. - Move the code in the optimized path to a s

[PATCH, v2, libgomp, OpenMP 5.0] Implement omp_get_device_num

2021-08-02 Thread Chung-Lin Tang

On 2021/7/23 6:39 PM, Jakub Jelinek wrote: On Fri, Jul 23, 2021 at 06:21:41PM +0800, Chung-Lin Tang wrote: --- a/libgomp/icv-device.c +++ b/libgomp/icv-device.c @@ -61,8 +61,17 @@ omp_is_initial_device (void) return 1; } +int +omp_get_device_num (void) +{ + /* By specification, this i

Re: [PATCH, libgomp, OpenMP 5.0] Implement omp_get_device_num

2021-08-02 Thread Chung-Lin Tang

On 2021/7/23 7:01 PM, Tobias Burnus wrote: I personally prefer having: int initial_dev; and inside 'omp target' (with 'map(from:initial_dev)'): initial_device = omp_is_initial_device(); Then the check would be: if (initial_device && host_device_num != device_num) abort(); i

Re: [PATCH 2/2] Ada: Remove debug line number for DECL_IGNORED_P functions

2021-08-02 Thread Eric Botcazou

> It was pointed out in PR101598 to be inappropriate, that > ignored Ada decls receive the source line number which was > recorded in the function decl's DECL_SOURCE_LOCATION. > Therefore set all front-end-generated Ada decls with > DECL_IGNORED_P to UNKNOWN_LOCATION. > > 2021-07-24 Bernd Edlinge

Re: [PATCH] Add a simple fraction class

2021-08-02 Thread Richard Biener via Gcc-patches

On Mon, Aug 2, 2021 at 1:31 PM Richard Sandiford wrote: > > Richard Biener writes: > > On Mon, Aug 2, 2021 at 12:43 PM Richard Sandiford > > wrote: > >> > >> Richard Biener via Gcc-patches writes: > >> > On Fri, Jul 30, 2021 at 5:59 PM Richard Sandiford via Gcc-patches > >> > wrote: > >> >> >

Re: [PATCH] Add a simple fraction class

2021-08-02 Thread Richard Sandiford via Gcc-patches

Richard Biener writes: > On Mon, Aug 2, 2021 at 12:43 PM Richard Sandiford > wrote: >> >> Richard Biener via Gcc-patches writes: >> > On Fri, Jul 30, 2021 at 5:59 PM Richard Sandiford via Gcc-patches >> > wrote: >> >> >> >> This patch adds a simple class for holding A/B fractions. >> >> As the

Re: [PATCH v6 03/10] x86: Update piecewise move and store

2021-08-02 Thread Uros Bizjak via Gcc-patches

On Fri, Jul 30, 2021 at 11:32 PM H.J. Lu wrote: > > We can use TImode/OImode/XImode integers for piecewise move and store. > > 1. Define MAX_MOVE_MAX to 64, which is the constant maximum number of > bytes that a single instruction can move quickly between memory and > registers or between two memo

Re: [PATCH] Add a simple fraction class

2021-08-02 Thread Richard Biener via Gcc-patches

On Mon, Aug 2, 2021 at 12:43 PM Richard Sandiford wrote: > > Richard Biener via Gcc-patches writes: > > On Fri, Jul 30, 2021 at 5:59 PM Richard Sandiford via Gcc-patches > > wrote: > >> > >> This patch adds a simple class for holding A/B fractions. > >> As the comments in the patch say, the clas

Re: [PATCH] Add a simple fraction class

2021-08-02 Thread Richard Sandiford via Gcc-patches

Richard Biener via Gcc-patches writes: > On Fri, Jul 30, 2021 at 5:59 PM Richard Sandiford via Gcc-patches > wrote: >> >> This patch adds a simple class for holding A/B fractions. >> As the comments in the patch say, the class isn't designed >> to have nice numerial properties at the extremes. >>

Re: [PATCH] Support cond_add/sub/mul/div for vector float/double.

2021-08-02 Thread Richard Biener via Gcc-patches

On Mon, Aug 2, 2021 at 6:33 AM liuhongt wrote: > > Hi: > This patch supports cond_add/sub/mul/div expanders for vector float/double. > There're still cond_fma/fms/fnms/fma/max/min/xor/ior/and left which I > failed to figure out a testcase to validate them. > Also cond_add/sub/mul for vector i

Re: [PATCH] Fix ICE when mixing VLAs and statement expressions [PR91038]

2021-08-02 Thread Richard Biener via Gcc-patches

On Sun, Aug 1, 2021 at 7:37 PM Uecker, Martin wrote: > > > > Here is an attempt to fix some old and annoying bugs related > to VLAs and statement expressions. In particulary, this seems > to fix the issues with variably-modified types which are > returned from statement expressions (which works on

Re: [PATCH] Optimize x ? bswap(x) : 0 in tree-ssa-phiopt

2021-08-02 Thread Richard Biener via Gcc-patches

On Sat, Jul 31, 2021 at 9:56 AM Roger Sayle wrote: > > > Many thanks again to Jakub Jelinek for a speedy fix for PR 101642. > Interestingly, that test case "bswap16(x) ? : x" also reveals a > missed optimization opportunity. The resulting "x ? bswap(x) : 0" > can be further simplified to just bsw

Re: [PATCH] Fix PR 101683: FP exceptions for float->unsigned

2021-08-02 Thread Richard Biener via Gcc-patches

On Fri, Jul 30, 2021 at 9:22 PM apinski--- via Gcc-patches wrote: > > From: Andrew Pinski > > Just like the old bug PR9651, unsigned_fix rtl should > also be handled as a trapping instruction. > > OK? Bootstrapped and tested on x86_64-linux-gnu with no regressions. OK. > gcc/ChangeLog: > >

Re: [PATCH] gcov-profile/71672 Fix indirect call inlining with AutoFDO

2021-08-02 Thread Richard Biener via Gcc-patches

On Fri, Jul 30, 2021 at 9:09 AM Eugene Rozenfeld via Gcc-patches wrote: > > This patch has the following changes: > > 1. The main fix is in auto-profile.c: the histogram value for >indirect calls was incorrectly set up. That is fixed now. > > 2. Several tests now have -fdump-ipa-afdo-optimized

Re: [PATCH] Add a simple fraction class

2021-08-02 Thread Richard Biener via Gcc-patches

On Fri, Jul 30, 2021 at 5:59 PM Richard Sandiford via Gcc-patches wrote: > > This patch adds a simple class for holding A/B fractions. > As the comments in the patch say, the class isn't designed > to have nice numerial properties at the extremes. > > The motivating use case was some aarch64 costi

Re: [PATCH] Fix typos in move_sese_region_to_fn

2021-08-02 Thread Richard Biener via Gcc-patches

On Fri, Jul 30, 2021 at 1:15 PM Kewen.Lin wrote: > > Hi, > > This patch is to fix the typos in the move_sese_region_to_fn. > As mentioned here [1], I tried to debug the test case > gcc.dg/graphite/pr83359.c with trunk, but I found it didn't > go into the hunk guard with "if (moved_orig_loop_num)".

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Kewen.Lin via Gcc-patches

on 2021/8/2 下午5:11, Richard Biener wrote: > On Mon, 2 Aug 2021, Kewen.Lin wrote: > >> on 2021/8/2 下午3:09, Richard Biener wrote: >>> On Mon, 2 Aug 2021, Kewen.Lin wrote: >>> on 2021/7/30 下午10:04, Kewen.Lin via Gcc-patches wrote: > Hi Richi, > > on 2021/7/30 下午7:34, Richard Biener w

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Biener

On Mon, 2 Aug 2021, Kewen.Lin wrote: > on 2021/8/2 下午3:09, Richard Biener wrote: > > On Mon, 2 Aug 2021, Kewen.Lin wrote: > > > >> on 2021/7/30 下午10:04, Kewen.Lin via Gcc-patches wrote: > >>> Hi Richi, > >>> > >>> on 2021/7/30 下午7:34, Richard Biener wrote: > This adds a gather vectorization

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Kewen.Lin via Gcc-patches

on 2021/8/2 下午3:09, Richard Biener wrote: > On Mon, 2 Aug 2021, Kewen.Lin wrote: > >> on 2021/7/30 下午10:04, Kewen.Lin via Gcc-patches wrote: >>> Hi Richi, >>> >>> on 2021/7/30 下午7:34, Richard Biener wrote: This adds a gather vectorization capability to the vectorizer without target suppo

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Sandiford via Gcc-patches

Richard Biener writes: > On Mon, 2 Aug 2021, Richard Sandiford wrote: > >> Richard Biener writes: >> > On Fri, 30 Jul 2021, Richard Sandiford wrote: >> >> > @@ -9456,6 +9499,51 @@ vectorizable_load (vec_info *vinfo, >> >> > data_ref = NULL_TREE; >> >> >

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Biener

On Mon, 2 Aug 2021, Richard Sandiford wrote: > Richard Biener writes: > > On Fri, 30 Jul 2021, Richard Sandiford wrote: > >> > @@ -9456,6 +9499,51 @@ vectorizable_load (vec_info *vinfo, > >> > data_ref = NULL_TREE; > >> > break; > >> >

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Sandiford via Gcc-patches

Richard Biener writes: > On Fri, 30 Jul 2021, Richard Sandiford wrote: >> > @@ -9456,6 +9499,51 @@ vectorizable_load (vec_info *vinfo, >> >data_ref = NULL_TREE; >> >break; >> > } >> > + else if (memory_access_type == VMAT_GATHER

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Biener

On Fri, 30 Jul 2021, Richard Sandiford wrote: > Richard Biener writes: > > This adds a gather vectorization capability to the vectorizer > > without target support by decomposing the offset vector, doing > > sclar loads and then building a vector from the result. This > > is aimed mainly at case

Re: [PATCH] Add emulated gather capability to the vectorizer

2021-08-02 Thread Richard Biener

On Mon, 2 Aug 2021, Kewen.Lin wrote: > on 2021/7/30 下午10:04, Kewen.Lin via Gcc-patches wrote: > > Hi Richi, > > > > on 2021/7/30 下午7:34, Richard Biener wrote: > >> This adds a gather vectorization capability to the vectorizer > >> without target support by decomposing the offset vector, doing > >

56 matches

Mail list logo