date:20161115

Re: Optimise CONCAT handling in emit_group_load

2016-11-15 Thread Eric Botcazou

> 2016-11-15 Richard Sandiford > Alan Hayward > David Sherwood > > * expr.c (emit_group_load_1): Tighten check for whether an > access involves only one operand of a CONCAT. Use extract_bit_field > for constants if the bit range does span the whole ope

Re: [PATCH] fix PR68468

2016-11-15 Thread Jakub Jelinek

On Wed, Nov 16, 2016 at 07:31:59AM +0100, Waldemar Brodkorb wrote: > > On Wed, Nov 09, 2016 at 04:08:39PM +0100, Bernd Schmidt wrote: > > > On 11/05/2016 06:14 PM, Waldemar Brodkorb wrote: > > > >Hi, > > > > > > > >the following patch fixes PR68468. > > > >Patch is used for a while in Buildroot wit

Re: [PATCH] fix PR68468

2016-11-15 Thread Waldemar Brodkorb

Hi, Jakub Jelinek wrote, > On Wed, Nov 09, 2016 at 04:08:39PM +0100, Bernd Schmidt wrote: > > On 11/05/2016 06:14 PM, Waldemar Brodkorb wrote: > > >Hi, > > > > > >the following patch fixes PR68468. > > >Patch is used for a while in Buildroot without issues. > > > > > >2016-11-05 Waldemar Brodkorb

Re: Rework subreg_get_info

2016-11-15 Thread Joseph Myers

On Tue, 15 Nov 2016, Richard Sandiford wrote: > Richard Sandiford writes: > > This isn't intended to change the behaviour, just rewrite the > > existing logic in a different (and hopefully clearer) way. > > The new form -- particularly the part based on the "block" > > concept -- is easier to con

Re: C++ PATCH for C++17 selection statements with initializer

2016-11-15 Thread Marek Polacek

On Sat, Nov 05, 2016 at 10:03:37PM -0400, David Edelsohn wrote: > The patch adds testcase init-statement6.C, which includes the declaration > > extern void publish (int), raise (int); > > POSIX defines > > int raise (int); > > in which gets included by the C++ headers for the testcase on AIX.

Re: [patch] remove more GCJ references

2016-11-15 Thread Matthias Klose

On 15.11.2016 23:03, Eric Gallager wrote: > On 11/15/16, Matthias Klose wrote: >> On 15.11.2016 21:41, Matthias Klose wrote: >>> On 15.11.2016 16:52, Jeff Law wrote: On 11/15/2016 03:55 AM, Matthias Klose wrote: > This patch removes some references to gcj in the top level and config >

[PATCH, IRA] PR78325, R_MIPS_JALR failures

2016-11-15 Thread Alan Modra

This is a fix for my PR70890 patch, which incorrectly removed all REG_EQUIV notes rather than just one regarding a reg that dies. Bootstrapped and regression tested powerpc64le-linux, x86_64-linux, and mips-linux. OK to apply? PR rtl-optimization/78325 PR rtl-optimization/70890

Re: [patch] remove more GCJ references

2016-11-15 Thread Eric Gallager

On 11/15/16, Matthias Klose wrote: > On 15.11.2016 21:41, Matthias Klose wrote: >> On 15.11.2016 16:52, Jeff Law wrote: >>> On 11/15/2016 03:55 AM, Matthias Klose wrote: This patch removes some references to gcj in the top level and config directories and in the gcc documentation. The c

Re: [PATCH] Significantly reduce memory usage of genattrtab

2016-11-15 Thread Richard Sandiford

Bernd Edlinger writes: > On 11/15/16 13:21, Richard Sandiford wrote: >> Bernd Edlinger writes: >>> Hi! >>> >>> The genattrtab build-tool uses way too much memory in general. >>> I think there is no other build step that uses more memory. >>> >>> On the currently trunk it takes around 700MB to bui

[PATCH] Follow-up patch on enabling new AVX512 instructions

2016-11-15 Thread Andrew Senkevich

Hi, this is follow-up with tests for new __target__ attributes and __builtin_cpu_supports update. gcc/ * config/i386/i386.c (processor_features): Add F_AVX5124VNNIW, F_AVX5124FMAPS. (isa_names_table): Handle new features. libgcc/ * config/i386/cpuinfo.c (processor_

Re: Use df_read_modify_subreg_p in cprop.c

2016-11-15 Thread Jeff Law

On 11/15/2016 09:27 AM, Richard Sandiford wrote: local_cprop_find_used_regs punted on all multiword registers, with the comment: /* Setting a subreg of a register larger than word_mode leaves the non-written words unchanged. */ But this only applies if the outer mode is smaller

Re: Add a mem_alias_size helper class

2016-11-15 Thread Richard Sandiford

Eric Botcazou writes: >> alias.c encodes memory sizes as follows: >> >> size > 0: the exact size is known >> size == 0: the size isn't known >> size < 0: the exact size of the reference itself is known, >> but the address has been aligned via AND. In this case >> "-size" includes the size of

Re: [PATCH] Significantly reduce memory usage of genattrtab

2016-11-15 Thread Bernd Edlinger

On 11/15/16 13:21, Richard Sandiford wrote: > Bernd Edlinger writes: >> Hi! >> >> The genattrtab build-tool uses way too much memory in general. >> I think there is no other build step that uses more memory. >> >> On the currently trunk it takes around 700MB to build the >> ARM latency tab files.

Re: [PATCH, Fortran, pr78356, v1] [7 Regression] [OOP] segfault allocating polymorphic variable with polymorphic component with allocatable component

2016-11-15 Thread Janus Weil

Hi Andre, > attached patch fixes the issue raised. The issue here was, that a copy of the > base class was generated and its address passed to the _vptr->copy()-method, > which then accessed memory, that was not present in the copy being an object > of > the base class. The patch fixes this by ma

[PATCH] spellcheck bugfixes: don't offer the goal string as a suggestion

2016-11-15 Thread David Malcolm

This patch addresses various bugs in the spellcheck code in which the goal string somehow makes it into the candidate list. The goal string will always have an edit distance of 0 to itself, and thus is the "closest" string to the goal, but offering it as a suggestion will always be nonsensical e.g.

Re: [PATCH][PPC] Fix ICE using power9 with soft-float

2016-11-15 Thread Andrew Stubbs

On 15/11/16 21:06, Michael Meissner wrote: Now, that I have a little time, I can look into this, to at least make predicate and peepholes match. There is some other stuff (support for the new load/store that were added to the compiler after that we should also tackle). I've been investigating

Re: [PATCH 9/9] Add "__RTL" to cc1 (v4)

2016-11-15 Thread David Malcolm

On Mon, 2016-11-14 at 16:14 +0100, Richard Biener wrote: > On Fri, Nov 11, 2016 at 10:15 PM, David Malcolm > wrote: > > Changed in this version: > > > > * Rather than running just one pass, run *all* passes, but start at > > the given pass; support for "dg-do run" tests that execute the > > r

Re: [PATCH][PPC] Fix ICE using power9 with soft-float

2016-11-15 Thread Michael Meissner

On Mon, Nov 14, 2016 at 04:57:58PM +, Andrew Stubbs wrote: > The testcase powerpc/fusion3.c causes an ICE when compiled with > -msoft-float. > > The key line in the testcase looks fairly harmless: > >void fusion_float_write (float *p, float f){ p[LARGE] = f; } LARGE is large enough that

Re: [PATCH 5/9] Add patterns and predicates foutline-msabi-xlouges

2016-11-15 Thread Daniel Santos

On 11/15/2016 02:06 PM, Daniel Santos wrote: +;; Save multiple registers out-of-line after realignment +(define_insn "save_multiple_realign" + [(match_parallel 0 "save_multiple" +[(use (match_operand:P 1 "symbol_operand")) + (set (reg:P SP_REG) (plus:P (reg:P AX_REG) + (match_ope

Re: [patch] remove more GCJ references

2016-11-15 Thread Matthias Klose

On 15.11.2016 21:41, Matthias Klose wrote: > On 15.11.2016 16:52, Jeff Law wrote: >> On 11/15/2016 03:55 AM, Matthias Klose wrote: >>> This patch removes some references to gcj in the top level and config >>> directories and in the gcc documentation. The change to the config >>> directory >>> req

Re: Fix vec_cmp comparison mode

2016-11-15 Thread Jeff Law

On 11/15/2016 09:49 AM, Richard Sandiford wrote: vec_cmps assign the result of a vector comparison to a mask. The optab was called with the destination having mode mask_mode but with the source (the comparison) having mode VOIDmode, which led to invalid rtl if the source operand was used directly

Re: [patch] remove more GCJ references

2016-11-15 Thread Matthias Klose

On 15.11.2016 16:52, Jeff Law wrote: > On 11/15/2016 03:55 AM, Matthias Klose wrote: >> This patch removes some references to gcj in the top level and config >> directories and in the gcc documentation. The change to the config directory >> requires regenerating aclocal.m4 and configure in each su

[Patch, Fortran] PR 66227: [5/6/7 Regression] [OOP] EXTENDS_TYPE_OF n returns wrong result for polymorphic variable allocated to extended type

2016-11-15 Thread Janus Weil

Hi all, the attached patch fixes a wrong-code problem with the intrinsic function EXTENDS_TYPE_OF. The simplification function which tries to reduce calls to EXTENDS_TYPE_OF to a compile-time constant (if possible) was a bit over-zealous and simplified cases that were actually not decidable at com

Re: [PATCH,rs6000] Add built-in function support for Power9 byte instructions

2016-11-15 Thread Segher Boessenkool

On Tue, Nov 15, 2016 at 12:16:19PM -0700, Kelvin Nilsen wrote: > The reason I am using SI mode is so that I don't have to disqualify the > use of these functions on a 32-bit big-endian configuration. > > Do you want me to switch to DI mode for all the operands? SI is fine, and can give slightly b

[PATCH] PR 59406 note that FNV hash functions are incorrect

2016-11-15 Thread Jonathan Wakely

The PR points out that our FNV hash functions don't correctly implement the FNV-1a function. Since the code is only kept for backwards compatibility we probably don't want to change the results, so this just adds comments to point out the issue. PR libstdc++/59406 * include/bits/f

[PATCH] Fix PR77848

2016-11-15 Thread Bill Schmidt

Hi, https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77848 identifies a situation where if-conversion causes degradation when the if-converted loop is not subsequently vectorized. The if-conversion pass does not have a cost model to avoid such degradations. However, it does have a capability to vers

[PATCH 9/9] Add remainder of foutline-msabi-xlogues implementation

2016-11-15 Thread Daniel Santos

Adds functions emit_msabi_outlined_save and emit_msabi_outlined_restore, which are called from ix86_expand_prologue and ix86_expand_epilogue, respectively. --- gcc/config/i386/i386.c | 307 ++--- 1 file changed, 288 insertions(+), 19 deletions(-) diff -

[PATCH 6/9] Adds class xlouge_layout to i386.c

2016-11-15 Thread Daniel Santos

This C++ class adds the basic support for foutline-msabi-xlogues by manging the layout (where registers are stored based upon and other facets of the optimization) and providing the proper symbol rtx for the required stub. xlouge_layout should not be used until a call to ix86_compute_frame_layout

[PATCH 8/9] Modify ix86_compute_frame_layout for foutline-msabi-xlogues

2016-11-15 Thread Daniel Santos

ix86_compute_frame_layout will now populate fields added to structs machine_function and ix86_frame and modify the frame layout specific to facilitate the use of save & restore stubs. --- gcc/config/i386/i386.c | 117 - 1 file changed, 116 insertions

[PATCH 5/9] Add patterns and predicates foutline-msabi-xlouges

2016-11-15 Thread Daniel Santos

Adds the predicates save_multiple and restore_multiple to predicates.md, which are used by following patterns in sse.md: * save_multiple - insn that calls a save stub * save_multiple_realign - insn that calls a save stub and also manages a realign and hard frame pointer * restore_multiple - call

[PATCH 2/9] Minor refactor in ix86_compute_frame_layout

2016-11-15 Thread Daniel Santos

This refactor is separated from a future patch that actually alters ix86_compute_frame_layout. --- gcc/config/i386/i386.c | 13 +++-- 1 file changed, 7 insertions(+), 6 deletions(-) diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c index 56cc67d..5ed8fb6 100644 --- a/gcc/config

[PATCH 7/9] Modify ix86_save_reg to optionally omit stub-managed registers

2016-11-15 Thread Daniel Santos

Adds static HARD_REG_SET stub_managed_regs to track registers that will be managed by the pro/epilogue stubs for the function. Adds a third parameter bool ignore_outlined to ix86_save_reg to specify rather or not the count should include registers marked in stub_managed_regs. --- gcc/config/i386/

[PATCH 3/9] Add msabi pro/epilogue stubs to libgcc

2016-11-15 Thread Daniel Santos

Adds libgcc/config/i386/i386-asm.h to manage common cpp and gas macros. stubs use the following naming convention: (sav|res)ms64[f][x] save|resSave or restore ms64Avoid possible name collisions with future stubs (specific to 64-bit msabi --> sysv scenario)

[PATCH 1/9] Change type of x86_64_ms_sysv_extra_clobbered_registers

2016-11-15 Thread Daniel Santos

This will need to be unsigned for a subsequent patch. Also adds the constant NUM_X86_64_MS_CLOBBERED_REGS for brievity. --- gcc/config/i386/i386.c | 8 +++- gcc/config/i386/i386.h | 4 +++- 2 files changed, 6 insertions(+), 6 deletions(-) diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/

[PATCH 4/9] Add struct fields and option for foutline-msabi-xlouges

2016-11-15 Thread Daniel Santos

Adds fountline-msabi-xlogues to common.opt and various fields to structs machine_function and ix86_frame --- gcc/common.opt | 7 +++ gcc/config/i386/i386.c | 35 ++- gcc/config/i386/i386.h | 18 ++ 3 files changed, 55 insertions(+), 5 de

[PATCH 0/9] RFC: Add optimization -foutline-msabi-xlougues (for Wine 64)

2016-11-15 Thread Daniel Santos

Due to differences between the 64-bit Microsoft and System V ABIs, any msabi function that calls a sysv function must consider RSI, RDI and XMM6-15 as clobbered. The result is that such functions are bloated with SSE saves/restores costing as much as 106 bytes each (up to 200-ish bytes per func

Re: [patch,libgfortran] PR51119 - MATMUL slow for large matrices

2016-11-15 Thread Janne Blomqvist

On Tue, Nov 15, 2016 at 6:37 PM, Jerry DeLisle wrote: > All comments incorporated. Standing by for approval. Looks good, nice job! Ok for trunk. I was thinking that for strided arrays, it probably is faster to copy them to dense arrays before doing the matrix multiplication. That would also enab

Re: [Patch] Remove variant, variant and variant<>

2016-11-15 Thread Tim Shen

On Tue, Nov 15, 2016 at 11:31 AM, Jonathan Wakely wrote: > On 15/11/16 12:08 +, Jonathan Wakely wrote: >> >> On 12/11/16 12:11 -0800, Tim Shen wrote: >>> >>> At Issaquah we decided to remove the supports above. >> >> >> OK with a suitable ChangeLog, thanks. > > > I've adjusted your ChangeLog e

Re: Add a mem_alias_size helper class

2016-11-15 Thread Eric Botcazou

> alias.c encodes memory sizes as follows: > > size > 0: the exact size is known > size == 0: the size isn't known > size < 0: the exact size of the reference itself is known, > but the address has been aligned via AND. In this case > "-size" includes the size of the reference and the worst-c

[PATCH, i386 testsuite]: Move common 32-bit and 64-bit function specific options to an include file

2016-11-15 Thread Uros Bizjak

Hello! Just noticed that we don't test many function specific options on 64-bit targets. 2016-11-15 Uros Bizjak * gcc.target/i386/funcspec-56.inc: New file. * gcc.target/i386.funcspec-5.c: Include funcspec-56.inc. Remove common 32-bit and 64-bit function specific options. * g

[PATCH] Make std::tuple_size SFINAE-friendly (LWG 2770)

2016-11-15 Thread Jonathan Wakely

This is needed to avoid problems with the new structured bindings feature that landed in trunk yesterday. As part of this patch I'm removing the docs for the DR 2742 and 2748 changes that I added earlier today. The manual doesn't need to track changes against new features that only appear in trun

Re: [PATCH] Add std::string constructor for substring of string_view (LWG 2742)

2016-11-15 Thread Jonathan Wakely

On 15/11/16 14:33 +, Jonathan Wakely wrote: This is another issue resolution for C++17 features that was approved at the recent meeting. I think this resolution is wrong too, but in this case the fix is obvious so I've gone ahead and done it. * doc/xml/manual/intro.xml: Document LWG

Re: [Patch] Remove variant, variant and variant<>

2016-11-15 Thread Jonathan Wakely

On 15/11/16 12:08 +, Jonathan Wakely wrote: On 12/11/16 12:11 -0800, Tim Shen wrote: At Issaquah we decided to remove the supports above. OK with a suitable ChangeLog, thanks. I've adjusted your ChangeLog entry to fit under 80 columns with TAB set to 8 spaces. I've also adjusted the pret

Re: [1/9][RFC][DWARF] Reserve three DW_OP numbers in vendor extension space

2016-11-15 Thread Richard Earnshaw (lists)

On 15/11/16 16:48, Jiong Wang wrote: > > > On 15/11/16 16:18, Jakub Jelinek wrote: >> On Tue, Nov 15, 2016 at 04:00:40PM +, Jiong Wang wrote: >Takes one signed LEB128 offset and retrieves 8-byte contents > from the address >calculated by CFA plus this offset, the contents

Re: [PATCH,rs6000] Add built-in function support for Power9 byte instructions

2016-11-15 Thread Kelvin Nilsen

> >> Thanks for catching this. I think I got endian confusion inside my head >> while I was writing the above. I will rewrite these comments, below also. > > Note the ISA calls the bits in 32-bit registers 32..63, so that 63 is > the rightmost bit in all registers. > True, but the ISA only u

Re: Add a load_extend_op wrapper

2016-11-15 Thread Jeff Law

On 11/15/2016 11:56 AM, Jeff Law wrote: On 11/15/2016 11:12 AM, Richard Sandiford wrote: Jeff Law writes: On 11/15/2016 05:42 AM, Richard Sandiford wrote: LOAD_EXTEND_OP only applies to scalar integer modes that are narrower than a word. However, callers weren't consistent about which of the

Re: Add a load_extend_op wrapper

2016-11-15 Thread Jeff Law

On 11/15/2016 11:12 AM, Richard Sandiford wrote: Jeff Law writes: On 11/15/2016 05:42 AM, Richard Sandiford wrote: LOAD_EXTEND_OP only applies to scalar integer modes that are narrower than a word. However, callers weren't consistent about which of these checks they made beforehand, and also

Re: [PATCH v2] aarch64: Add split-stack initial support

2016-11-15 Thread Wilco Dijkstra

On 07/11/2016 16:59, Adhemerval Zanella wrote: > On 14/10/2016 15:59, Wilco Dijkstra wrote: > There is no limit afaik on gold split stack allocation handling, > and I think one could be added for each backend (in the method > override require to implement it). > > In fact it is not really requir

[PATCH] rs6000: Separate shrink-wrapping for the FPRs

2016-11-15 Thread Segher Boessenkool

This implements separate shrink-wrapping for the save/restore of the floating point registers. It regression checks fine, but that does not test on many big floating point routines (and neither does the bootstrap itself). So I'm not proposing this for trunk just yet. I'll write a changelog if th

Re: Add a load_extend_op wrapper

2016-11-15 Thread Eric Botcazou

> 2016-11-15 Richard Sandiford > Alan Hayward > David Sherwood > > * rtl.h (load_extend_op): Declare. > * rtlanal.c (load_extend_op): New function. I'd make it an inline function. -- Eric Botcazou

Re: Fix handling of unknown sizes in rtx_addr_can_trap_p

2016-11-15 Thread Richard Sandiford

Jeff Law writes: > On 11/15/2016 09:21 AM, Richard Sandiford wrote: >> If the size passed in to rtx_addr_can_trap_p was zero, the frame >> handling would get the size from the mode instead. However, this >> too can be zero if the mode is BLKmode, i.e. if we have a BLKmode >> memory reference with

Re: [PATCH,rs6000] Add built-in function support for Power9 byte instructions

2016-11-15 Thread Segher Boessenkool

On Tue, Nov 15, 2016 at 11:05:07AM -0700, Kelvin Nilsen wrote: > >>* config/rs6000/altivec.md (UNSPEC_CMPRB): New unspec value. > >>(UNSPEC_CMPRB2): New unspec value. > > > > I wonder if you really need both? The number of arguments will tell > > which is which, anyway? > > I appreciate

Re: Fix instances of gen_rtx_REG (VOIDmode, ...)

2016-11-15 Thread Jeff Law

On 11/15/2016 09:52 AM, Richard Sandiford wrote: Several definitions of INCOMING_RETURN_ADDR_RTX used gen_rtx_REG (VOIDmode, ...), which with later patches would trip an assert. This patch converts them to use Pmode instead. Tested on aarch64-linux-gnu and x86_64-linux-gnu. OK to install? Tha

Re: Add a load_extend_op wrapper

2016-11-15 Thread Richard Sandiford

Jeff Law writes: > On 11/15/2016 05:42 AM, Richard Sandiford wrote: >> LOAD_EXTEND_OP only applies to scalar integer modes that are narrower >> than a word. However, callers weren't consistent about which of these >> checks they made beforehand, and also weren't consistent about whether >> "small

[hsa-branch] Replace all omp references of GPGPU with HSA grid

2016-11-15 Thread Martin Jambor

Hi, this is the last patch to the hsa branch before using it to create the merge-to-trunk patches. Basically, it replaces all references to "GPGPU gridification" in omp-low.c to "HSA gridification" as requested by Jakub at the Cauldron. Committed to the HSA branch, it is part of the posted patch

[hsa branch] Move hsa headers to plugin libgomp directory

2016-11-15 Thread Martin Jambor

Hi, this is one of the last two commits to the hsa branch I made while preparing the merge to trunk, it moves headers generated from HSA documentation to the plugin directory. Committed to the HSA branch, it is part of the posted patches merging it to trunk. Thanks, Martin 2016-11-12 Martin

Re: [PATCH,rs6000] Add built-in function support for Power9 byte instructions

2016-11-15 Thread Kelvin Nilsen

Thank you very much for the prompt and thorough review. There are a few points below where I'd like to seek further clarification. On 11/15/2016 04:19 AM, Segher Boessenkool wrote: > Hi! > > On Mon, Nov 14, 2016 at 04:43:35PM -0700, Kelvin Nilsen wrote: >> * config/rs6000/altivec.md (UNSPE

Re: Move misplaced assignment in num_sign_bit_copies1

2016-11-15 Thread Richard Sandiford

Eric Botcazou writes: >> 2016-11-15 Richard Sandiford >> Alan Hayward >> David Sherwood >> >> * rtlanal.c (num_sign_bit_copies1): Calculate bitwidth after >> handling VOIDmode. > > OK, thanks, but please change the comment too, I think "For a smaller mode"

Re: [patch] remove more GCJ references

2016-11-15 Thread Matthias Klose

On 15.11.2016 16:52, Jeff Law wrote: > On 11/15/2016 03:55 AM, Matthias Klose wrote: >> This patch removes some references to gcj in the top level and config >> directories and in the gcc documentation. The change to the config directory >> requires regenerating aclocal.m4 and configure in each su

Re: [C++ PATCH] SOme further g++.dg/cpp1z/decomp*.C tests

2016-11-15 Thread Jason Merrill

OK. On Tue, Nov 15, 2016 at 9:13 AM, Jakub Jelinek wrote: > Hi! > > This patch adds 3 new tests. Tested on x86_64-linux, ok for trunk? > > 2016-11-15 Jakub Jelinek > > * g++.dg/cpp1z/decomp13.C: New test. > * g++.dg/cpp1z/decomp14.C: New test. > * g++.dg/cpp1z/decomp15

Re: [PATCH] Add map clauses to libgomp test device-3.f90

2016-11-15 Thread Alexander Monakov

On Tue, 15 Nov 2016, Alexander Monakov wrote: > Yep, I do see new test execution failures with both Intel MIC and PTX > offloading > on device-1.f90, device-3.f90 and target2.f90. Here's an actually-tested > patch > for the first two (on target2.f90 there's a different problem). And here's a pa

Re: [PATCH/AARCH64] Have the verbose cost model output output be controllable

2016-11-15 Thread James Greenhalgh

On Tue, Nov 15, 2016 at 08:48:04AM -0800, Andrew Pinski wrote: > On Fri, Oct 7, 2016 at 1:01 AM, Kyrill Tkachov > wrote: > > Hi Andrew, > > > > > > On 24/09/16 06:46, Andrew Pinski wrote: > >> > >> Hi, > >>As reported in PR 61367, the aarch64 back-end is too verbose when it > >> is dealing wit

Re: [C++ PATCH] Add mangling for P0217R3 decompositions at namespace scope

2016-11-15 Thread Jason Merrill

OK. On Tue, Nov 15, 2016 at 9:12 AM, Jakub Jelinek wrote: > Hi! > > On the following testcase we ICE, because the underlying artificial decls > have NULL DECL_NAME (intentional), thus mangling is not able to figure out > what to do. This patch attempts to follow the > http://sourcerytools.com/pi

Re: [PATCH] Add map clauses to libgomp test device-3.f90

2016-11-15 Thread Jakub Jelinek

On Tue, Nov 15, 2016 at 07:52:56PM +0300, Alexander Monakov wrote: > On Mon, 14 Nov 2016, Alexander Monakov wrote: > > On Mon, 14 Nov 2016, Martin Jambor wrote: > > > > > Hi, > > > > > > yesterday I forgot to send out the following patch. The test > > > libgomp/testsuite/libgomp.fortran/examples

Re: [PATCH] libiberty: Fix some demangler crashes caused by reading past end of input.

2016-11-15 Thread Ian Lance Taylor

On Mon, Nov 14, 2016 at 1:19 AM, Mark Wielaard wrote: > In various situations the cplus_demangle () function could read past the > end of input causing crashes. Add checks in various places to not advance > the demangle string location and fail early when end of string is reached. > Add various ex

Re: [PATCH][PR libgfortran/78314] Fix ieee_support_halting

2016-11-15 Thread FX

> disabling/enabling makes this api a lot heavier > than before, but trapping cannot be decided at > compile-time, although the result may be cached, > i think this should not be a frequent operation. > > otoh rereading my patch i think i fail to restore > the original exception state correctly.

Re: [PATCH] Add map clauses to libgomp test device-3.f90

2016-11-15 Thread Alexander Monakov

On Mon, 14 Nov 2016, Alexander Monakov wrote: > On Mon, 14 Nov 2016, Martin Jambor wrote: > > > Hi, > > > > yesterday I forgot to send out the following patch. The test > > libgomp/testsuite/libgomp.fortran/examples-4/device-3.f90 was failing > > for me when I was testing the HSA branch merge bu

Fix instances of gen_rtx_REG (VOIDmode, ...)

2016-11-15 Thread Richard Sandiford

Several definitions of INCOMING_RETURN_ADDR_RTX used gen_rtx_REG (VOIDmode, ...), which with later patches would trip an assert. This patch converts them to use Pmode instead. Tested on aarch64-linux-gnu and x86_64-linux-gnu. OK to install? Thanks, Richard [ This patch is part of the SVE seri

Re: [PATCH][PPC] Fix ICE using power9 with soft-float

2016-11-15 Thread Andrew Stubbs

On 15/11/16 12:29, Segher Boessenkool wrote: The peepholes do not support it, or maybe the define_insns do not either. The machine of course will not care. Oh, OK, so probably the bug is not in the peephole at all, but in the define_insn, or lack thereof. More investigation required. Thanks

Re: [1/9][RFC][DWARF] Reserve three DW_OP numbers in vendor extension space

2016-11-15 Thread Jiong Wang

On 15/11/16 16:18, Jakub Jelinek wrote: On Tue, Nov 15, 2016 at 04:00:40PM +, Jiong Wang wrote: Takes one signed LEB128 offset and retrieves 8-byte contents from the address calculated by CFA plus this offset, the contents then authenticated as per A key for instruction pointer usin

Fix vec_cmp comparison mode

2016-11-15 Thread Richard Sandiford

vec_cmps assign the result of a vector comparison to a mask. The optab was called with the destination having mode mask_mode but with the source (the comparison) having mode VOIDmode, which led to invalid rtl if the source operand was used directly. Tested on aarch64-linux-gnu and x86_64-linux-gnu

Re: [1/9][RFC][DWARF] Reserve three DW_OP numbers in vendor extension space

2016-11-15 Thread Jiong Wang

On 15/11/16 16:18, Jakub Jelinek wrote: On Tue, Nov 15, 2016 at 04:00:40PM +, Jiong Wang wrote: Takes one signed LEB128 offset and retrieves 8-byte contents from the address calculated by CFA plus this offset, the contents then authenticated as per A key for instruction pointer us

Re: [PATCH/AARCH64] Have the verbose cost model output output be controllable

2016-11-15 Thread Andrew Pinski

On Fri, Oct 7, 2016 at 1:01 AM, Kyrill Tkachov wrote: > Hi Andrew, > > > On 24/09/16 06:46, Andrew Pinski wrote: >> >> Hi, >>As reported in PR 61367, the aarch64 back-end is too verbose when it >> is dealing with the cost model. I tend to agree, no other back-end is >> this verbose. So I dec

Re: Rework subreg_get_info

2016-11-15 Thread Richard Sandiford

Richard Sandiford writes: > This isn't intended to change the behaviour, just rewrite the > existing logic in a different (and hopefully clearer) way. > The new form -- particularly the part based on the "block" > concept -- is easier to convert to polynomial sizes. > > Tested on aarch64-linux-gnu

Re: [PATCH] Add sem_item::m_hash_set (PR ipa/78309)

2016-11-15 Thread Jan Hubicka

> Hi. > > As seen on ppc64le during compilation of Firefox with LTO, combining inchash > value > with a pointer, enum value and an integer, one can eventually get zero value. > Thus I decided to introduce a new flag that would distinguish between not set > hash value > and a valid and (possibly)

Re: [PATCH] Add sem_item::m_hash_set (PR ipa/78309)

2016-11-15 Thread Jeff Law

On 11/15/2016 09:43 AM, Martin Liška wrote: Hi. As seen on ppc64le during compilation of Firefox with LTO, combining inchash value with a pointer, enum value and an integer, one can eventually get zero value. Thus I decided to introduce a new flag that would distinguish between not set hash va

[PATCH] Add sem_item::m_hash_set (PR ipa/78309)

2016-11-15 Thread Martin Liška

Hi. As seen on ppc64le during compilation of Firefox with LTO, combining inchash value with a pointer, enum value and an integer, one can eventually get zero value. Thus I decided to introduce a new flag that would distinguish between not set hash value and a valid and (possibly) zero value. I'

Re: Fix handling of unknown sizes in rtx_addr_can_trap_p

2016-11-15 Thread Jeff Law

On 11/15/2016 09:21 AM, Richard Sandiford wrote: If the size passed in to rtx_addr_can_trap_p was zero, the frame handling would get the size from the mode instead. However, this too can be zero if the mode is BLKmode, i.e. if we have a BLKmode memory reference with no MEM_SIZE (which should be

Rework subreg_get_info

2016-11-15 Thread Richard Sandiford

This isn't intended to change the behaviour, just rewrite the existing logic in a different (and hopefully clearer) way. The new form -- particularly the part based on the "block" concept -- is easier to convert to polynomial sizes. Tested on aarch64-linux-gnu and x86_64-linux-gnu. OK to install?

Re: [PATCH][PR libgfortran/78314] Fix ieee_support_halting

2016-11-15 Thread Szabolcs Nagy

On 15/11/16 16:22, FX wrote: >> There seems to be a separate api for checking trapping support: >> ieee_support_halting, but it only checked if the exception status >> flags are available, so check trapping support too by enabling >> and disabling traps. > > Thanks for the patch. > > I am worried

Re: [patch,libgfortran] PR51119 - MATMUL slow for large matrices

2016-11-15 Thread Jerry DeLisle

On 11/15/2016 07:59 AM, Jerry DeLisle wrote: On 11/14/2016 11:22 PM, Thomas Koenig wrote: Hi Jerry, With these changes, OK for trunk? Just going over this with a fine comb... One thing just struck me: The loop variables should be index_type, so const index_type m = xcount, n = ycou

Re: [RFC][PATCH] Remove a bad use of SLOW_UNALIGNED_ACCESS

2016-11-15 Thread Jeff Law

On 11/01/2016 03:39 PM, Wilco Dijkstra wrote: Jeff Law wrote: I think you'll need to look at bz61320 before this could go in. I had a look, but there is nothing there that is related - eventually a latent alignment bug was fixed in IVOpt. Excellent. Thanks for digging into what really hap

Use df_read_modify_subreg_p in cprop.c

2016-11-15 Thread Richard Sandiford

local_cprop_find_used_regs punted on all multiword registers, with the comment: /* Setting a subreg of a register larger than word_mode leaves the non-written words unchanged. */ But this only applies if the outer mode is smaller than the inner mode. If they're the same size then

Add more subreg offset helpers

2016-11-15 Thread Richard Sandiford

Provide versions of subreg_lowpart_offset and subreg_highpart_offset that work on mode sizes rather than modes. Also provide a routine that converts an lsb position to a subreg offset. The intent (in combination with later patches) is to move the handling of the BYTES_BIG_ENDIAN != WORDS_BIG_ENDI

C++ PATCH for c++/78358 (decltype and decomposition)

2016-11-15 Thread Jason Merrill

OK, (hopefully) one more patch for decltype and C++17 decomposition declarations. I hadn't been thinking that "referenced type" meant to look through references in the tuple case, since other parts of [dcl.decomp] define "the referenced type" directly, but that does seem to be how it's used elsewh

Re: [PATCH] Enable Intel AVX512_4FMAPS and AVX512_4VNNIW instructions

2016-11-15 Thread Andrew Senkevich

2016-11-15 17:56 GMT+03:00 Jeff Law : > On 11/15/2016 05:55 AM, Andrew Senkevich wrote: >> >> 2016-11-11 14:16 GMT+03:00 Uros Bizjak : >>> >>> --- a/gcc/genmodes.c >>> +++ b/gcc/genmodes.c >>> --- a/gcc/init-regs.c >>> +++ b/gcc/init-regs.c >>> --- a/gcc/machmode.h >>> +++ b/gcc/machmode.h >>> >>>

[PATCH, Fortran, pr78356, v1] [7 Regression] [OOP] segfault allocating polymorphic variable with polymorphic component with allocatable component

2016-11-15 Thread Andre Vehreschild

Hi all, attached patch fixes the issue raised. The issue here was, that a copy of the base class was generated and its address passed to the _vptr->copy()-method, which then accessed memory, that was not present in the copy being an object of the base class. The patch fixes this by making sure the

Optimise CONCAT handling in emit_group_load

2016-11-15 Thread Richard Sandiford

The CONCAT handling in emit_group_load chooses between doing an extraction from a single component or forcing the whole thing to memory and extracting from there. The condition for the former (more efficient) option was: if ((bytepos == 0 && bytelen == slen0) || (bytepos !

RE: [PATCH] MIPS/GCC: Mark text contents as code or data

2016-11-15 Thread Matthew Fortune

Maciej Rozycki writes: > gcc/ > * config/mips/mips-protos.h (mips_set_text_contents_type): New > prototype. > * config/mips/mips.h (ASM_OUTPUT_BEFORE_CASE_LABEL): New macro. > (ASM_OUTPUT_CASE_END): Likewise. > * config/mips/mips.c (mips_set_text_contents_type):

Re: [PATCH][PR libgfortran/78314] Fix ieee_support_halting

2016-11-15 Thread FX

Hi, > There seems to be a separate api for checking trapping support: > ieee_support_halting, but it only checked if the exception status > flags are available, so check trapping support too by enabling > and disabling traps. Thanks for the patch. I am worried about the unnecessary operations th

Re: Use MEM_SIZE rather than GET_MODE_SIZE in dce.c

2016-11-15 Thread Jeff Law

On 11/15/2016 09:17 AM, Richard Sandiford wrote: Using MEM_SIZE is more general, since it copes with cases where targets are forced to use BLKmode references for whatever reason. Thanks, Richard [ This patch is part of the SVE series posted here: https://gcc.gnu.org/ml/gcc/2016-11/msg00030.h

Fix handling of unknown sizes in rtx_addr_can_trap_p

2016-11-15 Thread Richard Sandiford

If the size passed in to rtx_addr_can_trap_p was zero, the frame handling would get the size from the mode instead. However, this too can be zero if the mode is BLKmode, i.e. if we have a BLKmode memory reference with no MEM_SIZE (which should be rare these days). This meant that the conditions fo

Re: Some backward threader refactoring

2016-11-15 Thread Jeff Law

On 11/14/2016 02:39 AM, Jeff Law wrote: I was looking at the possibility of dropping threading from VRP1/VRP2 or DOM1/DOM2 in favor of the backwards threader -- the obvious idea being to recover some compile-time for gcc-7. Of the old-style threader passes (VRP1, VRP2, DOM1, DOM2), VRP2 is by

Re: [1/9][RFC][DWARF] Reserve three DW_OP numbers in vendor extension space

2016-11-15 Thread Jakub Jelinek

On Tue, Nov 15, 2016 at 04:00:40PM +, Jiong Wang wrote: > >> Takes one signed LEB128 offset and retrieves 8-byte contents from the > >> address > >> calculated by CFA plus this offset, the contents then authenticated as > >> per A > >> key for instruction pointer using current CFA as sa

Use MEM_SIZE rather than GET_MODE_SIZE in dce.c

2016-11-15 Thread Richard Sandiford

Using MEM_SIZE is more general, since it copes with cases where targets are forced to use BLKmode references for whatever reason. Thanks, Richard [ This patch is part of the SVE series posted here: https://gcc.gnu.org/ml/gcc/2016-11/msg00030.html ] gcc/ 2016-11-15 Richard Sandiford

Tweak LRA handling of shared spill slots

2016-11-15 Thread Richard Sandiford

The previous code processed the users of a stack slot in order of decreasing size and allocated the slot based on the first user. This seems a bit dangerous, since the ordering is based on the mode of the biggest reference while the allocation is based also on the size of the register itself (which

Re: Use simplify_gen_binary in canon_rtx

2016-11-15 Thread Jeff Law

On 11/15/2016 09:07 AM, Richard Sandiford wrote: After simplifying the operands of a PLUS, canon_rtx checked only for cases in which one of the simplified operands was a constant, falling back to gen_rtx_PLUS otherwise. This left the PLUS in a non-canonical order if one of the simplified operand

[PATCH][PR libgfortran/78314] Fix ieee_support_halting

2016-11-15 Thread Szabolcs Nagy

When fpu trapping is enabled in libgfortran, the return value of feenableexcept is not checked. Glibc reports there if the operation was unsuccessful which happens if the target has no trapping support. There seems to be a separate api for checking trapping support: ieee_support_halting, but it o

Use simplify_gen_binary in canon_rtx

2016-11-15 Thread Richard Sandiford

After simplifying the operands of a PLUS, canon_rtx checked only for cases in which one of the simplified operands was a constant, falling back to gen_rtx_PLUS otherwise. This left the PLUS in a non-canonical order if one of the simplified operands was (plus (reg R1) (const_int X)); we'd end up wi

Add a mem_alias_size helper class

2016-11-15 Thread Richard Sandiford

alias.c encodes memory sizes as follows: size > 0: the exact size is known size == 0: the size isn't known size < 0: the exact size of the reference itself is known, but the address has been aligned via AND. In this case "-size" includes the size of the reference and the worst-case number o

1 2 >

1 - 100 of 164 matches

Mail list logo