Re: 2nd Ping: Re: [PATCH v3] doc: Document order of define_peephole2 scanning
On Thu, May 4, 2023 at 2:10 AM Hans-Peter Nilsson via Gcc-patches wrote: > > Ping again. OK. > > From: Hans-Peter Nilsson > > Date: Thu, 27 Apr 2023 01:55:24 +0200 > > > > > From: Hans-Peter Nilsson > > > Date: Wed, 19 Apr 2023 18:59:14 +0200 > > [...] > > > > > So again: Approvers: pdf output reviewed. Ok to commit? > > > -- >8 -- > > > I was a bit surprised when my newly-added define_peephole2 didn't > > > match, but it was because it was expected to partially match the > > > generated output of a previous define_peephole2, which matched and > > > modified the last insn of a sequence to be matched. I had assumed > > > that the algorithm backed-up the size of the match-buffer, thereby > > > exposing newly created opportunities *with sufficient context* to all > > > define_peephole2's. While things can change in that direction, let's > > > start with documenting the current state. > > > > > > * doc/md.texi (define_peephole2): Document order of scanning. > > > --- > > > gcc/doc/md.texi | 9 + > > > 1 file changed, 9 insertions(+) > > > > > > diff --git a/gcc/doc/md.texi b/gcc/doc/md.texi > > > index 07bf8bdebffb..300d104d58ab 100644 > > > --- a/gcc/doc/md.texi > > > +++ b/gcc/doc/md.texi > > > @@ -9362,6 +9362,15 @@ If the preparation falls through (invokes neither > > > @code{DONE} nor > > > @code{FAIL}), then the @code{define_peephole2} uses the replacement > > > template. > > > > > > +Insns are scanned in forward order from beginning to end for each basic > > > +block. Matches are attempted in order of @code{define_peephole2} > > > +appearance in the @file{md} file. After a successful replacement, > > > +scanning for further opportunities for @code{define_peephole2}, resumes > > > +with the first generated replacement insn as the first insn to be > > > +matched against all @code{define_peephole2}. For the example above, > > > +after its successful replacement, the first insn that can be matched by > > > +a @code{define_peephole2} is @code{(set (match_dup 4) (match_dup 1))}. > > > + > > > @end ifset > > > @ifset INTERNALS > > > @node Insn Attributes > > > -- > > > 2.30.2 > > > > >
2nd Ping: Re: [PATCH v3] doc: Document order of define_peephole2 scanning
Ping again. > From: Hans-Peter Nilsson > Date: Thu, 27 Apr 2023 01:55:24 +0200 > > > From: Hans-Peter Nilsson > > Date: Wed, 19 Apr 2023 18:59:14 +0200 > [...] > > > So again: Approvers: pdf output reviewed. Ok to commit? > > -- >8 -- > > I was a bit surprised when my newly-added define_peephole2 didn't > > match, but it was because it was expected to partially match the > > generated output of a previous define_peephole2, which matched and > > modified the last insn of a sequence to be matched. I had assumed > > that the algorithm backed-up the size of the match-buffer, thereby > > exposing newly created opportunities *with sufficient context* to all > > define_peephole2's. While things can change in that direction, let's > > start with documenting the current state. > > > > * doc/md.texi (define_peephole2): Document order of scanning. > > --- > > gcc/doc/md.texi | 9 + > > 1 file changed, 9 insertions(+) > > > > diff --git a/gcc/doc/md.texi b/gcc/doc/md.texi > > index 07bf8bdebffb..300d104d58ab 100644 > > --- a/gcc/doc/md.texi > > +++ b/gcc/doc/md.texi > > @@ -9362,6 +9362,15 @@ If the preparation falls through (invokes neither > > @code{DONE} nor > > @code{FAIL}), then the @code{define_peephole2} uses the replacement > > template. > > > > +Insns are scanned in forward order from beginning to end for each basic > > +block. Matches are attempted in order of @code{define_peephole2} > > +appearance in the @file{md} file. After a successful replacement, > > +scanning for further opportunities for @code{define_peephole2}, resumes > > +with the first generated replacement insn as the first insn to be > > +matched against all @code{define_peephole2}. For the example above, > > +after its successful replacement, the first insn that can be matched by > > +a @code{define_peephole2} is @code{(set (match_dup 4) (match_dup 1))}. > > + > > @end ifset > > @ifset INTERNALS > > @node Insn Attributes > > -- > > 2.30.2 > > >
Ping: Re: [PATCH v3] doc: Document order of define_peephole2 scanning
> From: Hans-Peter Nilsson > Date: Wed, 19 Apr 2023 18:59:14 +0200 [...] > So again: Approvers: pdf output reviewed. Ok to commit? > -- >8 -- > I was a bit surprised when my newly-added define_peephole2 didn't > match, but it was because it was expected to partially match the > generated output of a previous define_peephole2, which matched and > modified the last insn of a sequence to be matched. I had assumed > that the algorithm backed-up the size of the match-buffer, thereby > exposing newly created opportunities *with sufficient context* to all > define_peephole2's. While things can change in that direction, let's > start with documenting the current state. > > * doc/md.texi (define_peephole2): Document order of scanning. > --- > gcc/doc/md.texi | 9 + > 1 file changed, 9 insertions(+) > > diff --git a/gcc/doc/md.texi b/gcc/doc/md.texi > index 07bf8bdebffb..300d104d58ab 100644 > --- a/gcc/doc/md.texi > +++ b/gcc/doc/md.texi > @@ -9362,6 +9362,15 @@ If the preparation falls through (invokes neither > @code{DONE} nor > @code{FAIL}), then the @code{define_peephole2} uses the replacement > template. > > +Insns are scanned in forward order from beginning to end for each basic > +block. Matches are attempted in order of @code{define_peephole2} > +appearance in the @file{md} file. After a successful replacement, > +scanning for further opportunities for @code{define_peephole2}, resumes > +with the first generated replacement insn as the first insn to be > +matched against all @code{define_peephole2}. For the example above, > +after its successful replacement, the first insn that can be matched by > +a @code{define_peephole2} is @code{(set (match_dup 4) (match_dup 1))}. > + > @end ifset > @ifset INTERNALS > @node Insn Attributes > -- > 2.30.2 >
Re: [PATCH v3] doc: Document order of define_peephole2 scanning
> Date: Wed, 19 Apr 2023 22:16:36 +0200 > From: Bernhard Reutner-Fischer > On 19 April 2023 21:21:18 CEST, Bernhard Reutner-Fischer > wrote: > >Hi H-P! > > > >This begs the question iff now (i fear it's not), upon > >successful match(es), the whole peepholes get re-run > >again per BB (ATM?), exposing more opportunities? (Not sure IIUC the question, but:) no; as mentioned the peephole2 machinery doesn't back up to include the context of longer sequences when having done replacement for a shorter match. It resumes matching at the beginning of the shorter, matched and replaced, sequence. > >If not, would we want to retry, at least for > >-fexpensive-optimisations (sp?) or would all hell break > >loose? I don't think hell would break loose, but maybe slowdown and/or buggy define_peephole2s would be weeded out. Or something else entirely unexpected. :) > >Please also see below. > > > >On 19 April 2023 18:59:14 CEST, Hans-Peter Nilsson via Gcc-patches > > wrote: > >>Anyway, the missing-context problem I ran into remains: if > >>you have an insn sequence {foo bar} and a define_peephole2 > >>matching and replacing {bar} into {baz}, the resulting {foo > >>baz} *will not be matched* against a define_peephole2 > >>looking for {foo baz}. But, I'm not trying to document this > >>caveat specifically, though at least it'll now be implied by > >>the documentation. > >> > >>This could be fixed by always backing up MAX_INSNS_PER_PEEP2 > >>- 1 insns after a successful replacement. I'm somewhat > >>worries that this would also mean lots of futile re-match > >>attempts. Thoughts? > > > >Good point. Probably. Do you have stats? None whatsoever. I'm going to test with the "original" define_peephole2 for CRIS that was suffering, but with an "extra" define_peephole2 that also does the modification for the one that caused it do be missed and see if the original free one still fires off. (I see I cut down my foo bar baz examples too much to make sense to explain that.) > >If there is even a slight benefit, then some certainly > >would be willing to take that penalty for sure. I.e. iff > >it helps -Os or locality then such expensive > >optimisations certainly provide benefit for at least a > >few if not some, IMO. Right. Not sure I'll be doing it though, having other things, gcc-related and others, on my plate. If so, first I'd do a crude attempt at getting statistics for x86_64, by after a successful replacement, restarting at the beginning of a BB and checking whether any define_peephole2 fires off before reaching the first replaced insn. brgds, H-P
Re: [PATCH v3] doc: Document order of define_peephole2 scanning
[+list] On 19 April 2023 21:21:18 CEST, Bernhard Reutner-Fischer wrote: >Hi H-P! > >This begs the question iff now (i fear it's not), upon successful match(es), >the whole peepholes get re-run again per BB (ATM?), exposing more >opportunities? >If not, would we want to retry, at least for -fexpensive-optimisations (sp?) >or would all hell break loose? > >Please also see below. > >On 19 April 2023 18:59:14 CEST, Hans-Peter Nilsson via Gcc-patches > wrote: >>> From: Hans-Peter Nilsson >>> Date: Wed, 19 Apr 2023 06:06:27 +0200 >>> >>> Patch retracted, at least temporarily. My "understanding" >>> may be clouded by looking at an actual bug. Sigh. >> >>Mea culpa. I was looking at the result of one >>define_peephole2 and thinking it was due to another, and >>also tricked by incorrect code comments (patch posted, will >>commit). >> >>TL;DR: Matching indeed does resume with attempting to match >>the *first* define_peephole2 replacement insn. But the >>match-and-replacement order is largely undocumented. >> >>Anyway, the missing-context problem I ran into remains: if >>you have an insn sequence {foo bar} and a define_peephole2 >>matching and replacing {bar} into {baz}, the resulting {foo >>baz} *will not be matched* against a define_peephole2 >>looking for {foo baz}. But, I'm not trying to document this >>caveat specifically, though at least it'll now be implied by >>the documentation. >> >>This could be fixed by always backing up MAX_INSNS_PER_PEEP2 >>- 1 insns after a successful replacement. I'm somewhat >>worries that this would also mean lots of futile re-match >>attempts. Thoughts? > >Good point. Probably. Do you have stats? > >If there is even a slight benefit, then some certainly would be willing to >take that penalty for sure. I.e. iff it helps -Os or locality then such >expensive optimisations certainly provide benefit for at least a few if not >some, IMO. > >Just curious && cheers, > >> >>(I could also just restart at the BB start, but I see all >>this support for backing-up live info by single insns being >>used. Taking notes about a partial match for the first insn >>of a failed attempt, as the maximum need to back-up to, >>doesn't look like it'd fly, judging from the nonspecific >>looking (set dest src) patterns being the first in i386 >>define_peephole2's match sequences.) >
Re: [PATCH v3] doc: Document order of define_peephole2 scanning
> From: Hans-Peter Nilsson > Date: Wed, 19 Apr 2023 06:06:27 +0200 > > Patch retracted, at least temporarily. My "understanding" > may be clouded by looking at an actual bug. Sigh. Mea culpa. I was looking at the result of one define_peephole2 and thinking it was due to another, and also tricked by incorrect code comments (patch posted, will commit). TL;DR: Matching indeed does resume with attempting to match the *first* define_peephole2 replacement insn. But the match-and-replacement order is largely undocumented. Anyway, the missing-context problem I ran into remains: if you have an insn sequence {foo bar} and a define_peephole2 matching and replacing {bar} into {baz}, the resulting {foo baz} *will not be matched* against a define_peephole2 looking for {foo baz}. But, I'm not trying to document this caveat specifically, though at least it'll now be implied by the documentation. This could be fixed by always backing up MAX_INSNS_PER_PEEP2 - 1 insns after a successful replacement. I'm somewhat worries that this would also mean lots of futile re-match attempts. Thoughts? (I could also just restart at the BB start, but I see all this support for backing-up live info by single insns being used. Taking notes about a partial match for the first insn of a failed attempt, as the maximum need to back-up to, doesn't look like it'd fly, judging from the nonspecific looking (set dest src) patterns being the first in i386 define_peephole2's match sequences.) So again: Approvers: pdf output reviewed. Ok to commit? -- >8 -- I was a bit surprised when my newly-added define_peephole2 didn't match, but it was because it was expected to partially match the generated output of a previous define_peephole2, which matched and modified the last insn of a sequence to be matched. I had assumed that the algorithm backed-up the size of the match-buffer, thereby exposing newly created opportunities *with sufficient context* to all define_peephole2's. While things can change in that direction, let's start with documenting the current state. * doc/md.texi (define_peephole2): Document order of scanning. --- gcc/doc/md.texi | 9 + 1 file changed, 9 insertions(+) diff --git a/gcc/doc/md.texi b/gcc/doc/md.texi index 07bf8bdebffb..300d104d58ab 100644 --- a/gcc/doc/md.texi +++ b/gcc/doc/md.texi @@ -9362,6 +9362,15 @@ If the preparation falls through (invokes neither @code{DONE} nor @code{FAIL}), then the @code{define_peephole2} uses the replacement template. +Insns are scanned in forward order from beginning to end for each basic +block. Matches are attempted in order of @code{define_peephole2} +appearance in the @file{md} file. After a successful replacement, +scanning for further opportunities for @code{define_peephole2}, resumes +with the first generated replacement insn as the first insn to be +matched against all @code{define_peephole2}. For the example above, +after its successful replacement, the first insn that can be matched by +a @code{define_peephole2} is @code{(set (match_dup 4) (match_dup 1))}. + @end ifset @ifset INTERNALS @node Insn Attributes -- 2.30.2