On 4/18/23 08:12, Hans-Peter Nilsson wrote:
Date: Tue, 18 Apr 2023 07:43:41 -0600
From: Jeff Law <jeffreya...@gmail.com>

On 2/15/23 08:34, Hans-Peter Nilsson via Gcc-patches wrote:
Regtested cris-elf with its LEGITIMIZE_RELOAD_ADDRESS
disabled, where it regresses gcc.target/cris/rld-legit1.c;
as expected, because that test guards proper function of its
LEGITIMIZE_RELOAD_ADDRESS i.e., that there's no sign of
decomposed address elements.

LRA also causes a similar decomposition (and worse, in even
smaller bits), but it can create valid insns as-is.
Unfortunately, it doesn't have something equivalent to
LEGITIMIZE_RELOAD_ADDRESS so it generates worse code for
cases where that hook helped reload.

I fear reload-related patches these days are treated like a
redheaded stepchild and even worse as this one is intended
for stage 1.  Either way, I need to create a reference to
it, and it's properly tested and has been a help when
working towards LRA, thus might help other targets: ok to
install for the next stage 1?

-- >8 --
When LEGITIMIZE_RELOAD_ADDRESS for cris-elf is disabled,
this code is now required for reload to generate valid insns
from some reload-decomposed addresses, for example the
(plus:SI
   (sign_extend:SI (mem:HI (reg/v/f:SI 32 [ a ]) [1 *a_6(D)+0 S2 A8]))
   (reg/v/f:SI 33 [ y ]))
generated in gcc.target/cris/rld-legit1.c (a valid address
but with two registers needing reload).  Now after decc0:ing,
most SET insns for former cc0 targets need to be a parallel
with a clobber of the flags register.  Such targets
typically have TARGET_FLAGS_REGNUM set to a valid register.

        * reload1.cc (emit_insn_if_valid_for_reload_1): Rename from
        emit_insn_if_valid_for_reload.
        (emit_insn_if_valid_for_reload): Call new helper, and if a SET fails
        to be recognized, also try emitting a parallel that clobbers
        TARGET_FLAGS_REGNUM, as applicable.
BUt isn't it the case that we're not supposed to be exposing the flags
register until after reload?   And if that's the case, then why would
this be necessary?  Clearly I must be missing something.

That "supposed to" is only *one* possible implementation.
The one in CRIS - and I believe the preferred one; one I
should advocate more - is to *always* expose clobbering of
the flags.  (I managed to do the CRIS decc0ification
transformation without loss of performance.  There were much
fewer issues with code taking PATTERN (insn) and failing on
it being PARALLEL than I had expected, much thanks to use of
rtx_single_set.)

Think about it: why should the semantics of a valid insn
change after a "random" pass?  That's almost as crazy as the
implied semantics of cc0.
Ah, yes, thanks for the reminder that there's multiple approaches here. If I cared enough it'd probably make more sense at this point to expose cc0 early on the H8 as doing so would allow easier codegen for overflow tests which in turn could significantly speed up the testsuite.

OK for the trunk.

jeff

Reply via email to