Re: [PATCH v3] Implement new RTL optimizations pass: fold-mem-offsets.

Jeff Law via Gcc-patches Tue, 18 Jul 2023 21:31:41 -0700



On 7/18/23 17:42, Vineet Gupta wrote:

Hi Manolis,

On 7/18/23 11:01, Jeff Law via Gcc-patches wrote:
Vineet @ Rivos has indicated he stumbled across an ICE with the V3code. Hopefully he'll get a testcase for that extracted shortly.
Yeah, I was trying to build SPEC2017 with this patch and ran into ICEfor several of them with -Ofast build: The reduced test from 455.nab isattached here.
The issue happens with v2 as well, so not something introduced by v3.

There's ICE in cprop_hardreg which immediately follows f-m-o.
The protagonist is ins 93 which starts off in combine as a simple set ofa DF 0.
| sff.i.288r.combine:(insn 93 337 94 8 (set (reg/v:DF 236 [ e ])
| sff.i.288r.combine- (const_double:DF 0.0 [0x0.0p+0])) "sff.i":23:11190 {*movdf_hardfloat_rv64}
Subsequently reload transforms it into SP + offset
| sff.i.303r.reload:(insn 93 337 94 9 (set (mem/c:DF (plus:DI (reg/f:DI2 sp)
| sff.i.303r.reload- (const_int 8 [0x8])) [4 %sfp+-8 S8 A64])
| sff.i.303r.reload- (const_double:DF 0.0 [0x0.0p+0])) "sff.i":23:11 190{*movdf_hardfloat_rv64}
| sff.i.303r.reload- (expr_list:REG_EQUAL (const_double:DF 0.0 [0x0.0p+0])
It gets processed by f-m-o and lands in cprop_hardreg, where it triggersICE.
| (insn 93 337 523 11 (set (mem/c:DF (plus:DI (reg/f:DI 2 sp)
|                 (const_int 8 [0x8])) [4 %sfp+-8 S8 A64])
|         (const_double:DF 0.0 [0x0.0p+0])) "sff.i":23:11 -1
^^^
|      (expr_list:REG_EQUAL (const_double:DF 0.0 [0x0.0p+0])
|        (nil)))
| during RTL pass: cprop_hardreg

Here's my analysis:
f-m-o: do_check_validity() -> insn_invalid_p() tries to recog() amodified version of insn 93 (actually there is no change, so perhapssomething we can optimize later). The corresponding md patternmovdf_hardfloat_rv64 no longer matches since it expects REG_P foroperand0, while reload has converted it into SP + offset. f-m-o thendoes the right thing by invalidating INSN_CODE=-1 for a subsequentrecog() to work correctly.But it seems this -1 lingers into the next pass, and trips upcopyprop_hardreg_forward_1() -> extract_constrain_insn()
So I don't know what the right fix here should be.

This is a bug in the RISC-V backend. I actually fixed basically thesame bug in another backend that was exposed by the f-m-o code.

In a run with -fno-fold-mem-offsets, the same insn 93 is successfullygrok'ed by cprop_hardreg,
| (insn 93 337 522 11 (set (mem/c:DF (plus:DI (reg/f:DI 2 sp)
|                (const_int 8 [0x8])) [4 %sfp+-8 S8 A64])
| (const_double:DF 0.0 [0x0.0p+0])) "sff.i":23:11 190{*movdf_hardfloat_rv64}
^^^^^^^^^^^^^^^
|     (expr_list:REG_EQUAL (const_double:DF 0.0 [0x0.0p+0])
|        (nil)))
P.S. I wonder if it is a good idea in general to call recog() postreload since the insn could be changed sufficiently to no longer matchthe md patterns. Of course I don't know the answer.

If this ever causes a problem, it's a backend bug.  It's that simple.

Conceptually it should always be safe to set INSN_CODE to -1 for any insn.

Odds are for this specific case in the RV backend, we just need aconstraint to store 0.0 into a memory location. That can actually beimplemented as a store from x0 since 0.0 has the bit pattern 0x0. Thisis probably a good thing to expose anyway as an optimization and canmove forward independently of the f-m-o patch.

P.S.2 When debugging code, I noticed a minor annoyance in the patch withthe whole fold_mem_offsets_driver() switch-case indirection. It doesn'tseem to be serving any purpose, and we could simply call correspondingdo_* routines in execute () itself.

We were in the process of squashing some of this out of theimplementation. I hadn't looked at the V3 patch to see how muchprogress had been made on this yet.


Thanks for digging into this!

jeff

Re: [PATCH v3] Implement new RTL optimizations pass: fold-mem-offsets.

Reply via email to