On 2/1/2022 7:45 AM, Robin Dapp wrote:
Hi,

this is a bugfix for aa8cfe785953a0e87d2472311e1260cd98c605c0 which
broke an or1k test case (PR104153) as well as SPARC bootstrap (PR104198).

cond_exec_get_condition () returns the jump condition directly and we
now it to the backend.  The or1k backend modified the condition in-place
but this modification is not reverted when the sequence in question is
discarded.  Therefore this patch copies the RTX instead of using it
directly.

The SPARC problem is due to the backend recreating the initial condition
when being passed a CC comparison.  This causes the sequence
to read from an already overwritten condition operand.  Generally, this
could also happen on other targets.  The workaround is to always first
emit to a temporary.  In a second run of noce_convert_multiple_1 we know
which sequences actually require the comparison and use no
temporaries if all sequences after the current one do not require it.


Before, I used reg_overlap_mentioned_p () to check the generated
instructions against the condition.  The problem with this is that
reg_overlap... only handles a set of rtx_codes while a backend can
theoretically emit everything in an expander.  Is reg_mentioned_p () the
"right thing" to do?  Maybe it is overly conservative but as soon as we
have more than let's say three insns, we are unlikely to succeed anyway.

Bootstrapped and reg-tested on s390x, Power 9, x86 and SPARC.

Regards
  Robin

--

         PR 104198
         PR 104153

gcc/ChangeLog:

         * ifcvt.cc (noce_convert_multiple_sets_1): Copy rtx instead of
using it
         directly.  Rework comparison handling and always perform a
second pass.

gcc/testsuite/ChangeLog:

         * gcc.dg/pr104198.c: New test.
Also note this fixed the or1k issues and it's been tested on a variety of the embedded targets.



ifcvt-pr.patch

commit 68489d5729b4879bf2df540753fc7ea8ba1565a5
Author: Robin Dapp <rd...@linux.ibm.com>
Date:   Mon Jan 24 10:28:05 2022 +0100

     ifcvt: Fix PR104153 and PR104198.
This is a bugfix for aa8cfe785953a0e87d2472311e1260cd98c605c0 which
     broke an or1k test case (PR104153) as well as SPARC bootstrap (PR104198).
cond_exec_get_condition () returns the jump condition directly and we now
     pass it to the backend.  The or1k backend modified the condition in-place
     (other backends do that as well) but this modification is not reverted
     when the sequence in question is discarded.  Therefore we copy the RTX
     instead of using it directly.
The SPARC problem is due to the SPARC backend recreating the initial
     condition when being passed a CC comparison.  This causes the sequence
     to read from an already overwritten condition operand.  Generally, this
     could also happen on other targets.  The workaround is to always first
     emit to a temporary.  In a second run of noce_convert_multiple_1 we know
     which sequences actually require the comparison and will use no
     temporaries if all sequences after the current one do not require it.

diff --git a/gcc/ifcvt.cc b/gcc/ifcvt.cc
index fe250d508e1..92c2b40a45a 100644
--- a/gcc/ifcvt.cc
+++ b/gcc/ifcvt.cc
@@ -3391,7 +3391,11 @@ noce_convert_multiple_sets_1 (struct noce_if_info 
*if_info,
    rtx cond = noce_get_condition (jump, &cond_earliest, false);
rtx cc_cmp = cond_exec_get_condition (jump);
+  if (cc_cmp)
+    cc_cmp = copy_rtx (cc_cmp);
    rtx rev_cc_cmp = cond_exec_get_condition (jump, /* get_reversed */ true);
+  if (rev_cc_cmp)
+    rev_cc_cmp = copy_rtx (rev_cc_cmp);
rtx_insn *insn;
    int count = 0;
@@ -3515,6 +3519,7 @@ noce_convert_multiple_sets_1 (struct noce_if_info 
*if_info,
        unsigned cost1 = 0, cost2 = 0;
        rtx_insn *seq, *seq1, *seq2;
        rtx temp_dest = NULL_RTX, temp_dest1 = NULL_RTX, temp_dest2 = NULL_RTX;
+      bool read_comparison = false;
seq1 = try_emit_cmove_seq (if_info, temp, cond,
                                 new_val, old_val, need_cmov,
@@ -3524,10 +3529,38 @@ noce_convert_multiple_sets_1 (struct noce_if_info 
*if_info,
         as well.  This allows the backend to emit a cmov directly without
         creating an additional compare for each.  If successful, costing
         is easier and this sequence is usually preferred.  */
-      seq2 = try_emit_cmove_seq (if_info, target, cond,
+      seq2 = try_emit_cmove_seq (if_info, temp, cond,
Do you need to adjust anything now that this is emitting into TEMP rather than TARGET?
jeff

Reply via email to