On 26.07.2016 14:51, Segher Boessenkool wrote:
On Tue, Jul 26, 2016 at 02:14:49PM +0200, Georg-Johann Lay wrote:
which returns const0_rtx because reg 18 is set in insn 43 to const0_rtx.
Total outcome is that the right shift of reg:DI 18 is transformed to a
no-op move and cancelled out in the remainder.

Why does num_sign_bit_copies return something bigger than 8?

Because it thinks the last value was const0_rtx (which is incorrect) and

Why is that incorrect?  See insn 43.  Is there another insn I missed?

Yes, the insn stream that matters is (prior to combine):

(insn 43 31 44 2 (set (reg:QI 18 r18)
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 44 43 45 2 (set (reg:QI 19 r19 [+1 ])
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 45 44 46 2 (set (reg:QI 20 r20 [+2 ])
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 46 45 47 2 (set (reg:QI 21 r21 [+3 ])
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 47 46 48 2 (set (reg:QI 22 r22 [+4 ])
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 48 47 49 2 (set (reg:QI 23 r23 [+5 ])
        (reg:QI 65 [ _3+5 ])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 49 48 50 2 (set (reg:QI 24 r24 [+6 ])
        (reg:QI 66 [ _3+6 ])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 50 49 51 2 (set (reg:QI 25 r25 [+7 ])
        (reg:QI 67 [ _3+7 ])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 51 50 52 2 (set (reg:QI 16 r16)
        (const_int 40 [0x28])) bug-combin.c:29 56 {movqi_insn}
     (nil))
(insn 52 51 61 2 (set (reg:DI 18 r18)
        (ashiftrt:DI (reg:DI 18 r18)
            (reg:QI 16 r16))) bug-combin.c:29 1417 {ashrdi3_insn}
     (expr_list:REG_DEAD (reg:QI 16 r16)
        (nil)))

The SETs of insns 43...50 all overlap reg:DI 18. This is what expand does at it breaks up a DImode move into 8 QImode moves. The first 8 instructions comprise an &= 0xffffff0000000000.

It's corect that QI:18 is set to 0, but this does not apply to DI:18 of course.

The problem might be that get_last_value() is called for reg:DI 18 and
combine's internal bookkeeping is incorrect

struct reg_stat_type {
  /* Record last point of death of (hard or pseudo) register n.  */
  rtx_insn                      *last_death;

  /* Record last point of modification of (hard or pseudo) register n.  */
  rtx_insn                      *last_set;

.last_set is the set for reg:QI 18 but later insns also change hard reg:DI
18, for example the set of reg:QI 19.  This means that the information in
reg_stat_type does not tell the whole story for hard regs.

One fix could be to get the mode precision of the SET from the last_set
insn and only use the information if that mode is at least as wide as the
mode of the register for which get_last_value is called.  reg_stat_type has
.last_set_mode for this purpose (QImode in the test case).

Yes, last_set_mode, and there is last_set_sign_bit_copies too.  Are those
correct?

(gdb) p *rsp
$9 = {last_death = 0x0, last_set = 0x7ffff730cc00, last_set_value = 0x7ffff73094f0, last_set_table_tick = 2, last_set_label = 2, last_set_nonzero_bits = 0, last_set_sign_bit_copies = 8 '\b', last_set_mode = QImode, last_set_invalid = 1 '\001', sign_bit_copies = 0 '\000', nonzero_bits = 0, truncation_label = 2, truncated_to_mode = DImode}

(gdb) p rsp->last_set
$10 = (rtx_insn *) 0x7ffff730cc00
(gdb) pr
(insn 43 31 44 2 (set (reg:QI 18 r18)
        (const_int 0 [0])) bug-combin.c:29 56 {movqi_insn}
     (nil))

(gdb) p rsp->last_set_value
$11 = (rtx) 0x7ffff73094f0
(gdb) pr
(const_int 0 [0])

The .last_set_sign_bit_copies is correct if it is applied to QImode, but the sign bits of reg:DI 18 are contained in reg:QI 25 set by insn 50.

As already said, rsp[regno] is not used to get the number of sign bit copies but to conclude that DI:18 is entirely set to 0, and the number of sign bit copies is then computed for that 0.

We do have a value, and it is for this bb.

But not for the complete hard register; it's only for reg:QI 18 which is
one byte of reg:DI 18,

The rest of the hard register bits are set to unspecified values.

No, insns 44...50 set the rest of reg:DI 18 to specified values. This is what expand does as is lowers DImode to word_mode (QImode for avr).

FYI, FIRST_PSEUDO_REGISTER is 34.

thus the code should never reach this point in the
first place and return earlier.  For example:

Index: combine.c
===================================================================
--- combine.c   (revision 238701)
+++ combine.c   (working copy)
@@ -13206,6 +13206,13 @@ get_last_value (const_rtx x)
       && DF_INSN_LUID (rsp->last_set) >= subst_low_luid)
     return 0;

+  /* If the lookup is for a hard register make sure that value contains at
least
+     as many bits as x does.  */
+
+  if (regno < FIRST_PSEUDO_REGISTER
+      && GET_MODE_PRECISION (rsp->last_set_mode) < GET_MODE_PRECISION
(GET_MODE (x)))
+    return 0;
+
   /* If the value has all its registers valid, return it.  */
   if (get_last_value_validate (&value, rsp->last_set, rsp->last_set_label,
   0))
     return value;

That might be a bit harsh.

In what respect?

First things first: is the information recorded correct?

I think yes, but care must be taken what may be concluded from that information. From a set of 8 bits you cannot draw conclusion about all 64 bits; this should be obvious enough :-)

In the above case rsp[regno] holds only information for 1 sub-byte. In order to get the complete DImode story we would have to get the info for all sub-parts and then put them together...

Johann


Segher


Reply via email to