"Bin.Cheng" <amker.ch...@gmail.com> writes:
> On Tue, May 23, 2017 at 6:53 PM, Richard Sandiford
> <richard.sandif...@linaro.org> wrote:
>> AIUI, the reason the old code mishandled negative steps was that the
>> associated segment lengths were stored as sizetype and so looked like
>> big unsigned values.  Those values therefore satisfied tree_fits_uhwi_p
>> even though they were semantically negative.  Is that right?
> Yes, and the undesired wrapping behavior when such large unsigned hwi
> is involved in computation.  But I think there are possible leaks in
> the code even after this patch, as embedded below.
>>
>> Assuming yes, and quoting the change as a context diff...
>>
>>> diff --git a/gcc/tree-data-ref.c b/gcc/tree-data-ref.c
>>> index a5f8c1c..f0799d9 100644
>>> *** a/gcc/tree-data-ref.c
>>> --- b/gcc/tree-data-ref.c
>>> ***************
>>> *** 1259,1264 ****
>>> --- 1259,1273 ----
>>>             != tree_int_cst_compare (DR_STEP (dr_a2->dr), size_zero_node))
>>>           continue;
>>>
>>> +       bool neg_step
>>> +         = (tree_int_cst_compare (DR_STEP (dr_a1->dr), size_zero_node) < 
>>> 0);
>>> +
>>> +       /* DR_A1 and DR_A2 must have the same step if it's negative.  */
>>> +       if (neg_step
>>> +           && tree_int_cst_compare (DR_STEP (dr_a1->dr),
>>> +                                    DR_STEP (dr_a2->dr)) != 0)
>>> +         continue;
>>> +
>>
>> [Why do they need to be the same step?]
> There are two reasons.  First is to simplify diff computation between
> dr_a1 and dr_a2, otherwise we need to adjust diff for negative steps.

What kind of adjustment would be needed?  Could you give an example?

> And wrapping behavior needs to be handled when adjusting diff with
> steps.  The second reason is not fully handled in this patch.  We now
> only set merged segment length to MAX only when both dr_a1->seg_len
> and dr_a2->seg_len are constant, as below:
> +          if (tree_fits_uhwi_p (dr_a1->seg_len)
> +              && tree_fits_uhwi_p (dr_a2->seg_len))
> +            new_seg_len
> +              = size_int (MAX (tree_to_uhwi (dr_a1->seg_len),
> +                       diff + tree_to_uhwi (dr_a2->seg_len)));
> +          else
> +            new_seg_len
> +              = size_binop (PLUS_EXPR, dr_a2->seg_len, size_int (diff));
> In fact, we should do this for else branch too.  with different steps,
> it is still possible that dr_a1-seg_len > dr_a2->seg_len + diff.  Here
> I only restrict it to negative DR_STEP.  Patch updated with
> explanation in comment.
>>
>>>         /* Make sure dr_a1 starts left of dr_a2.  */
>>>         if (tree_int_cst_lt (DR_INIT (dr_a2->dr), DR_INIT (dr_a1->dr)))
>>>           std::swap (*dr_a1, *dr_a2);
>>> ***************
>>> *** 1266,1325 ****
>>>         bool do_remove = false;
>>>         unsigned HOST_WIDE_INT diff
>>>           = (tree_to_shwi (DR_INIT (dr_a2->dr))
>>> !                - tree_to_shwi (DR_INIT (dr_a1->dr)));
>>>
>>> !       /* If the left segment does not extend beyond the start of the
>>> !          right segment the new segment length is that of the right
>>> !          plus the segment distance.  */
>>> !       if (tree_fits_uhwi_p (dr_a1->seg_len)
>>> !           && compare_tree_int (dr_a1->seg_len, diff) <= 0)
>>>           {
>>> !           dr_a1->seg_len = size_binop (PLUS_EXPR, dr_a2->seg_len,
>>> !                                        size_int (diff));
>>> !           do_remove = true;
>>>           }
>>> !       /* Generally the new segment length is the maximum of the
>>> !          left segment size and the right segment size plus the distance.
>>> !          ???  We can also build tree MAX_EXPR here but it's not clear this
>>> !          is profitable.  */
>>> !       else if (tree_fits_uhwi_p (dr_a1->seg_len)
>>> !                && tree_fits_uhwi_p (dr_a2->seg_len))
>>> !         {
>>> !           unsigned HOST_WIDE_INT seg_len_a1 = tree_to_uhwi 
>>> (dr_a1->seg_len);
>>> !           unsigned HOST_WIDE_INT seg_len_a2 = tree_to_uhwi 
>>> (dr_a2->seg_len);
>>> !           dr_a1->seg_len = size_int (MAX (seg_len_a1, diff + seg_len_a2));
>>> !           do_remove = true;
>>> !         }
>>> !       /* Now we check if the following condition is satisfied:
>>>
>>> !          DIFF - SEGMENT_LENGTH_A < SEGMENT_LENGTH_B
>>>
>>> !          where DIFF = DR_A2_INIT - DR_A1_INIT.  However,
>>> !          SEGMENT_LENGTH_A or SEGMENT_LENGTH_B may not be constant so we
>>> !          have to make a best estimation.  We can get the minimum value
>>> !          of SEGMENT_LENGTH_B as a constant, represented by MIN_SEG_LEN_B,
>>> !          then either of the following two conditions can guarantee the
>>> !          one above:
>>>
>>> !          1: DIFF <= MIN_SEG_LEN_B
>>> !          2: DIFF - SEGMENT_LENGTH_A < MIN_SEG_LEN_B  */
>>> !       else
>>>           {
>>> !           unsigned HOST_WIDE_INT min_seg_len_b
>>> !             = (tree_fits_uhwi_p (dr_b1->seg_len)
>>> !                ? tree_to_uhwi (dr_b1->seg_len)
>>> !                : factor);
>>>
>>>             if (diff <= min_seg_len_b
>>>                 || (tree_fits_uhwi_p (dr_a1->seg_len)
>>> !                   && diff - tree_to_uhwi (dr_a1->seg_len) < 
>>> min_seg_len_b))
>>>               {
>>> !               dr_a1->seg_len = size_binop (PLUS_EXPR,
>>> !                                            dr_a2->seg_len, size_int 
>>> (diff));
>>>                 do_remove = true;
>>>               }
>>>           }
>>>
>>>         if (do_remove)
>>>           {
>>>             if (dump_enabled_p ())
>>> --- 1275,1375 ----
>>>         bool do_remove = false;
>>>         unsigned HOST_WIDE_INT diff
>>>           = (tree_to_shwi (DR_INIT (dr_a2->dr))
>>> !            - tree_to_shwi (DR_INIT (dr_a1->dr)));
>>> !       tree new_seg_len;
>>> !       unsigned HOST_WIDE_INT min_seg_len_b;
>>>
>>> !       if (tree_fits_uhwi_p (dr_b1->seg_len))
>>>           {
>>> !           min_seg_len_b = tree_to_uhwi (dr_b1->seg_len);
>>> !           if (tree_int_cst_sign_bit (dr_b1->seg_len))
>>> !             min_seg_len_b = 0 - min_seg_len_b;
>>>           }
>>> !       else
>>> !         min_seg_len_b = factor;
>>
>> ...I'm not sure how safe this or the later neg_step handling is
>> for 16-bit and 32-bit sizetypes.  It might be better to use wide_int
> I think it could be a problem in case sizetype is smaller than
> unsigned_type_for(ptr).

But I think it would a problem even for "normal" 32-bit and 16-bit
targets, because you're doing uhwi (i.e. 64-bit) negation on things that
come from 32-bit and 16-bit unsigned values.  E.g. a segment length of
-32 on a 32-bit target would be 0xffffffe0.  If you read that as a uhwi
and negate it, you get 0xffffffff00000020 rather than 32.

Using wide_ints would avoid that.  I don't think the existing code
needed it (because the existing code didn't handle negative steps
properly at all).

>> instead, so that all arithmetic and comparisons happen in the precision
>> of sizetype.
> I was trying to make minimal refactoring for fixing the negative step
> issue.  Also I guess your SVE patches will rewrite this part entirely?

Not sure TBH :-)  I'll have to see how it works out when I merge it in.

Thanks,
Richard

Reply via email to