On Thu, 18 Jan 2024, Jakub Jelinek wrote:

> On Thu, Jan 18, 2024 at 08:27:51AM +0100, Richard Biener wrote:
> > On Thu, 18 Jan 2024, Jakub Jelinek wrote:
> > 
> > > Hi!
> > > 
> > > On aarch64 the backend decides to use non-BLKmode for some arrays
> > > like unsigned long[4] - OImode in that case, but the corresponding
> > > BITINT_TYPEs have BLKmode (like structures containing that many limb
> > > elements).  This both isn't a good idea (we really want such underlying 
> > > vars
> > > to live in memory and access them there, rather than live in registers and
> > > access their parts in there) and causes ICEs during expansion
> > > (VIEW_CONVERT_EXPR from such OImode array to BLKmode BITINT_TYPE), so the
> > > following patch makes sure such arrays reflect the BLKmode of BITINT_TYPEs
> > > it is accessed with (if any).
> > > 
> > > Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
> > 
> > So the issue is only manifesting during expansion?  I think it would
> > be better to detect the specific issue (V_C_E from register to BLKmode)
> > in discover_nonconstant_array_refs_r and force the register argument
> > to stack?
> 
> That doesn't really work, tried:
> --- gcc/cfgexpand.cc.jj       2024-01-16 11:45:16.159326506 +0100
> +++ gcc/cfgexpand.cc  2024-01-18 11:26:17.906447274 +0100
> @@ -6380,11 +6380,16 @@ discover_nonconstant_array_refs_r (tree
>    /* References of size POLY_INT_CST to a fixed-size object must go
>       through memory.  It's more efficient to force that here than
>       to create temporary slots on the fly.
> -     RTL expansion expectes TARGET_MEM_REF to always address actual memory.  
> */
> +     RTL expansion expectes TARGET_MEM_REF to always address actual memory.
> +     Also, force to stack non-BLKmode vars accessed through VIEW_CONVERT_EXPR
> +     to BLKmode BITINT_TYPEs.  */
>    else if (TREE_CODE (t) == TARGET_MEM_REF
>          || (TREE_CODE (t) == MEM_REF
>              && TYPE_SIZE (TREE_TYPE (t))
> -            && POLY_INT_CST_P (TYPE_SIZE (TREE_TYPE (t)))))
> +            && POLY_INT_CST_P (TYPE_SIZE (TREE_TYPE (t))))
> +        || (TREE_CODE (t) == VIEW_CONVERT_EXPR
> +            && TREE_CODE (TREE_TYPE (t)) == BITINT_TYPE
> +            && TYPE_MODE (TREE_TYPE (t)) == BLKmode))
>      {
>        tree base = get_base_address (t);
>        if (base
> This doesn't actually do anything, because the base is TREE_ADDRESSABLE.
> The var gets both with -O0 and -O2 DECL_RTL like
>         (mem/c:OI (plus:DI (reg/f:DI 95 virtual-stack-vars)
>                 (const_int -64 [0xffffffffffffffc0])) [2 bitint.2+0 S32 A128])

But then it's not promoted to register but instead somebody decides
it gets an integer mode instead of BLKmode.

> but the problem is that the expansion of the VAR_DECL because of the
> non-BLKmode is forced into a pseudo.
> --- gcc/expr.cc.jj    2024-01-12 10:07:58.194851657 +0100
> +++ gcc/expr.cc       2024-01-18 11:56:07.142361031 +0100
> @@ -12382,6 +12382,17 @@ expand_expr_real_1 (tree exp, rtx target
>         }
>        }

There's already an odd bit of code dealing with non-BLKmode to
BLKmode converts, suggesting those would need intermediate memory,
but likely not triggering because the base is a decl, not a
handled_component.  Does it work to go there also for DECL_P (treeop0)?


> +      /* Ensure non-BLKmode array VAR_DECLs VCEd to BLKmode BITINT_TYPE
> +      aren't promoted to registers.  */
> +      if (op0 == NULL_RTX
> +       && mode == BLKmode
> +       && TREE_CODE (type) == BITINT_TYPE
> +       && modifier == EXPAND_NORMAL
> +       && VAR_P (treeop0)
> +       && DECL_MODE (treeop0) != BLKmode)
> +     op0 = expand_expr_real (treeop0, NULL_RTX, VOIDmode, EXPAND_MEMORY,
> +                             NULL, inner_reference_p);
> +
>        if (!op0)
>       op0 = expand_expr_real (treeop0, NULL_RTX, VOIDmode, modifier,
>                               NULL, inner_reference_p);
> doesn't work either, while we get the MEM op0 in that case, because
> mode != GET_MODE (op0) we still take the
>       /* If the output type is a bit-field type, do an extraction.  */
>       else if (reduce_bit_field)
>         return extract_bit_field (op0, TYPE_PRECISION (type), 0,
>                                   TYPE_UNSIGNED (type), NULL_RTX,
>                                   mode, mode, false, NULL);
> path which can't deal with BLKmode extraction from non-BLKmode.
> I guess we could in the above new expr.cc hunk perhaps
> also if (MEM_P (op0)) op0 = adjust_address (op0, BLKmode, 0);

Hmm, 'reduce_bit_field' is odd with V_C_E - if you disable that,
does it work?

How does it behave differently when the base is BLKmode instead
of OImode?

Richard.

Reply via email to