On 27/06/2024 17:16, Wilco Dijkstra wrote:
> Hi Richard,
> 
>> Doing just this will mean that the register allocator will have to undo a 
>> pre/post memory operand that was accepted by the predicate (memory_operand). 
>>  I think we really need a tighter predicate (let's call it noautoinc_mem_op) 
>> here to avoid that.  Note that the existing uses of Uw also had another 
>> alternative that did permit 'm', so this wasn't previously practical, but 
>> they had alternative ways of being reloaded.
>>
>> No, sorry that won't work; there's another 'm' alternative here as well.
>> The correct fix is to add alternatives for T1, I think, similar to the one 
>> in thumb1_movsi_insn.
>>
>> Also, by observation I think there's a similar problem in the load 
>> operations.
> 
> Just using 'Uw' works fine, but restricting the memory operand too is indeed
> better.  I added 'restricted_memory_operand', which only disallows Thumb-1
> post-increment.
> 
> There were also a few more cases in unaligned accesses where 'm' was used
> incorrectly when emitting Thumb-1 LDR/STR alternatives (and where no LDM/STM
> is allowed), so those also use 'Uw' and 'restricted_memory_operand'.
> 
> Long term, it seems like a better idea to remove support for this odd
> post-increment from the general memory operand and only emit it from a
> peephole pass.
> 
> Cheers,
> Wilco
> 
> 
> v3: Use 'Uw' in a few more cases. Add 'restricted_memory_operand'.
> 
> A Thumb-1 memory operand allows single-register LDMIA/STMIA. This doesn't get
> printed as LDR/STR with writeback in unified syntax, resulting in strange
> assembler errors if writeback is selected.  To work around this, use the 'Uw'
> constraint that blocks writeback.  Also use a new 'restricted_memory_operand'
> which is a general memory operand that disallows writeback in Thumb-1.
> A few other patterns were using 'm' for Thumb-1 in a similar way; update these
> to also use 'restricted_memory_operand' and 'Uw'.
> 
> Passes bootstrap & regress, OK for commit (and backport to GCC14.2)?

I'm not a major fan of the name restricted_memory_operand as it doesn't 
describe which restriction is being applied and something like 
t1_restricted_memory_operand would not be any clearer.  Perhaps 
mem_and_no_t1_wback_op would be better?
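
As a rough sketch of what I mean (untested, and assuming the body of the 
predicate stays exactly as in your patch, with only the name changing):

```
;; General memory operand that disallows Thumb-1 POST_INC.
;; Sketch only: same test as restricted_memory_operand in the patch below,
;; under the suggested name.
(define_predicate "mem_and_no_t1_wback_op"
  (and (match_operand 0 "memory_operand")
       (match_test "!(TARGET_THUMB1 && GET_CODE (XEXP (op, 0)) == POST_INC)")))
```

The match_operand uses in arm.md and sync.md would then refer to the new 
name instead.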

OK with that change.

R.

> 
> gcc:
>         PR target/115188
>         * config/arm/arm.md (unaligned_loadsi): Use 'Uw' constraint and
>         'restricted_memory_operand'.
>         (unaligned_loadhiu): Likewise.
>         (unaligned_storesi): Likewise.
>         (unaligned_storehi): Likewise.
>         * config/arm/predicates.md (restricted_memory_operand): Add new
>         predicate.
>         * config/arm/sync.md (arm_atomic_load<mode>): Use 'Uw' constraint.
>         (arm_atomic_store<mode>): Likewise.
> 
> gcc/testsuite:
>         PR target/115188
>         * gcc.target/arm/pr115188.c: Add new test.
> 
> ---
> 
> diff --git a/gcc/config/arm/arm.md b/gcc/config/arm/arm.md
> index f47e036a8034ed16c61bbd753c7a7cd3efb1ecbd..c962a9341779e4da38f4e1afb26d4a364fc5aee4 100644
> --- a/gcc/config/arm/arm.md
> +++ b/gcc/config/arm/arm.md
> @@ -5011,7 +5011,7 @@
>  
>  (define_insn "unaligned_loadsi"
>    [(set (match_operand:SI 0 "s_register_operand" "=l,l,r")
> -     (unspec:SI [(match_operand:SI 1 "memory_operand" "m,Uw,m")]
> +     (unspec:SI [(match_operand:SI 1 "restricted_memory_operand" "Uw,Uw,m")]
>                  UNSPEC_UNALIGNED_LOAD))]
>    "unaligned_access"
>    "@
> @@ -5041,7 +5041,7 @@
>  (define_insn "unaligned_loadhiu"
>    [(set (match_operand:SI 0 "s_register_operand" "=l,l,r")
>       (zero_extend:SI
> -       (unspec:HI [(match_operand:HI 1 "memory_operand" "m,Uw,m")]
> +       (unspec:HI [(match_operand:HI 1 "restricted_memory_operand" "Uw,Uw,m")]
>                    UNSPEC_UNALIGNED_LOAD)))]
>    "unaligned_access"
>    "@
> @@ -5066,7 +5066,7 @@
>     (set_attr "type" "store_8")])
>  
>  (define_insn "unaligned_storesi"
> -  [(set (match_operand:SI 0 "memory_operand" "=m,Uw,m")
> +  [(set (match_operand:SI 0 "restricted_memory_operand" "=Uw,Uw,m")
>       (unspec:SI [(match_operand:SI 1 "s_register_operand" "l,l,r")]
>                  UNSPEC_UNALIGNED_STORE))]
>    "unaligned_access"
> @@ -5081,7 +5081,7 @@
>     (set_attr "type" "store_4")])
>  
>  (define_insn "unaligned_storehi"
> -  [(set (match_operand:HI 0 "memory_operand" "=m,Uw,m")
> +  [(set (match_operand:HI 0 "restricted_memory_operand" "=Uw,Uw,m")
>       (unspec:HI [(match_operand:HI 1 "s_register_operand" "l,l,r")]
>                  UNSPEC_UNALIGNED_STORE))]
>    "unaligned_access"
> diff --git a/gcc/config/arm/predicates.md b/gcc/config/arm/predicates.md
> index 4994c0c57d6431117c16f7a05e800821dee93408..3dfe381c098c06517dca6026f8dafe87b46135ae 100644
> --- a/gcc/config/arm/predicates.md
> +++ b/gcc/config/arm/predicates.md
> @@ -907,3 +907,8 @@
>  ;; A special predicate that doesn't match a particular mode.
>  (define_special_predicate "arm_any_register_operand"
>    (match_code "reg"))
> +
> +;; General memory operand that disallows Thumb-1 POST_INC.
> +(define_predicate "restricted_memory_operand"
> +  (and (match_operand 0 "memory_operand")
> +       (match_test "!(TARGET_THUMB1 && GET_CODE (XEXP (op, 0)) == POST_INC)")))
> diff --git a/gcc/config/arm/sync.md b/gcc/config/arm/sync.md
> index df8dbe170cacb6b60d56a6f19aadd5a6c9c51f7a..7696c1a6f9819cfcbc9008f58431b9c2f08cb0ce 100644
> --- a/gcc/config/arm/sync.md
> +++ b/gcc/config/arm/sync.md
> @@ -65,7 +65,7 @@
>  (define_insn "arm_atomic_load<mode>"
>    [(set (match_operand:QHSI 0 "register_operand" "=r,l")
>      (unspec_volatile:QHSI
> -      [(match_operand:QHSI 1 "memory_operand" "m,m")]
> +      [(match_operand:QHSI 1 "restricted_memory_operand" "m,Uw")]
>        VUNSPEC_LDR))]
>    ""
>    "ldr<sync_sfx>\t%0, %1"
> @@ -81,7 +81,7 @@
>  )
>  
>  (define_insn "arm_atomic_store<mode>"
> -  [(set (match_operand:QHSI 0 "memory_operand" "=m,m")
> +  [(set (match_operand:QHSI 0 "restricted_memory_operand" "=m,Uw")
>      (unspec_volatile:QHSI
>        [(match_operand:QHSI 1 "register_operand" "r,l")]
>        VUNSPEC_STR))]
> diff --git a/gcc/testsuite/gcc.target/arm/pr115188.c b/gcc/testsuite/gcc.target/arm/pr115188.c
> new file mode 100644
> index 0000000000000000000000000000000000000000..9a4022b56796d6962bb3f22e40bac4b81eb78ccf
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/arm/pr115188.c
> @@ -0,0 +1,10 @@
> +/* { dg-do assemble } */
> +/* { dg-require-effective-target arm_arch_v6m_ok } */
> +/* { dg-options "-O2" } */
> +/* { dg-add-options arm_arch_v6m } */
> +
> +void init (int *p, int n)
> +{
> +  for (int i = 0; i < n; i++)
> +    __atomic_store_4 (p + i, 0, __ATOMIC_RELAXED);
> +}
> 
