pr45636.f90 fails for AArch64 - memcpy and memset are not combined

kugan at gcc dot gnu.org Mon, 24 Nov 2014 03:37:45 -0800

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64045


--- Comment #1 from kugan at gcc dot gnu.org ---
Look at gcc/tree-ssa-fwprop.c:1650

      /* If the new memcpy wouldn't be emitted by storing the literal
         by pieces, this optimization might enlarge .rodata too much,
         as commonly used string literals couldn't be shared any
         longer.  */
      if (!can_store_by_pieces (src_len,
                    builtin_strncpy_read_str,
                    src_buf, ptr1_align, false))
        break;

AArch64 back-end rejects use_by_pieces_infrastructure for STORE_BY_PIECES and
hence this optimisation will fail. It is probably an xfail case if back-end is
going to reject it.


static bool
aarch64_use_by_pieces_infrastructure_p (unsigned int size,
                                       unsigned int align,
                                       enum by_pieces_operation op,
                                       bool speed_p)
{
  /* STORE_BY_PIECES can be used when copying a constant string, but
     in that case each 64-bit chunk takes 5 insns instead of 2 (LDR/STR).
     For now we always fail this and let the move_by_pieces code copy
     the string from read-only memory.  */
  if (op == STORE_BY_PIECES)
    return false;

  return default_use_by_pieces_infrastructure_p (size, align, op, speed_p);
}

[Bug target/64045] fortran.dg/pr45636.f90 fails for AArch64 - memcpy and memset are not combined

Reply via email to