https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96966
Jakub Jelinek <jakub at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |jakub at gcc dot gnu.org --- Comment #4 from Jakub Jelinek <jakub at gcc dot gnu.org> --- Even extern char a[32]; void f (const void *s) { char *p = (char*)__builtin_memcpy (a, s, 16); __builtin_memcpy (p, s, 16); } void g (const void *s) { __builtin_memcpy (a, s, 16); __builtin_memcpy (a, s, 16); } used to be optimized just in 8.1/8.2 and not in earlier or later GCC versions. Perhaps delaying the lowering of memcpy a tiny bit and trying to optimize it when it is still not lowered?