"H.J. Lu" <hjl.to...@gmail.com> writes: > > i386.c has > > { > /* When not optimize for size, enable vzeroupper optimization for > TARGET_AVX with -fexpensive-optimizations and split 32-byte > AVX unaligned load/store. */
This is only for the load, not for deciding whether peeling is worthwhile or not. I believe it's unimplemented for x86 at this point. There isn't even a hook for it. Any hook that is added should ideally work for both ARM64 and x86. This would imply it would need to handle different vector sizes. -Andi -- a...@linux.intel.com -- Speaking for myself only