https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82370
--- Comment #1 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
Created attachment 42296
  --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=42296&action=edit
gcc8-pr82370.patch

If VPAND is exactly as fast as VPANDQ except for different encodings, then maybe we can do something like this patch, where we'd use the suffixes only for 512-bit vectors, or when any of the operands is %[xy]mm16+, or when masking. If VPAND is slower, then we could do it for -Os at least.
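
For illustration only, the selection rule described in this comment could be sketched as below. This is not code from the attached patch: the helper names needs_evex_suffix and pand_mnemonic, and the convention of passing operands as plain register numbers (0-31), are assumptions made for the sketch. The rule matters because the unsuffixed VPAND has a VEX encoding, which is shorter than EVEX but cannot express 512-bit vectors, registers 16 and above, or masking.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper, not taken from the attached patch.
   Returns true when the EVEX-only suffixed form (e.g. vpandq) is
   required, following the three conditions named in the comment:
   512-bit vectors, any operand in %[xy]mm16 or above, or masking.  */
static bool
needs_evex_suffix (int vector_bits, const int *regnos, size_t nregs,
                   bool masked)
{
  /* Only the three AVX/AVX-512 vector widths are modelled here.  */
  assert (vector_bits == 128 || vector_bits == 256 || vector_bits == 512);

  if (vector_bits == 512 || masked)
    return true;

  /* Registers 16..31 are reachable only through the EVEX encoding.  */
  for (size_t i = 0; i < nregs; i++)
    if (regnos[i] >= 16)
      return true;

  return false;
}

/* Hypothetical wrapper: pick the mnemonic for a bitwise AND.  */
static const char *
pand_mnemonic (int vector_bits, const int *regnos, size_t nregs,
               bool masked)
{
  return needs_evex_suffix (vector_bits, regnos, nregs, masked)
         ? "vpandq" : "vpand";
}
```

Under these assumptions, a 256-bit AND on %ymm0, %ymm1, %ymm2 without masking would get the unsuffixed vpand, while the same operation with %ymm16 as any operand would get vpandq.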