https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66
--- Comment #5 from gnu_bugzilla_gcc at catelyn dot tech ---
(In reply to Richard Biener from comment #4)
> note the situation is difficult to rectify - ideally the vectorizer
> would see that we require two 64bit register pieces but it doesn't
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66
--- Comment #3 from gnu_bugzilla_gcc at catelyn dot tech ---
(In reply to Richard Biener from comment #1)
> Unless you can come up with an actual benchmark showing the vector code is
> slower I'd say it's not. Given it's smaller it should win
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66
--- Comment #2 from gnu_bugzilla_gcc at catelyn dot tech ---
Created attachment 55807
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=55807=edit
preprocessed file containing the benchmark code I used
I compiled this code (although using
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66
Bug ID: 66
Summary: gcc unnecessarily creates vector operations for
packing 32 bit integers into struct (x86_64)
Product: gcc
Version: 13.2.1
Status: UNCONFIRMED