https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98563
--- Comment #3 from N Schaeffer <nathanael.schaeffer at gmail dot com> ---
I'd like to add that what you call "vectorization of the basic block" actually generates code that is worse than naive non-vectorized code: all loads and arithmetic operations are done in scalar mode (v*sd instructions), and two values are then packed into an xmm register before storing...
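
To illustrate the code shape described above, here is a minimal sketch in C using the GCC/Clang vector extension. The kernel (an element-wise product of two doubles) and the names `mul2_scalar_then_pack` and `mul2_vector` are hypothetical; this is not the test case from this report, and the assembly GCC actually emits depends on version and flags. It only contrasts "scalar arithmetic, then pack for the store" with a fully vectorized equivalent.

```c
#include <string.h>

/* GCC/Clang vector extension: two doubles in one 16-byte vector. */
typedef double v2df __attribute__((vector_size(16)));

/* Hypothetical kernel: out[i] = a[i] * b[i] for i = 0, 1. */

/* Shape described in the comment: scalar loads and scalar multiplies,
   then the two results are packed into one vector only for the store. */
void mul2_scalar_then_pack(double *out, const double *a, const double *b)
{
    double r0 = a[0] * b[0];              /* scalar multiply */
    double r1 = a[1] * b[1];              /* scalar multiply */
    v2df packed = { r0, r1 };             /* pack two scalars into a vector */
    memcpy(out, &packed, sizeof packed);  /* one 16-byte store */
}

/* Fully vectorized shape: two vector loads, one vector multiply,
   one vector store. */
void mul2_vector(double *out, const double *a, const double *b)
{
    v2df va, vb, vr;
    memcpy(&va, a, sizeof va);            /* unaligned-safe vector load */
    memcpy(&vb, b, sizeof vb);
    vr = va * vb;                         /* element-wise multiply */
    memcpy(out, &vr, sizeof vr);          /* one 16-byte store */
}
```

Both functions compute the same result. The first does the same number of multiplies as plain scalar code and adds a packing step on top, which is why it can end up slower than not vectorizing at all.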