https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97127
--- Comment #8 from Michael_S <already5chosen at yahoo dot com> ---
What are the current values of gcc's "loop" cost for the relevant instructions?
1. AVX256 Load
2. FMA3 ymm,ymm,ymm
3. AVX256 Regmove
4. FMA3 mem,ymm,ymm