[PATCH] AVX2 vec_widen_[su]mult_{hi,lo}*, sdot_prod* and udot_prod* (take 2)

2011-10-14 Thread Jakub Jelinek
On Fri, Oct 14, 2011 at 09:21:15AM -0700, Richard Henderson wrote: So what you're doing here is the low-part permutation: 0 4 1 5 2 6 3 7 followed by a shift to get 4 . 5 . 6 . 7 . But you need to load a 256-bit constant from memory to get it. Right. I wonder if it

Re: [PATCH] AVX2 vec_widen_[su]mult_{hi,lo}*, sdot_prod* and udot_prod* (take 2)

2011-10-14 Thread Richard Henderson
On 10/14/2011 11:34 AM, Jakub Jelinek wrote: 2011-10-14 Jakub Jelinek ja...@redhat.com * config/i386/sse.md (vec_widen_smult_hi_v8hi, vec_widen_smult_lo_v8hi, vec_widen_umult_hi_v8hi, vec_widen_umult_lo_v8hi): Macroize using VI2_AVX2 mode iterator and any_extend