On 2017-10-22 19:14, Martin Vignali wrote: > @@ -86,16 +101,20 @@ SECTION .text > > ; void ff_bswap_buf(uint32_t *dst, const uint32_t *src, int w); > %macro BSWAP32_BUF 0 > -%if cpuflag(ssse3) > +%if cpuflag(ssse3)||cpuflag(avx2) > cglobal bswap32_buf, 3,4,3 > mov r3, r1 > +%if cpuflag(avx2) > + vbroadcasti128 m2, [pb_bswap32] > +%else > mova m2, [pb_bswap32] > +%endif
Why don't you increase the size of the constant? You wouldn't need a preprocessor condition. I do admit I have never considered using a broadact-128 instruction in similar circumstances so I have no idea which might be "better".
signature.asc
Description: OpenPGP digital signature
_______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org http://ffmpeg.org/mailman/listinfo/ffmpeg-devel