Hi,

2014-03-04 3:25 GMT+01:00 James Almer <jamr...@gmail.com>:
> -INIT_XMM sse2
> +%macro SETZERO 1
> +%if cpuflag(sse2)
> +    pxor %1, %1
> +%else
> +    xorps %1, %1, %1
> +%endif
> +%endmacro
> +
> +%macro SHUF 2
> +%if cpuflag(sse2)
> +    pshufd %1, %2, q0123
> +%else
> +    mova %1, %2
> +    shufps %1, %1, q0123
> +%endif
> +%endmacro

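For readers unfamiliar with the q0123 constant, here is a minimal scalar sketch in C of what the two branches of the quoted SHUF macro compute. It is not part of the patch. The helper names (Q, model_pshufd, model_shufps, shuf_sse2, shuf_sse) are made up for illustration, lanes are modelled as uint32_t because both shuffles only move 32-bit lanes around, and the immediate layout assumes x86inc's usual qABCD convention (digit A in bits 7:6, digit D in bits 1:0).

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: build a shuffle immediate following the assumed
 * qABCD layout, with digit A in bits 7:6 and digit D in bits 1:0. */
#define Q(a, b, c, d) (((a) << 6) | ((b) << 4) | ((c) << 2) | (d))

/* Scalar model of "pshufd dst, src, imm8":
 * every destination lane selects one lane of src. */
static void model_pshufd(uint32_t dst[4], const uint32_t src[4], unsigned imm)
{
    uint32_t tmp[4];
    for (int i = 0; i < 4; i++)
        tmp[i] = src[(imm >> (2 * i)) & 3];
    memcpy(dst, tmp, sizeof(tmp));
}

/* Scalar model of two-operand "shufps dst, src, imm8":
 * lanes 0-1 are selected from dst, lanes 2-3 from src. */
static void model_shufps(uint32_t dst[4], const uint32_t src[4], unsigned imm)
{
    uint32_t tmp[4];
    tmp[0] = dst[imm & 3];
    tmp[1] = dst[(imm >> 2) & 3];
    tmp[2] = src[(imm >> 4) & 3];
    tmp[3] = src[(imm >> 6) & 3];
    memcpy(dst, tmp, sizeof(tmp));
}

/* Model of the SSE2 branch: pshufd %1, %2, q0123 */
static void shuf_sse2(uint32_t dst[4], const uint32_t src[4])
{
    model_pshufd(dst, src, Q(0, 1, 2, 3));
}

/* Model of the plain SSE branch: mova %1, %2 then shufps %1, %1, q0123 */
static void shuf_sse(uint32_t dst[4], const uint32_t src[4])
{
    memmove(dst, src, 4 * sizeof(uint32_t));  /* mova %1, %2 */
    model_shufps(dst, dst, Q(0, 1, 2, 3));    /* shufps %1, %1, q0123 */
}
```

Under these assumptions both paths reverse the four 32-bit lanes. The SSE branch needs the extra mova because two-operand shufps takes its low two result lanes from the destination register, so the source has to be copied there first.
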
We already discussed this, and indeed it is worth having SSE2 (integer)
instructions instead of pure (float) SSE ones for the SSE2 version, as they
are actually faster.

OK from me then for the asm. Not sure if the C part still applies cleanly,
but this should be minor.

-- 
Christophe
_______________________________________________
libav-devel mailing list
libav-devel@libav.org
https://lists.libav.org/mailman/listinfo/libav-devel