Hi,

2014-03-04 3:25 GMT+01:00 James Almer <jamr...@gmail.com>:
> -INIT_XMM sse2
> +%macro SETZERO 1
> +%if cpuflag(sse2)
> +    pxor          %1, %1
> +%else
> +    xorps         %1, %1, %1
> +%endif
> +%endmacro
> +
> +%macro SHUF 2
> +%if cpuflag(sse2)
> +    pshufd        %1, %2, q0123
> +%else
> +    mova          %1, %2
> +    shufps        %1, %1, q0123
> +%endif
> +%endmacro

We already discussed this, and indeed it is worth having SSE2
(integer) instructions instead of pure (float) SSE ones for the SSE2
version as they are actually faster. OK from me then for the asm.

Not sure if the C part still applies cleanly, but this should be minor.

-- 
Christophe
_______________________________________________
libav-devel mailing list
libav-devel@libav.org
https://lists.libav.org/mailman/listinfo/libav-devel

Reply via email to