On Thu, 20 Mar 2014 15:37:53 -0300, James Almer jamr...@gmail.com wrote:
Here are some extra implementations that extend Christophe's work.
Thanks, pushed
--
Anton Khirnov
___
libav-devel mailing list
libav-devel@libav.org
Here are some extra implementations that extend Christophe's work.
Differences with v1:
* AVX/FMA3: Removed the main loop and related bookkeepeing for x64 since said
loop
would be run only once anyway.
* FMA3: Replaced mulps+subps with FMA3 instructions, meaning two less
instructions
run per