On 30/01/15 01:21, Peter Meerwald wrote:
> this patch series splits out the g722_apply_qmf() function so it can
> be optimized
> 
> it turns out that the efficiency of the C code can be improved quite a bit
> as well by unrolling :)
> 
> benchmarking a G722 encode/decode in a loop compiled with gcc 4.8.2
> 
> x86-64, Intel i5-2400:
> 340 ms baseline
> 300 ms after g722_qmf_apply() unrolling, -11.7%
> 275 ms after s_zero() unrolling, -19.1%
> 
> ARM Cortex-A8:
> 2720 ms baseline
> 2365 ms after g722_qmf_apply() unrolling, -13.1%
> 1935 ms after s_zero() unrolling, -28.8%
> 1850 ms after q722_qmf_apply() in NEON, -32.0%
> 

Impressive!

lu

_______________________________________________
libav-devel mailing list
libav-devel@libav.org
https://lists.libav.org/mailman/listinfo/libav-devel

Reply via email to