On 30/01/15 01:21, Peter Meerwald wrote: > this patch series splits out the g722_apply_qmf() function so it can > be optimized > > it turns out that the efficiency of the C code can be improved quite a bit > as well by unrolling :) > > benchmarking a G722 encode/decode in a loop compiled with gcc 4.8.2 > > x86-64, Intel i5-2400: > 340 ms baseline > 300 ms after g722_qmf_apply() unrolling, -11.7% > 275 ms after s_zero() unrolling, -19.1% > > ARM Cortex-A8: > 2720 ms baseline > 2365 ms after g722_qmf_apply() unrolling, -13.1% > 1935 ms after s_zero() unrolling, -28.8% > 1850 ms after q722_qmf_apply() in NEON, -32.0% >
Impressive! lu _______________________________________________ libav-devel mailing list libav-devel@libav.org https://lists.libav.org/mailman/listinfo/libav-devel