[libav-devel] NEON: performance comparison of fixed-point and float FFT

Orjan Friberg Fri, 04 May 2012 06:41:53 -0700

Hi,

Comparing the performance of fft-test and fft-fixed-test on a Cortex-A8(with Neon support enabled) I only see a very small performance increasewith the 16-bit fixed point version compared to the float version,regardless of the FFT size (64, 256, 4096). I didn't see anyperformance numbers in Måns' original patch post.

The obvious question is what limits the performance of the fixed-pointimplementation? My assumption being that for many of the operationsinvolved, it should be possible to process twice the amount of elementsin the same amount of time.

(The underlying data type isn't changed for the fixed-point test (i.e.the data is not 16-bit packed), but for small sizes the L1 data cacheshould be pretty warm anyway so I don't suspect that the implementationis throttled by memory.)



Thanks,
Orjan

--
Orjan Friberg
FlatFrog Laboratories AB
_______________________________________________
libav-devel mailing list
libav-devel@libav.org
https://lists.libav.org/mailman/listinfo/libav-devel

[libav-devel] NEON: performance comparison of fixed-point and float FFT

Reply via email to