Op di 5 jul. 2022 om 09:41 schreef olivier tristan <o.tris...@uvi.net>:
> You do not talk about the SSE 4.1 version in your bench. > > Have you tried this use case ? > I compared 4 compiles: one without any changes (so with all variants of the lpc functions, including the SSE4.1 ones) and three with variants of plain C code. As both CPUs that were tested had SSE4.1 capability, these functions were compared with. So yes, current GCC outperforms those SSE4.1 intrinsics functions on 16-bit inputs and comes close on 24-bit inputs.
_______________________________________________ flac-dev mailing list flac-dev@xiph.org http://lists.xiph.org/mailman/listinfo/flac-dev