On Tue, 16 Aug 2022, Hubert Mazur wrote:

Add arm64 neon implementation for functions from motion estimation
family. All of them were tested and benchmarked using checkasm tool.
The rare code paths, e.g. when filter_size % 4 != 0 were also tested.


Instructions were manualy deinterleaved to reach best performance.

You probably mean "interleaved", as deinterleaved would be how it was initially, which is detrimental for performance.

Overall I think this patchset is close enough now. There were a bunch of minor details left on the patches, but I'll fix that up locally and push them, instead of doing yet another round of these. I'll comment and point out the details I changed - please pay attention to them for future patches though!

// Martin

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to