Thanks Michael for your feedback.

On Wed, Aug 19, 2020 at 6:55 AM Michael Niedermayer <mich...@niedermayer.cc>
wrote:

> faster is better obviously, so if it's tested with odd sizes and arm
> developers had a chance to comment, it should be ok
>
>
The current patch was tested with `make check` on Arm64 Graviton2.
I have also tested randomly selected rescale factors, including odd
output widths, for example:

  ./ffmpeg -nostats -f lavfi -i testsrc2=4k:d=2 -vf bench=start,scale=1023x42,bench=stop -f null -


> one potential improvement is to use the unrolled code for odd width
> too and use the non unrolled for the end
>

Done.  Please see the amended patch.
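
For readers without the attachment, the idea can be sketched in plain C.
This is only an illustration under my own assumptions, not the code in
the patch (which is AArch64 assembly): the function names, the rounding
constant and the 19-bit shift are hypothetical stand-ins for a
yuv2planeX-style vertical filter. The unrolled loop handles as many
groups of 4 pixels as fit, for any width, and the non-unrolled loop
finishes the remaining 0 to 3 pixels.

```c
#include <stdint.h>

/* Hypothetical helper: clamp an int to the 0..255 range. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* Reference version: one output pixel per outer iteration. */
static void vfilter_ref(const int16_t *filter, int filter_size,
                        const int16_t **src, uint8_t *dest, int dst_w)
{
    for (int i = 0; i < dst_w; i++) {
        int val = 1 << 18;                  /* rounding term (assumed) */
        for (int j = 0; j < filter_size; j++)
            val += src[j][i] * filter[j];
        dest[i] = clip_u8(val >> 19);
    }
}

/* Outer loop unrolled by 4, with a non-unrolled loop for the tail. */
static void vfilter_unroll4(const int16_t *filter, int filter_size,
                            const int16_t **src, uint8_t *dest, int dst_w)
{
    int i = 0;

    /* Unrolled part: runs for every full group of 4 pixels,
     * whether or not dst_w is a multiple of 4. */
    for (; i + 4 <= dst_w; i += 4) {
        int v0 = 1 << 18, v1 = 1 << 18, v2 = 1 << 18, v3 = 1 << 18;
        for (int j = 0; j < filter_size; j++) {
            const int16_t *s = src[j] + i;
            int f = filter[j];              /* coefficient loaded once per 4 pixels */
            v0 += s[0] * f;
            v1 += s[1] * f;
            v2 += s[2] * f;
            v3 += s[3] * f;
        }
        dest[i]     = clip_u8(v0 >> 19);
        dest[i + 1] = clip_u8(v1 >> 19);
        dest[i + 2] = clip_u8(v2 >> 19);
        dest[i + 3] = clip_u8(v3 >> 19);
    }

    /* Tail: the remaining 0..3 pixels use the non-unrolled code. */
    for (; i < dst_w; i++) {
        int val = 1 << 18;
        for (int j = 0; j < filter_size; j++)
            val += src[j][i] * filter[j];
        dest[i] = clip_u8(val >> 19);
    }
}
```

Both functions should produce identical output for every width, so the
reference version is a convenient check for the unrolled one.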

Thanks,
Sebastian

Attachment: 0001-aarch64-yuv2planeX-unroll-outer-loop-by-4-increases-.patch
Description: Binary data

