On 2016-12-01 11:26:59 +0200, Martin Storsjö wrote: > This work is sponsored by, and copyright, Google. > > This reduces the code size of libavcodec/aarch64/vp9itxfm_neon.o from > 19496 to 14740 bytes. > > This gives a small slowdown of a couple of tens of cycles, but makes > it more feasible to add more optimized versions of these transforms. > > Before: > vp9_inv_dct_dct_16x16_sub4_add_neon: 1036.7 > vp9_inv_dct_dct_16x16_sub16_add_neon: 1372.2 > vp9_inv_dct_dct_32x32_sub4_add_neon: 5180.0 > vp9_inv_dct_dct_32x32_sub32_add_neon: 8095.7 > > After: > vp9_inv_dct_dct_16x16_sub4_add_neon: 1051.0 > vp9_inv_dct_dct_16x16_sub16_add_neon: 1390.1 > vp9_inv_dct_dct_32x32_sub4_add_neon: 5199.9 > vp9_inv_dct_dct_32x32_sub32_add_neon: 8125.8 > --- > libavcodec/aarch64/vp9itxfm_neon.S | 42 > +++++++++++++++++++++++--------------- > 1 file changed, 25 insertions(+), 17 deletions(-)
ok Janne _______________________________________________ libav-devel mailing list libav-devel@libav.org https://lists.libav.org/mailman/listinfo/libav-devel