Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-05 Thread Martin Storsjö
On Thu, 3 Nov 2016, Janne Grunau wrote: On 2016-11-02 14:58:41 +0200, Martin Storsjö wrote: --- libavcodec/aarch64/Makefile | 2 + libavcodec/aarch64/vp9dsp_init_aarch64.c | 139 ++ libavcodec/aarch64/vp9mc_neon.S | 733 +++

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Martin Storsjö
On Thu, 3 Nov 2016, Janne Grunau wrote: On 2016-11-02 14:58:41 +0200, Martin Storsjö wrote: --- libavcodec/aarch64/Makefile | 2 + libavcodec/aarch64/vp9dsp_init_aarch64.c | 139 ++ libavcodec/aarch64/vp9mc_neon.S | 733 +++

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Janne Grunau
On 2016-11-03 23:36:08 +0200, Martin Storsjö wrote: > On Thu, 3 Nov 2016, Diego Biurrun wrote: > > >On Thu, Nov 03, 2016 at 11:02:17PM +0200, Martin Storsjö wrote: > >>On Thu, 3 Nov 2016, Diego Biurrun wrote: > >>>On Thu, Nov 03, 2016 at 09:42:44PM +0100, Janne Grunau wrote: > On 2016-11-03

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Janne Grunau
On 2016-11-02 14:58:41 +0200, Martin Storsjö wrote: > --- > libavcodec/aarch64/Makefile | 2 + > libavcodec/aarch64/vp9dsp_init_aarch64.c | 139 ++ > libavcodec/aarch64/vp9mc_neon.S | 733 > +++ > libavcodec/vp9.h |

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Martin Storsjö
On Thu, 3 Nov 2016, Diego Biurrun wrote: On Thu, Nov 03, 2016 at 11:02:17PM +0200, Martin Storsjö wrote: On Thu, 3 Nov 2016, Diego Biurrun wrote: >On Thu, Nov 03, 2016 at 09:42:44PM +0100, Janne Grunau wrote: >>On 2016-11-03 20:38:11 +0100, Diego Biurrun wrote: >>> On Wed, Nov 02, 2016 at

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Diego Biurrun
On Thu, Nov 03, 2016 at 11:02:17PM +0200, Martin Storsjö wrote: > On Thu, 3 Nov 2016, Diego Biurrun wrote: > >On Thu, Nov 03, 2016 at 09:42:44PM +0100, Janne Grunau wrote: > >>On 2016-11-03 20:38:11 +0100, Diego Biurrun wrote: > >>> On Wed, Nov 02, 2016 at 07:24:03PM +0100, Janne Grunau wrote: >

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Martin Storsjö
On Thu, 3 Nov 2016, Diego Biurrun wrote: On Thu, Nov 03, 2016 at 09:42:44PM +0100, Janne Grunau wrote: On 2016-11-03 20:38:11 +0100, Diego Biurrun wrote: > On Wed, Nov 02, 2016 at 07:24:03PM +0100, Janne Grunau wrote: > > On 2016-11-02 15:29:34 +0100, Diego Biurrun wrote: > > > On Wed, Nov 02,

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Diego Biurrun
On Thu, Nov 03, 2016 at 09:42:44PM +0100, Janne Grunau wrote: > On 2016-11-03 20:38:11 +0100, Diego Biurrun wrote: > > On Wed, Nov 02, 2016 at 07:24:03PM +0100, Janne Grunau wrote: > > > On 2016-11-02 15:29:34 +0100, Diego Biurrun wrote: > > > > On Wed, Nov 02, 2016 at 04:00:38PM +0200, Martin

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Janne Grunau
On 2016-11-03 20:38:11 +0100, Diego Biurrun wrote: > On Wed, Nov 02, 2016 at 07:24:03PM +0100, Janne Grunau wrote: > > On 2016-11-02 15:29:34 +0100, Diego Biurrun wrote: > > > On Wed, Nov 02, 2016 at 04:00:38PM +0200, Martin Storsjö wrote: > > > > On Wed, 2 Nov 2016, Diego Biurrun wrote: > > > >

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-03 Thread Diego Biurrun
On Wed, Nov 02, 2016 at 07:24:03PM +0100, Janne Grunau wrote: > On 2016-11-02 15:29:34 +0100, Diego Biurrun wrote: > > On Wed, Nov 02, 2016 at 04:00:38PM +0200, Martin Storsjö wrote: > > > On Wed, 2 Nov 2016, Diego Biurrun wrote: > > > >On Wed, Nov 02, 2016 at 03:23:14PM +0200, Martin Storsjö

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Janne Grunau
On 2016-11-02 15:29:34 +0100, Diego Biurrun wrote: > On Wed, Nov 02, 2016 at 04:00:38PM +0200, Martin Storsjö wrote: > > On Wed, 2 Nov 2016, Diego Biurrun wrote: > > >On Wed, Nov 02, 2016 at 03:23:14PM +0200, Martin Storsjö wrote: > > >>On Wed, 2 Nov 2016, Martin Storsjö wrote: > > >>Technically,

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Diego Biurrun
On Wed, Nov 02, 2016 at 04:00:38PM +0200, Martin Storsjö wrote: > On Wed, 2 Nov 2016, Diego Biurrun wrote: > >On Wed, Nov 02, 2016 at 03:23:14PM +0200, Martin Storsjö wrote: > >>On Wed, 2 Nov 2016, Martin Storsjö wrote: > >>Technically, having a _neon prefix for them is wrong, but anything else >

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Luca Barbato
On 02/11/2016 14:23, Martin Storsjö wrote: > Technically, having a _neon prefix for them is wrong If we are sure that there won't be anything neon but without those instructions I wouldn't be much concerned. ___ libav-devel mailing list

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Martin Storsjö
On Wed, 2 Nov 2016, Diego Biurrun wrote: On Wed, Nov 02, 2016 at 03:23:14PM +0200, Martin Storsjö wrote: On Wed, 2 Nov 2016, Martin Storsjö wrote: Technically, having a _neon prefix for them is wrong, but anything else (omitting these two while hooking up avg32/avg64 separately) is more

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Diego Biurrun
On Wed, Nov 02, 2016 at 03:23:14PM +0200, Martin Storsjö wrote: > On Wed, 2 Nov 2016, Martin Storsjö wrote: > > Technically, having a _neon prefix for them is wrong, but anything else > (omitting these two while hooking up avg32/avg64 separately) is more > complication - although I'm open for

Re: [libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Martin Storsjö
On Wed, 2 Nov 2016, Martin Storsjö wrote: This work is sponsored by, and copyright, Google. These are ported from the ARM version; it is essentially a 1:1 port with no extra added features, but with some hand tuning (especially for the plain copy/avg functions). The ARM version isn't very

[libav-devel] [PATCHv2] aarch64: vp9: Add NEON optimizations of VP9 MC functions

2016-11-02 Thread Martin Storsjö
This work is sponsored by, and copyright, Google. These are ported from the ARM version; it is essentially a 1:1 port with no extra added features, but with some hand tuning (especially for the plain copy/avg functions). The ARM version isn't very register starved to begin with, so there's not