Re: [FFmpeg-devel] [PATCH] avcodec/v210: add avx2 version of the line encoder

2016-01-14 Thread James Darnley
On 2016-01-14 21:42, Henrik Gramner wrote: > On Thu, Jan 14, 2016 at 9:27 PM, James Darnley > wrote: >> On 2016-01-14 20:21, Henrik Gramner wrote: >>> xmN can be used unconditionally which gets rid of the %else. E.g. >>> >>> movu xm1, [yq+widthq*2] >>> %if

Re: [FFmpeg-devel] [PATCH] avcodec/v210: add avx2 version of the line encoder

2016-01-14 Thread Henrik Gramner
On Thu, Jan 14, 2016 at 9:27 PM, James Darnley wrote: > On 2016-01-14 20:21, Henrik Gramner wrote: >> xmN can be used unconditionally which gets rid of the %else. E.g. >> >> movu xm1, [yq+widthq*2] >> %if cpuflag(avx2) >> vinserti128 m1, m1,

Re: [FFmpeg-devel] [PATCH] avcodec/v210: add avx2 version of the line encoder

2016-01-14 Thread James Darnley
On 2016-01-14 20:21, Henrik Gramner wrote: > On Wed, Jan 13, 2016 at 4:55 PM, James Darnley > wrote: >> diff --git a/libavcodec/x86/v210enc.asm b/libavcodec/x86/v210enc.asm >> index 859e2d9..a8f3d3c 100644 >> --- a/libavcodec/x86/v210enc.asm >> +++

[FFmpeg-devel] [PATCH] avcodec/v210: add avx2 version of the line encoder

2016-01-13 Thread James Darnley
Around 35% faster than the avx version. --- libavcodec/v210enc.c | 5 ++-- libavcodec/v210enc.h | 1 + libavcodec/x86/v210enc.asm| 53 +++ libavcodec/x86/v210enc_init.c | 7 ++ 4 files changed, 49 insertions(+), 17 deletions(-)