Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-05 Thread Matt Oliver
On 6 October 2015 at 07:51, Ronald S. Bultje wrote: > Hi, > > On Mon, Oct 5, 2015 at 7:39 AM, Ronald S. Bultje > wrote: > > > Hi, > > > > On Mon, Oct 5, 2015 at 5:54 AM, Henrik Gramner > wrote: > > > >> On Mon, Oct 5, 2015 at 10:55 AM, Matt Oliver > >> wrote: > >> > This patch has broken the 3

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-05 Thread Ronald S. Bultje
Hi, On Mon, Oct 5, 2015 at 7:39 AM, Ronald S. Bultje wrote: > Hi, > > On Mon, Oct 5, 2015 at 5:54 AM, Henrik Gramner wrote: > >> On Mon, Oct 5, 2015 at 10:55 AM, Matt Oliver >> wrote: >> > This patch has broken the 32bit msvc builds: >> > >> http://fate.ffmpeg.org/report.cgi?time=2015100506510

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-05 Thread Ronald S. Bultje
Hi, On Mon, Oct 5, 2015 at 5:54 AM, Henrik Gramner wrote: > On Mon, Oct 5, 2015 at 10:55 AM, Matt Oliver wrote: > > This patch has broken the 32bit msvc builds: > > > http://fate.ffmpeg.org/report.cgi?time=20151005065109&slot=x86_32-msvc12-windows-native > > > > I had a look through the code bu

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-05 Thread Henrik Gramner
On Mon, Oct 5, 2015 at 10:55 AM, Matt Oliver wrote: > This patch has broken the 32bit msvc builds: > http://fate.ffmpeg.org/report.cgi?time=20151005065109&slot=x86_32-msvc12-windows-native > > I had a look through the code but couldnt find the cause. The error message > just points to the code lin

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-05 Thread Matt Oliver
On 3 October 2015 at 21:05, Ronald S. Bultje wrote: > --- > libavcodec/x86/Makefile | 1 + > libavcodec/x86/constants.c | 4 + > libavcodec/x86/constants.h | 2 + > libavcodec/x86/h264_idct_10bit.asm | 5 +- > libavcodec/x86/

[FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-03 Thread Ronald S. Bultje
--- libavcodec/x86/Makefile | 1 + libavcodec/x86/constants.c | 4 + libavcodec/x86/constants.h | 2 + libavcodec/x86/h264_idct_10bit.asm | 5 +- libavcodec/x86/h264_intrapred_10bit.asm | 2 +- libavcodec/x86/vp9dsp_init.

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-03 Thread Henrik Gramner
On Sat, Oct 3, 2015 at 2:12 AM, Ronald S. Bultje wrote: > Well, they prototype is different. For H/V, it's not critical, but for the > directional ones, the edge handling is very quirky so I wanted to do that > in C, so l/a are arguments instead of part of the source buffer. > > (And because we do

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-02 Thread Ronald S. Bultje
Hi, On Fri, Oct 2, 2015 at 5:31 PM, Henrik Gramner wrote: > On Fri, Sep 25, 2015 at 11:24 PM, Ronald S. Bultje > wrote: > > +++ b/libavcodec/x86/vp9intrapred_16bpp.asm > > > +cglobal vp9_ipred_v_4x4_16, 2, 4, 1, dst, stride, l, a > > +cglobal vp9_ipred_v_8x8_16, 2, 4, 1, dst, stride, l, a > > +

[FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-02 Thread Ronald S. Bultje
--- libavcodec/x86/Makefile | 1 + libavcodec/x86/constants.c | 4 + libavcodec/x86/constants.h | 2 + libavcodec/x86/h264_idct_10bit.asm | 5 +- libavcodec/x86/h264_intrapred_10bit.asm | 2 +- libavcodec/x86/vp9dsp_init.

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-10-02 Thread Henrik Gramner
On Fri, Sep 25, 2015 at 11:24 PM, Ronald S. Bultje wrote: > +++ b/libavcodec/x86/vp9intrapred_16bpp.asm > +cglobal vp9_ipred_v_4x4_16, 2, 4, 1, dst, stride, l, a > +cglobal vp9_ipred_v_8x8_16, 2, 4, 1, dst, stride, l, a > +cglobal vp9_ipred_v_16x16_16, 2, 4, 2, dst, stride, l, a > +cglobal vp9_ip

[FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-09-25 Thread Ronald S. Bultje
--- libavcodec/x86/Makefile | 1 + libavcodec/x86/constants.c | 6 + libavcodec/x86/constants.h | 3 + libavcodec/x86/h264_idct_10bit.asm | 5 +- libavcodec/x86/h264_intrapred_10bit.asm | 2 +- libavcodec/x86/vp9dsp_init.

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-09-25 Thread Ronald S. Bultje
Hi, On Fri, Sep 25, 2015 at 5:09 PM, Christophe Gisquet < christophe.gisq...@gmail.com> wrote: > Hi, > > 2015-09-25 22:36 GMT+02:00 Ronald S. Bultje : > > +pd_16: times 8 dd 16 > > In h264_intrapred_10bit.asm > > > +pd_32: times 8 dd 32 > > In h264_idct_10bit.asm > > And that's about anything rem

Re: [FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-09-25 Thread Christophe Gisquet
Hi, 2015-09-25 22:36 GMT+02:00 Ronald S. Bultje : > +pd_16: times 8 dd 16 In h264_intrapred_10bit.asm > +pd_32: times 8 dd 32 In h264_idct_10bit.asm And that's about anything remotely useful I have to say, as you were careful about those duplications. > +pmaxsw m0, m4 > +

[FFmpeg-devel] [PATCH] vp9: 16bpp tm/dc/h/v intra pred simd (mostly sse2) functions.

2015-09-25 Thread Ronald S. Bultje
--- libavcodec/x86/Makefile | 1 + libavcodec/x86/constants.c | 2 + libavcodec/x86/constants.h | 1 + libavcodec/x86/vp9dsp_init.h| 23 + libavcodec/x86/vp9dsp_init_16bpp.c | 15 + libavcodec/x86/vp9dsp_init_16