Re: [FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-23 Thread James Almer
On 6/23/2017 7:11 PM, Michael Niedermayer wrote: > On Fri, Jun 23, 2017 at 10:35:43PM +0200, Paul B Mahol wrote: >> On 6/23/17, Michael Niedermayer wrote: >>> On Fri, Jun 23, 2017 at 02:51:37AM +0100, Rostislav Pehlivanov wrote: c:1802 decicycles in fft15,16774635 runs, 2581 skips

Re: [FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-23 Thread Michael Niedermayer
On Fri, Jun 23, 2017 at 10:35:43PM +0200, Paul B Mahol wrote: > On 6/23/17, Michael Niedermayer wrote: > > On Fri, Jun 23, 2017 at 02:51:37AM +0100, Rostislav Pehlivanov wrote: > >> c:1802 decicycles in fft15,16774635 runs, 2581 skips > >> avx: 865 decicycles in fft15,16776378 runs,838

Re: [FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-23 Thread Paul B Mahol
On 6/23/17, Michael Niedermayer wrote: > On Fri, Jun 23, 2017 at 02:51:37AM +0100, Rostislav Pehlivanov wrote: >> c:1802 decicycles in fft15,16774635 runs, 2581 skips >> avx: 865 decicycles in fft15,16776378 runs,838 skips >> >> Signed-off-by: Rostislav Pehlivanov >> --- >> libavcode

Re: [FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-23 Thread Henrik Gramner
On Fri, Jun 23, 2017 at 10:18 PM, Michael Niedermayer wrote: > seems to fail to build here: > > libavcodec/x86/mdct15.asm:116: error: invalid combination of opcode and > operands > libavcodec/x86/mdct15.asm:117: error: invalid combination of opcode and > operands > libavcodec/x86/mdct15.asm:118:

Re: [FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-23 Thread Michael Niedermayer
On Fri, Jun 23, 2017 at 02:51:37AM +0100, Rostislav Pehlivanov wrote: > c:1802 decicycles in fft15,16774635 runs, 2581 skips > avx: 865 decicycles in fft15,16776378 runs,838 skips > > Signed-off-by: Rostislav Pehlivanov > --- > libavcodec/mdct15.c | 182 > ++

[FFmpeg-devel] [PATCH v4] mdct15: add assembly optimizations for the 15-point FFT

2017-06-22 Thread Rostislav Pehlivanov
c:1802 decicycles in fft15,16774635 runs, 2581 skips avx: 865 decicycles in fft15,16776378 runs,838 skips Signed-off-by: Rostislav Pehlivanov --- libavcodec/mdct15.c | 182 +-- libavcodec/mdct15.h | 26 +++ libavcodec/x86