Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-12-03 Thread Michael Niedermayer
On Wed, Dec 03, 2014 at 10:39:00PM +0100, Reimar Döffinger wrote:
> On Wed, Dec 03, 2014 at 01:19:48PM +0100, Michael Niedermayer wrote:
> > On Wed, Dec 03, 2014 at 09:00:39AM +0100, Reimar Döffinger wrote:
> > > On 03.12.2014, at 01:40, Michael Niedermayer wrote:
> > > > On Sat, Nov 22, 2014 at 0

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-12-03 Thread Reimar Döffinger
On Wed, Dec 03, 2014 at 01:19:48PM +0100, Michael Niedermayer wrote:
> On Wed, Dec 03, 2014 at 09:00:39AM +0100, Reimar Döffinger wrote:
> > On 03.12.2014, at 01:40, Michael Niedermayer wrote:
> > > On Sat, Nov 22, 2014 at 02:09:01PM +0100, Reimar Döffinger wrote:
> > >> On Mon, Nov 17, 2014 at 01

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-12-03 Thread Michael Niedermayer
On Wed, Dec 03, 2014 at 09:00:39AM +0100, Reimar Döffinger wrote:
> On 03.12.2014, at 01:40, Michael Niedermayer wrote:
> > On Sat, Nov 22, 2014 at 02:09:01PM +0100, Reimar Döffinger wrote:
> >> On Mon, Nov 17, 2014 at 01:41:13PM +0100, Michael Niedermayer wrote:
> >>> On Mon, Nov 17, 2014 at 08:1

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-12-03 Thread Reimar Döffinger
On 03.12.2014, at 01:40, Michael Niedermayer wrote:
> On Sat, Nov 22, 2014 at 02:09:01PM +0100, Reimar Döffinger wrote:
>> On Mon, Nov 17, 2014 at 01:41:13PM +0100, Michael Niedermayer wrote:
>>> On Mon, Nov 17, 2014 at 08:19:32AM +0100, Reimar Döffinger wrote:
>>>> On 17.11.2014, at 02:37, Michae

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-12-02 Thread Michael Niedermayer
On Sat, Nov 22, 2014 at 02:09:01PM +0100, Reimar Döffinger wrote:
> On Mon, Nov 17, 2014 at 01:41:13PM +0100, Michael Niedermayer wrote:
> > On Mon, Nov 17, 2014 at 08:19:32AM +0100, Reimar Döffinger wrote:
> > > On 17.11.2014, at 02:37, Michael Niedermayer wrote:
> > > > On Sat, Nov 15, 2014 at 0

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-11-22 Thread Reimar Döffinger
On Mon, Nov 17, 2014 at 01:41:13PM +0100, Michael Niedermayer wrote:
> On Mon, Nov 17, 2014 at 08:19:32AM +0100, Reimar Döffinger wrote:
> > On 17.11.2014, at 02:37, Michael Niedermayer wrote:
> > > On Sat, Nov 15, 2014 at 06:16:03PM +0100, Reimar Döffinger wrote:
> > >> 11674 -> 10877 decicycles

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-11-17 Thread Michael Niedermayer
On Mon, Nov 17, 2014 at 08:19:32AM +0100, Reimar Döffinger wrote:
> On 17.11.2014, at 02:37, Michael Niedermayer wrote:
> > On Sat, Nov 15, 2014 at 06:16:03PM +0100, Reimar Döffinger wrote:
> >> 11674 -> 10877 decicycles on my Phenom II.
> >> Overall speedup was unfortunately within measurement er

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-11-16 Thread Reimar Döffinger
On 17.11.2014, at 02:37, Michael Niedermayer wrote:
> On Sat, Nov 15, 2014 at 06:16:03PM +0100, Reimar Döffinger wrote:
>> 11674 -> 10877 decicycles on my Phenom II.
>> Overall speedup was unfortunately within measurement error.
>
> here it's 10153 -> 10135
I suspect it also depends a bit on the

Re: [FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-11-16 Thread Michael Niedermayer
On Sat, Nov 15, 2014 at 06:16:03PM +0100, Reimar Döffinger wrote:
> 11674 -> 10877 decicycles on my Phenom II.
> Overall speedup was unfortunately within measurement error.
here it's 10153 -> 10135,
but I've a slightly odd feeling about the changes to the asm code; I'm not sure if all assemblers wil

[FFmpeg-devel] [PATCH] h264_i386: Optimize decode_significance_8x8_x86 for 64 bit.

2014-11-15 Thread Reimar Döffinger
11674 -> 10877 decicycles on my Phenom II.
Overall speedup was unfortunately within measurement error.
Signed-off-by: Reimar Döffinger
---
 libavcodec/x86/h264_i386.h | 30 ++
 1 file changed, 18 insertions(+), 12 deletions(-)
diff --git a/libavcodec/x86/h264_i386.h b/
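
For anyone comparing the two measurements quoted in this thread (11674 -> 10877 and 10153 -> 10135 decicycles), here is a rough sketch of the arithmetic. It assumes a decicycle is a tenth of a CPU cycle, which is my understanding of how FFmpeg's timer macros report it. The helper names decicycles_to_cycles and percent_reduction are made up for illustration and are not part of FFmpeg or of the patch.

```c
#include <assert.h>

/* Hypothetical helper: convert a decicycle count to CPU cycles,
 * assuming 1 decicycle = 1/10 cycle. */
double decicycles_to_cycles(double decicycles)
{
    return decicycles / 10.0;
}

/* Hypothetical helper: relative reduction in percent when going
 * from `before` to `after` (positive means `after` is faster). */
double percent_reduction(double before, double after)
{
    assert(before > 0.0); /* avoid dividing by zero */
    return (before - after) / before * 100.0;
}
```

Called as percent_reduction(11674, 10877) this gives a reduction of a little under 7% for the function on the Phenom II, while percent_reduction(10153, 10135) gives well under 1% for the second machine. These figures are for the function alone; the thread reports the overall decoding speedup as within measurement error.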