Re: gmp-devel list

2013-03-06 Thread Niels Möller
Zimmermann Paul paul.zimmerm...@loria.fr writes: the gmp-devel list is for Technical discussions between developers. We have seen recently several patches posted, which I believe do no match the list definition. If there is no other way to transfer source code, maybe one should create a

Re: gmp-devel list

2013-03-06 Thread Vincent Lefevre
On 2013-03-06 09:53:05 +0100, Paul Zimmermann wrote: Niels, We need a forum for both posting patches, and discussing the same patches. To me, it seems easiest to have both on the same list. however maybe not all people on gmp-devel are interested by the detail of all patches and

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread Torbjorn Granlund
I think all you T3/T4 changes are now in. Please check that I didn't mess something up. Thanks for this contribution! -- Torbjörn ___ gmp-devel mailing list gmp-devel@gmplib.org http://gmplib.org/mailman/listinfo/gmp-devel

Re: [PATCH 0/3] Resubmit of Sparc T3/T4 patches.

2013-03-06 Thread Torbjorn Granlund
David Miller da...@davemloft.net writes: From: Torbjorn Granlund t...@gmplib.org Date: Wed, 06 Mar 2013 00:08:09 +0100 The addmul code could be simularly improved. Grumble... and I did this work already, I sent older versions of my T3/T4 changes, let me go see how I screwed this

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread David Miller
From: Torbjorn Granlund t...@gmplib.org Date: Wed, 06 Mar 2013 12:36:34 +0100 I think all you T3/T4 changes are now in. Please check that I didn't mess something up. Thanks for this contribution! Looks good, there is some trailing whitespace in the ChangeLog but that's probably my fault:

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread David Miller
From: Torbjorn Granlund t...@gmplib.org Date: Thu, 07 Mar 2013 00:51:24 +0100 David Miller da...@davemloft.net writes: From: Torbjorn Granlund t...@gmplib.org Date: Wed, 06 Mar 2013 12:36:34 +0100 I think all you T3/T4 changes are now in. Please check that I didn't mess

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread Torbjorn Granlund
David Miller da...@davemloft.net writes: I optimised submul_1.asm, and then edited both addmul_1 and submul_1 to use as similar operand order as possible. Please test these using tests/devel/try, and please time this new submul_1. The testsuite starts failing very early with these

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread David Miller
From: Torbjorn Granlund t...@gmplib.org Date: Thu, 07 Mar 2013 07:00:00 +0100 Why doesn't your functions use the 'return' insns, btw? They were expensive at one point, but I just checked and on T4 using return appears to be quite cheap. PS. I have created a poor man's T3 with some m4 macros.

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread David Miller
From: David Miller da...@davemloft.net Date: Thu, 07 Mar 2013 01:06:55 -0500 (EST) I'll test your routines with the obvious fix in a moment. With the one-liner fix both of your new implementations work. submul_1 is now much better, about 5.8 cycles per limb on T4.

Re: [PATCH 2/2] Optimize 64-bit mpn_add_N and mpn_sub_N for sparc T3 and later.

2013-03-06 Thread Torbjorn Granlund
David Miller da...@davemloft.net writes: From: David Miller da...@davemloft.net Date: Thu, 07 Mar 2013 01:06:55 -0500 (EST) I'll test your routines with the obvious fix in a moment. With the one-liner fix both of your new implementations work. Thanks for testing! submul_1 is