Zimmermann Paul paul.zimmerm...@loria.fr writes:
the gmp-devel list is for Technical discussions between developers. We have
seen recently several patches posted, which I believe do no match the list
definition. If there is no other way to transfer source code, maybe one should
create a
On 2013-03-06 09:53:05 +0100, Paul Zimmermann wrote:
Niels,
We need a forum for both posting patches, and discussing the same
patches. To me, it seems easiest to have both on the same list.
however maybe not all people on gmp-devel are interested by the
detail of all patches and
I think all you T3/T4 changes are now in. Please check that I didn't
mess something up.
Thanks for this contribution!
--
Torbjörn
___
gmp-devel mailing list
gmp-devel@gmplib.org
http://gmplib.org/mailman/listinfo/gmp-devel
David Miller da...@davemloft.net writes:
From: Torbjorn Granlund t...@gmplib.org
Date: Wed, 06 Mar 2013 00:08:09 +0100
The addmul code could be simularly improved.
Grumble... and I did this work already, I sent older versions
of my T3/T4 changes, let me go see how I screwed this
From: Torbjorn Granlund t...@gmplib.org
Date: Wed, 06 Mar 2013 12:36:34 +0100
I think all you T3/T4 changes are now in. Please check that I didn't
mess something up.
Thanks for this contribution!
Looks good, there is some trailing whitespace in the ChangeLog but
that's probably my fault:
From: Torbjorn Granlund t...@gmplib.org
Date: Thu, 07 Mar 2013 00:51:24 +0100
David Miller da...@davemloft.net writes:
From: Torbjorn Granlund t...@gmplib.org
Date: Wed, 06 Mar 2013 12:36:34 +0100
I think all you T3/T4 changes are now in. Please check that I didn't
mess
David Miller da...@davemloft.net writes:
I optimised submul_1.asm, and then edited both addmul_1 and submul_1 to
use as similar operand order as possible. Please test these using
tests/devel/try, and please time this new submul_1.
The testsuite starts failing very early with these
From: Torbjorn Granlund t...@gmplib.org
Date: Thu, 07 Mar 2013 07:00:00 +0100
Why doesn't your functions use the 'return' insns, btw?
They were expensive at one point, but I just checked and on T4 using
return appears to be quite cheap.
PS. I have created a poor man's T3 with some m4 macros.
From: David Miller da...@davemloft.net
Date: Thu, 07 Mar 2013 01:06:55 -0500 (EST)
I'll test your routines with the obvious fix in a moment.
With the one-liner fix both of your new implementations work.
submul_1 is now much better, about 5.8 cycles per limb on T4.
David Miller da...@davemloft.net writes:
From: David Miller da...@davemloft.net
Date: Thu, 07 Mar 2013 01:06:55 -0500 (EST)
I'll test your routines with the obvious fix in a moment.
With the one-liner fix both of your new implementations work.
Thanks for testing!
submul_1 is
10 matches
Mail list logo