I pushed a few more variants of gcd_11 with nice speed improvements for
several x86_64 CPUs.  I am sure much more can be done.

I have a bunch of finished gcd_22 variants too; these are generic, without
any CPU-specific tweaks.  I haven't timed them; they have only been tested
for correctness.  It might be desirable to modify some of the gcd_11 loops
to do two-limb arithmetic, and use these to create gcd_22 inner loops.
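
For readers not familiar with the naming: gcd_11 takes two single-limb
operands, gcd_22 two double-limb operands.  The following is only a rough
sketch of the kind of loop involved, not the code that was pushed.  It
assumes 64-bit limbs, both inputs odd, and a GCC/Clang-style
__builtin_ctzll; the name gcd_11_sketch is made up for illustration.

```c
#include <stdint.h>

/* Hypothetical sketch, not GMP's implementation.
   Binary gcd of two odd single-limb values.
   Assumes: u and v are both odd; __builtin_ctzll is available. */
static uint64_t
gcd_11_sketch (uint64_t u, uint64_t v)
{
  while (u != v)
    {
      if (u > v)
        {
          u -= v;                     /* odd - odd: even and nonzero */
          u >>= __builtin_ctzll (u);  /* strip factors of 2, odd again */
        }
      else
        {
          v -= u;
          v >>= __builtin_ctzll (v);
        }
    }
  return u;
}
```

A tuned variant would typically replace the branch with masks or
conditional moves.  A gcd_22 inner loop along these lines would do the
same subtract and shift steps on two-limb values, with borrow propagation
between the limbs.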

-- 
Torbjörn
Please encrypt, key id 0xC8601622
_______________________________________________
gmp-devel mailing list
gmp-devel@gmplib.org
https://gmplib.org/mailman/listinfo/gmp-devel
