Ciao, Il Mar, 13 Agosto 2019 10:38 pm, Torbjörn Granlund ha scritto: > I pushed a few more variants of gcd_11 with nice speed improvements for > several x86_64 CPUs. I am sure much more can be done.
I'm happy to see that all of them start with the sequence mov v0, %rax sub u0, v0 You widely developed my small idea: to keep the value in %rax :-D Now, I'd like to see if it is possible to integrate those loops with a more flexible interface, to really start using them in the library... Ĝis, m -- http://bodrato.it/papers/ _______________________________________________ gmp-devel mailing list gmp-devel@gmplib.org https://gmplib.org/mailman/listinfo/gmp-devel