David Miller <da...@davemloft.net> writes: Each load can issue in 1 cycle, there is a 4 cycle latency, the loads will fully pipeline. Therefore the overhead is around 3n. At most one memop / cycle?
> Our current Karatsuba code (evaluating in 0, -1, oo) will suffer from > the forgotten subtraction instructions. Evaluating in 0, +1, oo might > be better... As I said in my other reply it depends upon how you need this subtract. Karatsuba need mpn_sub_n when in the interpolation if we evaluated in -1. -- Torbjörn _______________________________________________ gmp-devel mailing list gmp-devel@gmplib.org http://gmplib.org/mailman/listinfo/gmp-devel