David Miller <da...@davemloft.net> writes:

  Each load can issue in 1 cycle, there is a 4 cycle latency, the
  loads will fully pipeline.  Therefore the overhead is around 3n.
  
At most one memop / cycle?

  > Our current Karatsuba code (evaluating in 0, -1, oo) will suffer from
  > the forgotten subtraction instructions.  Evaluating in 0, +1, oo might
  > be better...
  
  As I said in my other reply it depends upon how you need this
  subtract.
  
Karatsuba need mpn_sub_n when in the interpolation if we evaluated in
-1.

-- 
Torbjörn
_______________________________________________
gmp-devel mailing list
gmp-devel@gmplib.org
http://gmplib.org/mailman/listinfo/gmp-devel

Reply via email to