ni...@lysator.liu.se (Niels Möller) writes:

  And for neon instructions, cycle numbers are in
  
http://infocenter.arm.com/help/topic/com.arm.doc.ddi0388i/DDI0388I_cortex_a9_r4p1_trm.pdf
  
Which page?

  Seems it should be able to do one vmull per cycle. Not sure how to
  get latency from the given table, but maybe 6 cycles.
  
If that is true, if the NEON unit runs at the same clock as the main
CPU, and if we can sum up the products at that rate, we could expect a
4-fold improvement over the current GMP code for ARM.
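
To make the "sum things up" condition concrete, here is a rough sketch
in portable C of a mul_1-style loop with 32-bit limbs. It is not GMP
code and uses no NEON intrinsics. The helper widening_mul_x2 is a
hypothetical stand-in for what one vmull.u32 computes (two 32x32->64
products), and the name mul_1_sketch is made up for illustration. The
two products in each step are independent, but the carry additions
that follow form a serial chain, and that chain is what would have to
keep pace with the multiplier.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical model of one widening multiply: two 32-bit lanes times a
   scalar, giving two 64-bit products. Plain C, not the NEON intrinsic. */
static void widening_mul_x2(uint64_t prod[2], const uint32_t a[2], uint32_t b)
{
    prod[0] = (uint64_t)a[0] * b;
    prod[1] = (uint64_t)a[1] * b;
}

/* rp[0..n-1] = low limbs of up[0..n-1] * v; returns the carry-out limb.
   Assumes 32-bit limbs and n >= 1. */
uint32_t mul_1_sketch(uint32_t *rp, const uint32_t *up, size_t n, uint32_t v)
{
    assert(n >= 1);
    uint64_t carry = 0;
    size_t i = 0;

    /* Two limbs per step: the products are independent of each other. */
    for (; i + 2 <= n; i += 2) {
        uint64_t p[2];
        widening_mul_x2(p, up + i, v);

        /* Serial part. No overflow: (2^32-1)^2 + (2^32-1) < 2^64. */
        uint64_t t0 = p[0] + carry;
        rp[i] = (uint32_t)t0;
        uint64_t t1 = p[1] + (t0 >> 32);
        rp[i + 1] = (uint32_t)t1;
        carry = t1 >> 32;
    }

    /* Leftover limb when n is odd. */
    if (i < n) {
        uint64_t t = (uint64_t)up[i] * v + carry;
        rp[i] = (uint32_t)t;
        carry = t >> 32;
    }
    return (uint32_t)carry;
}
```

Whether the carry chain can actually run at one product per cycle on
the A9 is the open question; the sketch only shows where the
dependency sits.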

-- 
Torbjörn
_______________________________________________
gmp-devel mailing list
gmp-devel@gmplib.org
http://gmplib.org/mailman/listinfo/gmp-devel
