ni...@lysator.liu.se (Niels Möller) writes: And for neon instructions, cycle numbers are in http://infocenter.arm.com/help/topic/com.arm.doc.ddi0388i/DDI0388I_cortex_a9_r4p1_trm.pdf Page?
Seems it should be able to do one vmull per cycle. Not sure how to get latency from the given table, but maybe 6 cycles. If that is true, and the clock is the same as the main CPU, and if we can sum things up at that speed, we could expect a 4-fold improvement compared to the current GMP code for Arm. -- Torbjörn _______________________________________________ gmp-devel mailing list gmp-devel@gmplib.org http://gmplib.org/mailman/listinfo/gmp-devel