On 23/02/2014 20:09, Bill Hart wrote:
> After a lot of work I have managed to remove the performance problem
> with the new division code on penryn.
> 
> Two important facts about core2/penryn are that it is always better to
> save muls and always better to save memory read/writes where possible,
> since both take a long time on that architecture.
> 
> So I now have a version of the code which performs well on Intel and
> AMD. Unfortunately the difference in the basecase range is much less
> pronounced on AMD, being only up to about 20% faster, with an average of
> more like 10%. However, the performance in the divide-and-conquer range
> has improved by 3-4% and we now beat GMP by 25% at certain points.
> 
> I still need to tune a couple of crossovers, but the new division code
> shouldn't much in the way of changes now.

Is this improvement likely to work on Nehalem/Sandy Bridge/Ivy Bridge?

   Brian

-- 
You received this message because you are subscribed to the Google Groups 
"mpir-devel" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to mpir-devel+unsubscr...@googlegroups.com.
To post to this group, send email to mpir-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/mpir-devel.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to