On 23/02/2014 20:09, Bill Hart wrote: > After a lot of work I have managed to remove the performance problem > with the new division code on penryn. > > Two important facts about core2/penryn are that it is always better to > save muls and always better to save memory read/writes where possible, > since both take a long time on that architecture. > > So I now have a version of the code which performs well on Intel and > AMD. Unfortunately the difference in the basecase range is much less > pronounced on AMD, being only up to about 20% faster, with an average of > more like 10%. However, the performance in the divide-and-conquer range > has improved by 3-4% and we now beat GMP by 25% at certain points. > > I still need to tune a couple of crossovers, but the new division code > shouldn't much in the way of changes now.
Is this improvement likely to work on Nehalem/Sandy Bridge/Ivy Bridge? Brian -- You received this message because you are subscribed to the Google Groups "mpir-devel" group. To unsubscribe from this group and stop receiving emails from it, send an email to mpir-devel+unsubscr...@googlegroups.com. To post to this group, send email to mpir-devel@googlegroups.com. Visit this group at http://groups.google.com/group/mpir-devel. For more options, visit https://groups.google.com/groups/opt_out.