Some benchmarks on core2-unknown-linux-gnu (sage.math)Intel(R) Xeon(R) CPU X7460 @ 2.66GHz
8307 on trunk r1623 , should be same score as mpir-0.9.0 10252 on core-2 branch r1623 a 23.4% speedup . I keep getting make[5]: warning: Clock skew detected. Your build may be incomplete. so hopefully the above timings are correct I found the core-2 mul/sqr_basecase code , but there was only windows versions , whatever happened to the linux stuff. > On Friday 20 February 2009 14:12:19 ja...@njkfrudils.plus.com wrote: > > What happened to core-2 mul_basecase and sqr_basecase ? , no-wonder core-2 > benchmarks are crap > > > Running the k8/k10 asm code with no changes on the core2 machine sage we > > get this > > > > popcount,hamdist no popcount instruction > > > > slowdowns--------- > > add,sub are 0.50x > > rshift1,lshift1 0.70x > > k8 lshift,rshift 0.91x > > addmul_1,submul_1 0.89x but faster for <20 limbs... > > > > speedups---------- > > and,ior,xor are 1.13x > > nand,nior,xnor,andn,iorn are 1.50x > > com is 2.00x > > divebyff 1.40x although not better until 12limbs > > diveby3 2.30x > > addadd,addsub 1.50x > > sumdiff 1.26x > > addlsh1 1.50x > > sublsh1 1.40x > > k10 lshift,rshift 1.18x > > mul_1 1.04x > > > > for mul basecase we get > > ./speed -c -r -s 1-40 mpn_jaytest mpn_mul_basecase > > overhead 6.12 cycles, precision 10000 units of 3.75e-10 secs, CPU freq > > 2666.76 MHz > > mpn_jaytest mpn_mul_basecase > > 1 #9.21 2.3531 > > 2 #21.43 2.0070 > > 3 #56.00 1.5234 > > 4 #91.36 1.4960 > > 5 #136.17 1.4320 > > 6 #195.54 1.3769 > > 7 #261.22 1.3718 > > 8 #336.56 1.3482 > > 9 #419.62 1.3441 > > 10 #527.14 1.3054 > > 11 #634.44 1.3105 > > 12 #744.00 1.3172 > > 13 #873.85 1.2931 > > 14 #1024.55 1.1088 > > 15 #1169.00 1.0873 > > 16 #1328.89 1.0704 > > 17 #1492.50 1.0672 > > 18 #1710.00 1.0317 > > 19 #1880.00 1.0488 > > 20 #2112.00 1.0246 > > 21 #2288.00 1.0385 > > 22 #2547.50 1.0128 > > 23 #2787.50 1.0063 > > 24 #3012.50 1.0108 > > 25 #3212.50 1.0241 > > 26 3556.67 #0.9953 > > 27 3836.67 #0.9939 > > 28 4106.67 #0.9935 > > 29 4370.00 #0.9908 > > 30 4700.00 #0.9617 > > 31 4996.67 #0.9973 > > 32 5380.00 #0.9944 > > 33 5685.00 #0.9727 > > 34 6105.00 #0.9853 > > 35 6375.00 #0.9906 > > 36 6775.00 #0.9764 > > 37 7540.00 #0.9151 > > 38 7515.00 #0.9714 > > 39 #7955.00 1.0578 > > 40 #8750.00 1.0086 > > > > This is all with no tweeking on > > cpu family : 6 > > model : 29 > > model name : Intel(R) Xeon(R) CPU X7460 @ 2.66GHz > > stepping : 1 > > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "mpir-devel" group. To post to this group, send email to mpir-devel@googlegroups.com To unsubscribe from this group, send email to mpir-devel+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/mpir-devel?hl=en -~----------~----~----~----~------~----~------~--~---