Some benchmarks on core2-unknown-linux-gnu  (sage.math)Intel(R) Xeon(R) CPU     
      
X7460  @ 2.66GHz

8307 on trunk r1623 , should be same score as mpir-0.9.0
10252 on core-2 branch r1623 

a 23.4% speedup .

I keep getting 
make[5]: warning:  Clock skew detected.  Your build may be incomplete.
so hopefully the above timings are correct

I found the core-2 mul/sqr_basecase code , but there was only windows 
versions , whatever happened to the linux stuff.

> On Friday 20 February 2009 14:12:19 ja...@njkfrudils.plus.com wrote:
>
> What happened to core-2 mul_basecase and sqr_basecase ? , no-wonder core-2
> benchmarks are crap
>
> > Running the k8/k10 asm code with no changes on the core2 machine sage we
> > get this
> >
> > popcount,hamdist   no popcount instruction
> >
> > slowdowns---------
> > add,sub are 0.50x
> > rshift1,lshift1 0.70x
> > k8 lshift,rshift 0.91x
> > addmul_1,submul_1  0.89x   but faster for <20 limbs...
> >
> > speedups----------
> > and,ior,xor are 1.13x
> > nand,nior,xnor,andn,iorn are 1.50x
> > com is 2.00x
> > divebyff 1.40x  although not better until 12limbs
> > diveby3  2.30x
> > addadd,addsub 1.50x
> > sumdiff 1.26x
> > addlsh1 1.50x
> > sublsh1 1.40x
> > k10 lshift,rshift 1.18x
> > mul_1   1.04x
> >
> > for mul basecase we get
> > ./speed -c -r -s 1-40 mpn_jaytest mpn_mul_basecase
> > overhead 6.12 cycles, precision 10000 units of 3.75e-10 secs, CPU freq
> > 2666.76 MHz
> >           mpn_jaytest mpn_mul_basecase
> > 1               #9.21        2.3531
> > 2              #21.43        2.0070
> > 3              #56.00        1.5234
> > 4              #91.36        1.4960
> > 5             #136.17        1.4320
> > 6             #195.54        1.3769
> > 7             #261.22        1.3718
> > 8             #336.56        1.3482
> > 9             #419.62        1.3441
> > 10            #527.14        1.3054
> > 11            #634.44        1.3105
> > 12            #744.00        1.3172
> > 13            #873.85        1.2931
> > 14           #1024.55        1.1088
> > 15           #1169.00        1.0873
> > 16           #1328.89        1.0704
> > 17           #1492.50        1.0672
> > 18           #1710.00        1.0317
> > 19           #1880.00        1.0488
> > 20           #2112.00        1.0246
> > 21           #2288.00        1.0385
> > 22           #2547.50        1.0128
> > 23           #2787.50        1.0063
> > 24           #3012.50        1.0108
> > 25           #3212.50        1.0241
> > 26            3556.67       #0.9953
> > 27            3836.67       #0.9939
> > 28            4106.67       #0.9935
> > 29            4370.00       #0.9908
> > 30            4700.00       #0.9617
> > 31            4996.67       #0.9973
> > 32            5380.00       #0.9944
> > 33            5685.00       #0.9727
> > 34            6105.00       #0.9853
> > 35            6375.00       #0.9906
> > 36            6775.00       #0.9764
> > 37            7540.00       #0.9151
> > 38            7515.00       #0.9714
> > 39           #7955.00        1.0578
> > 40           #8750.00        1.0086
> >
> > This is all with no tweeking on
> > cpu family      : 6
> > model           : 29
> > model name      : Intel(R) Xeon(R) CPU           X7460  @ 2.66GHz
> > stepping        : 1
>
> 


--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"mpir-devel" group.
To post to this group, send email to mpir-devel@googlegroups.com
To unsubscribe from this group, send email to 
mpir-devel+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/mpir-devel?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to