I did some quick benchmarks using some older version of Brian's
benchmark code (we have a new version coming with more tests). The
penryn machine was heavily loaded, so I don't trust the values very
much, especially the pi ones, though I did all the benchmarks under
identical conditions. The K10 ones should be more accurate:

penryn:

Squaring:         GMP 4.3.0 MPIR 1.2.0  MPIR 1.3.0
==========         ========  =========  ==========

128 x 128 :        50993028   51293922    51666716
512 x 512 :         9275740    7974568     8009900
8192 x 8192 :         98217      99896      100503
131072 x 131072 :      1537       1702        1724
2097152 x 2097152 :    68.2       79.5        80.6

Multiplication:
==========
128 x 128 :        50993028   51666716    51081525
512 x 512 :         7334238    7970325     8001954
8192 x 8192 :         70880      74547       74547
131072 x 131072 :      1159       1286        1294
2097152 x 2097152 :    47.3       55.9        57.3

Unbalanced:
==========
15000 x 10000 :       34617      36381       35832
20000 x 10000 :       26373      28333       27913
30000 x 10000 :       15672      16786       16804
16777216 x 512 :        223        232         233
16777216 x 262144 :    8.96       10.7        10.8

Division :
=========
8192 / 32 :          800782     672164      928075
8192 / 64 :          795968     683094      680328
8192 / 128 :         526394     365184      367509
8192 / 4096 :        119806     115299      114019
8192 / 8064 :       1646375    1618972     6714754
131072 / 65536 :       1373       1372        1393
8388608 / 4194304 :    4.03       4.40        4.47
16777216 / 262144 :    2.69       2.79        2.81

GCD :
====
128 x 128 :         2229116    2131425     2191565
512 x 512 :          230145     156351      214626
8192 x 8192 :          5852       5453        5648
131072 x 131072 :      88.8       89.0        91.8
1048576 x 1048576 :    4.17       4.34        4.59

XGCD :
=====
128 x 128 :          991393     694605      909401
512 x 512 :          179531     110886      148946
8192 x 8192 :          3700       2438        3428
131072 x 131072 :      52.1       51.4        54.5
1048576 x 1048576 :    2.79       2.66        2.90

RSA :
====
512 :                 16190      15020       15019
1024 :                 3021       3128        3112
2048 :                  477        475         478

Pi :
===
10000 :                 328        332         335
100000 :               16.7       18.0        18.8
1000000 :              1.01       1.13        1.16

Overall :              1489       1460        1565

======================================

K10:

Squaring:         GMP 4.3.0 MPIR 1.2.0  MPIR 1.3.0
==========         ========  =========  ==========
128 x 128 :        52219920   55564735    55553695
512 x 512 :        12321692   12420943    12148952
8192 x 8192 :        159844     161622      162057
131072 x 131072 :      2483       2689        2714
2097152 x 2097152 :    88.8       99.8         101

Multiplication:
==========
128 x 128 :        52219920   55605250    55505909
512 x 512 :        10656260   12418369    12251151
8192 x 8192 :        108145     112543      111985
131072 x 131072 :      1712       2022        2012
2097152 x 2097152 :    58.2       68.0        68.3

Unbalanced:
==========
15000 x 10000 :       53859      56976       54998
20000 x 10000 :       41146      44883       43507
30000 x 10000 :       23808      25990       25729
16777216 x 512 :        418        363         358
16777216 x 262144 :    10.9       11.4        11.4

Division :
=========
8192 / 32 :         1417991    1245813     1537953
8192 / 64 :         1432755    1247058     1248305
8192 / 128 :         871200     674187      678989
8192 / 4096 :        170418     178373      194700
8192 / 8064 :       2284081    2109811     7527466
131072 / 65536 :       2056       2138        2503
8388608 / 4194304 :    5.31       6.16        3.42
16777216 / 262144 :    3.93       4.33        4.41

GCD :
====
128 x 128 :         2060071    1889462     1878091
512 x 512 :          235945     181500      195688
8192 x 8192 :          7351       6591        6551
131072 x 131072 :       129        128         129
1048576 x 1048576 :    5.72       6.25        6.51

XGCD :
=====
128 x 128 :          918836     396791      661980
512 x 512 :          176947      76129      135265
8192 x 8192 :          5030       3021        4395
131072 x 131072 :      79.6       77.0        81.6
1048576 x 1048576 :    3.69       3.90        4.26

RSA :
====
512 :                 19198      21685       21623
1024 :                 4048       4750        4768
2048 :                  752        823         822

Pi :
===
10000 :                 635        656         664
100000 :               27.7       31.6        32.7
1000000 :              1.43       1.71        1.75

Overall :              1420       1449        1539

I guess the first order of business will be to work out why the large
speed regression in the 8388608 / 4194304 division on K10 (but not
core2). Everything else is looking good though.

We need to retune for the release, so these figures are preliminary.

Bill.

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"mpir-devel" group.
To post to this group, send email to mpir-devel@googlegroups.com
To unsubscribe from this group, send email to 
mpir-devel+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/mpir-devel?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to