Hi,

Also, add a missing include of opensslconf.h so that we properly
get the OPENSSL_SYSNAME_ULTRASPARC define even in the 32-bit case.

These changes give a pretty reasonable speed boost.

On a SPARC T4-2, without these changes:

type             16 bytes     64 bytes    256 bytes   1024 bytes   8192 bytes
des cbc          39148.28k    40071.83k    40669.53k    40685.80k    40864.43k
des ede3         14358.54k    14528.02k    14611.88k    14632.96k    14641.83k

and with them:

type             16 bytes     64 bytes    256 bytes   1024 bytes   8192 bytes
des cbc          54334.03k    56280.92k    57101.01k    57549.14k    57619.80k
des ede3         20010.98k    20682.62k    20834.90k    20873.56k    20886.87k

I can't confirm your "before" numbers. I get

type       16 bytes     64 bytes    256 bytes   1024 bytes   8192 bytes
des cbc    51501.71k    53611.82k    54637.06k    54721.74k    54976.51k
des ede3   19572.77k    20307.14k    20472.83k    20513.45k    20526.42k

That kind of makes sense, because the difference between "before" and "after" is the difference between 8 byte loads plus 6 shifts-n-ors and a pair of little-endian 32-bit loads, and there is no way the former would be that much slower. Therefore I reckon that http://git.openssl.org/gitweb/?p=openssl.git;a=commitdiff;h=d8f3ed23062539c4a6f7b4153633f921ce16a15f is sufficient.
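
For illustration only, here is a rough portable C sketch of the two ways of fetching one 8-byte DES block that are being compared. It is not the code from either patch: the function names load_bytes, load_le32 and load_words are made up for this example, and the runtime byte swap in load_le32 merely stands in for what I assume would be a single little-endian load instruction on SPARC.

```c
#include <stdint.h>
#include <string.h>

/* Variant 1: eight single-byte loads, merged into two little-endian
 * 32-bit words with six shifts and six ors. */
static void load_bytes(const unsigned char *in, uint32_t *l, uint32_t *r)
{
    *l = (uint32_t)in[0]       | (uint32_t)in[1] << 8 |
         (uint32_t)in[2] << 16 | (uint32_t)in[3] << 24;
    *r = (uint32_t)in[4]       | (uint32_t)in[5] << 8 |
         (uint32_t)in[6] << 16 | (uint32_t)in[7] << 24;
}

/* One 32-bit load interpreted as little-endian. The swap below is only
 * a portable emulation (assumption: real SPARC code would not need it). */
static uint32_t load_le32(const unsigned char *p)
{
    const uint16_t probe = 1;
    uint32_t v;

    memcpy(&v, p, sizeof v);                  /* alignment-safe 32-bit load */
    if (*(const unsigned char *)&probe == 0)  /* big-endian host */
        v = (v >> 24) | ((v >> 8) & 0x0000ff00u) |
            ((v << 8) & 0x00ff0000u) | (v << 24);
    return v;
}

/* Variant 2: a pair of little-endian 32-bit loads. */
static void load_words(const unsigned char *in, uint32_t *l, uint32_t *r)
{
    *l = load_le32(in);
    *r = load_le32(in + 4);
}
```

Both variants yield the same two words for any input block; the only difference is the handful of extra ALU operations in the first one, which is small next to the work of the DES rounds themselves.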
______________________________________________________________________
OpenSSL Project                                 http://www.openssl.org
Development Mailing List                       openssl-dev@openssl.org
Automated List Manager                           majord...@openssl.org
