Re: [PATCH 6/7] sparc: Add MD5 assembler for sparcv9.

Andy Polyakov Sun, 23 Sep 2012 13:54:15 -0700

The techniques used in this plain v9 implementation are:

1) Use little-endian 32-bit loads when input data is aligned.


2) Avoid having to accumulate into the context hash values every
   loop iteration.

3) In the aligned case try to seperate the loads from the first
   use by as many instructions as possible, without sacrificing
   the schedule too much.

4) Attempt to dual-issue as much as possible on UltraSPARC-I/II/III/IV
   and SPARC-T4.

I had an old module lying around, dusted it off inhttp://cvs.openssl.org/chngview?cn=22842. It's 20% faster than yourversion on US pre-Tx. Improvement coefficient is likely to be evenhigher on T1, because it keeps everything in register bank and there areno loads except for input. Not really relevant, but it's nominallyfaster even on T4.

______________________________________________________________________
OpenSSL Project                                 http://www.openssl.org
Development Mailing List                       [email protected]
Automated List Manager                           [email protected]

Re: [PATCH 6/7] sparc: Add MD5 assembler for sparcv9.

Reply via email to