In my opinion, this is yet another reason to use a proper cryptography library (OpenSSL, GnuTLS or libgcrypt) instead of a custom implementation for this kind of algorithm.
Over time, as these libraries add support for cryptography acceleration instructions on more architectures, all programs that use them will benefit. I would expect that many of the more capable ARM SoCs for phones, laptops and servers already have such instructions, and that OpenSSL already supports them.

What device did you run your benchmark on?

Regards
Stephan