Re: [Arm64, PowerPC64, S390x] Optimize Poly1305

2022-01-19 Thread Maamoun TK
On Wed, Jan 19, 2022 at 10:06 PM Niels Möller wrote:
> Maamoun TK writes:
>
> > The patches have 41.88% speedup for arm64, 142.95% speedup for powerpc64,
> > and 382.65% speedup for s390x.
> >
> > OpenSSL is still ahead in terms of performance speed since it uses 4-way
> > interleaving or maybe

Re: [Arm64, PowerPC64, S390x] Optimize Poly1305

2022-01-19 Thread Niels Möller
Maamoun TK writes:
> The patches have 41.88% speedup for arm64, 142.95% speedup for powerpc64,
> and 382.65% speedup for s390x.
>
> OpenSSL is still ahead in terms of performance speed since it uses 4-way
> interleaving or maybe more!!
> Increasing the interleaving ways more than two has nothing

Re: [Arm64, S390x] Optimize Chacha20

2022-01-19 Thread Maamoun TK
On Wed, Jan 19, 2022 at 8:48 PM Niels Möller wrote:
> Maamoun TK writes:
>
> > I created merge requests that have improvements of Chacha20 for arm64 and
> > s390x architectures by following the approach used in powerpc
> > implementation.

Re: [Arm64, S390x] Optimize Chacha20

2022-01-19 Thread Niels Möller
Maamoun TK writes:
> I created merge requests that have improvements of Chacha20 for arm64 and
> s390x architectures by following the approach used in powerpc
> implementation.
> https://git.lysator.liu.se/nettle/nettle/-/merge_requests/37

[S390x] Optimize various elliptic curve functions

2022-01-19 Thread Maamoun TK
I created merge requests that have improvements of prime modulo functions of elliptic curves and scalar multiply of curve25519 and curve448 for s390x architecture
https://git.lysator.liu.se/nettle/nettle/-/merge_requests/42
https://git.lysator.liu.se/nettle/nettle/-/merge_requests/43
The prime