I updated the GHASH implementation for x86_64 architecture to use 2-way
block processing, it seems to me from Agner Fog's instruction tables that
this change makes sense starting from Broadwell arch for intel processors
while it has zero improvement on zen3/zen4 architectures. You can find the
I apologize for the delays. I pushed a patch that implements 4-way block
processing of poly1305 using AVX2 instructions based on radix 26.
https://git.lysator.liu.se/nettle/nettle/-/merge_requests/58
regards.
Mamone
On Sun, Nov 6, 2022 at 8:08 AM Maamoun TK wrote:
> On Fri, Nov 4, 2022 at