Re: [RFC PATCH 11/20] crypto: BLAKE2s - x86_64 implementation

2019-09-30 Thread Sebastian Siewior
On 2019-09-30 04:42:06 [+0200], Jason A. Donenfeld wrote: > Hi Sebastian, Thomas, Hi Jason, > On Sun, Sep 29, 2019 at 7:39 PM Ard Biesheuvel > wrote: > > + for (;;) { > > + const size_t blocks = min_t(size_t, nblocks, > > + PAGE_SIZE /

Re: [RFC PATCH 11/20] crypto: BLAKE2s - x86_64 implementation

2019-09-29 Thread Linus Torvalds
On Sun, Sep 29, 2019 at 7:42 PM Jason A. Donenfeld wrote: > > I had previously put quite some effort into the simd_get, simd_put, > simd_relax mechanism, so that the simd state could be persisted during > both several calls to the same function and within long loops like > below, with simd_relax e

Re: [RFC PATCH 11/20] crypto: BLAKE2s - x86_64 implementation

2019-09-29 Thread Jason A. Donenfeld
Hi Sebastian, Thomas, Take a look at the below snippet from this patch. I had previously put quite some effort into the simd_get, simd_put, simd_relax mechanism, so that the simd state could be persisted during both several calls to the same function and within long loops like below, with simd_re

[RFC PATCH 11/20] crypto: BLAKE2s - x86_64 implementation

2019-09-29 Thread Ard Biesheuvel
From: "Jason A. Donenfeld" These implementations from Samuel Neves support AVX and AVX-512VL. Originally this used AVX-512F, but Skylake thermal throttling made AVX-512VL more attractive and possible to do with negligable difference. Signed-off-by: Jason A. Donenfeld Signed-off-by: Samuel Neves