Maamoun TK <maamoun...@googlemail.com> writes:

> On Mon, Nov 30, 2020 at 11:18 PM Maamoun TK <maamoun...@googlemail.com>
> wrote:
>
>> on POWER9 I get the following benchmark with ". /configure
>> --enable-power-altivec":
>>
>> chacha      encrypt  763.57
>> chacha      decrypt  780.64
>>
>> regards,
>> Mamone
>>
>
> I got this result using ppc-chacha-2core branch on same machine:
>
> chacha      encrypt  565.79
> chacha      decrypt  582.10

I've tried running the benchmark on gcc135, and that gives me much more
consistent values than gcc112. The 2-way code (currently on master
branch) gives 686 Mbyte/2. The 4-way code you tried gives 958 MByte/s. I
then replaced the innerloop with a versino with better interleaving,
written by Torbjörn Granlund (just pushed to the branch). That gives
1225 Mbyte/s.

And for reference, the plain C implementation gives 363 MByte/s.

Regards,
/Niels

-- 
Niels Möller. PGP-encrypted email is preferred. Keyid 368C6677.
Internet email is subject to wholesale government surveillance.
_______________________________________________
nettle-bugs mailing list
nettle-bugs@lists.lysator.liu.se
http://lists.lysator.liu.se/mailman/listinfo/nettle-bugs

Reply via email to