Jack O'Connor <oconnor...@gmail.com> added the comment:

Version 0.1.3 of the official BLAKE3 repo includes some significant performance 
improvements:

- The x86 implementations include explicit prefetch instructions, which help 
with long inputs. (commit b8c33e1)
- The C implementation now uses the same parallel parent hashing strategy that 
the Rust implementation uses. (commit 163f522)

When I repeat the benchmarks above with TurboBoost on, here's what I see now:

BLAKE3 Rust          2578 MB/s
BLAKE3 C (clang -O3) 2502 MB/s
BLAKE3 C (gcc -O2)   2223 MB/s
K12 C (gcc -O2)      2175 MB/s
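
For anyone who wants to sanity-check figures of this kind, here is a minimal
sketch of measuring single-threaded hash throughput in MB/s. It is not the
harness that produced the numbers above. BLAKE3 is not in hashlib, so
blake2b and sha256 are used purely as stand-ins, and the function name,
buffer size, and repeat count are assumptions made for illustration.

```python
import hashlib
import time


def throughput_mb_per_s(hash_name, data, repeats=5):
    """Return the best observed throughput in MB/s (1 MB = 10**6 bytes).

    hash_name is any algorithm accepted by hashlib.new(); the fastest of
    `repeats` runs is reported to reduce noise from other processes.
    """
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        hashlib.new(hash_name, data).digest()
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    # Guard against a zero reading on very coarse timers.
    best = max(best, 1e-9)
    return len(data) / best / 1e6


if __name__ == "__main__":
    # 64 MiB of zeros: long enough that per-call overhead is negligible.
    data = bytes(64 * 1024 * 1024)
    for name in ("blake2b", "sha256"):  # stand-ins, not BLAKE3 or K12
        print(f"{name:10s} {throughput_mb_per_s(name, data):8.0f} MB/s")
```

Results depend heavily on the CPU, on frequency scaling such as TurboBoost,
and on the compiler flags used to build the underlying library, so numbers
from a sketch like this are only comparable within a single machine.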

Larry, if you have time to repeat your benchmarks with the latest C code, I'd 
be curious to see if you get similar results.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<https://bugs.python.org/issue39298>
_______________________________________