Hi, I ran Ohio micro benchmarks for openmpi and noticed broadcast with smaller number of bytes is faster than a barrier - 2us vs 120us.
I'm trying to understand how this could happen? Thank you Saliya
Hi, I ran Ohio micro benchmarks for openmpi and noticed broadcast with smaller number of bytes is faster than a barrier - 2us vs 120us.
I'm trying to understand how this could happen? Thank you Saliya