On Thu, 10 Mar 2016, BRADLEY, PETER C PW wrote:
This is an academic exercise, obviously. The curve shown comes from one pair
of ranks running on the same node alternating between MPI_Send and
MPI_Recv. The most likely suspect is a cache effect, but rather than assuming,
I was curious if
You're trying to read absurd huge message sizes considering you're busy
testing the memory bandwidth of your system in this manner.
As soon as the message gets larger than your CPU's caching
system it has to copy the message several times via your RAM, falls
outside CPU's L2 or L3 cache and
You might want to post in beowulf mailing list see cc
and you want to install linux of course.
OpenFabrics releases openmpi, yet it only works at a limited number
of distributions - most important is having
the correct kernel (usually old kernel).
I'm gonna try get it to work at debian soon.