Rolf,
Whoowh! That's actually good news, since in our own tests hierarch is
always slower. But this might be due to various reasons, including the
fact, that we only have two cores per node. BTW: I actually would expect
IMB test to have worse performance for hierarch compared to many other
be
I am curious if anyone is doing any work currently on the hierarchical
collectives. I ask this because I just did some runs on a cluster made
up of 4 servers with 4 processors per server. I used TCP over IB. I
was running with np=16 and using the IMB benchmark to test MPI_Bcast.
What I am