Hi Timur,
I don't think this is apples-to-apples comparison.
In OpenSHMEM world "MPI_waitall" would be mapped to shmem_quiet().
Even with this mapping, shmem_quiet() has a *stronger* completion semantics if
you compare it to MPI_waitall.
Quiet guarantees that the data was delivered to a remot
Hello!
What param can i tune to increase perfomance(scalability) for my app (all to
all pattern with message size = constant/nnodes)?
I can read this faq for mpi, but is it correct for shmem?
I have 2 programm doing the same thing(with same input) each node send
messages(message size = constan