Hi,
Try increasing the IB timeout parameter: --mca btl_mvapi_ib_timeout 14
If 14 does not work, try increasing it a little more (16).
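
For a rough sense of what these values mean, here is a small Python sketch. It assumes the parameter is used as the exponent in the standard InfiniBand local ACK timeout formula, 4.096 microseconds * 2^value; that mapping is an assumption for the mvapi BTL and has not been checked against its source. The helper name ib_timeout_seconds is made up for illustration.

```python
# Sketch only: assumes btl_mvapi_ib_timeout is the exponent in the
# InfiniBand local ACK timeout formula (4.096 us * 2^value).
# ib_timeout_seconds is a hypothetical helper, not part of Open MPI.

def ib_timeout_seconds(exponent: int) -> float:
    """Return the approximate timeout in seconds for a given exponent."""
    return 4.096e-6 * (2 ** exponent)


if __name__ == "__main__":
    for value in (14, 16):
        ms = ib_timeout_seconds(value) * 1e3
        print(f"btl_mvapi_ib_timeout={value}: ~{ms:.1f} ms")
```

Under that assumption, each step of 1 doubles the timeout, so going from 14 to 16 makes it four times longer.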
Thanks,
Pasha
Neil Ludban wrote:
Hi,
I'm getting the errors below when calling MPI_Alltoallv() as part of
a matrix transpose operation. It's 100% repeatable when testing with
16M matrix elements divided between 64 processes on 32 dual core nodes.
There are never any errors with fewer processes or elements, including
the same 32