Confirmed that trunk version r32658 does pass the test. From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr., Howard Sent: Monday, September 15, 2014 4:16 PM To: Open MPI Developers Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives
Hi Rolf, This may be related to change set 32659. If you back this change out, do the tests pass? Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart Sent: Monday, September 15, 2014 8:55 AM To: de...@open-mpi.org<mailto:de...@open-mpi.org> Subject: [OMPI devel] coll ml error with some nonblocking collectives I wonder if anyone else is seeing this failure. Not sure when this started but it is only on the trunk. Here is a link to my failures as well as an example below that. There are a variety of nonblocking collectives failing like this. http://mtt.open-mpi.org/index.php?do_redir=2208 [rvandevaart@drossetti-ivy0 collective]$ mpirun --mca btl self,sm,tcp -host drossetti-ivy0,drossetti-ivy0,drossetti-ivy1,drossetti-ivy1 iallreduce -------------------------------------------------------------------------- ML detected an unrecoverable error on intrinsic communicator MPI_COMM_WORLD The program will now abort -------------------------------------------------------------------------- [drossetti-ivy0.nvidia.com:04664] 3 more processes have sent help message help-mpi-coll-ml.txt / coll-ml-check-fatal-error [rvandevaart@drossetti-ivy0 collective]$ ________________________________ This email message is for the sole use of the intended recipient(s) and may contain confidential information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message. ________________________________