I wonder if anyone else is seeing this failure. Not sure when this started but it is only on the trunk. Here is a link to my failures as well as an example below that. There are a variety of nonblocking collectives failing like this.
http://mtt.open-mpi.org/index.php?do_redir=2208 [rvandevaart@drossetti-ivy0 collective]$ mpirun --mca btl self,sm,tcp -host drossetti-ivy0,drossetti-ivy0,drossetti-ivy1,drossetti-ivy1 iallreduce -------------------------------------------------------------------------- ML detected an unrecoverable error on intrinsic communicator MPI_COMM_WORLD The program will now abort -------------------------------------------------------------------------- [drossetti-ivy0.nvidia.com:04664] 3 more processes have sent help message help-mpi-coll-ml.txt / coll-ml-check-fatal-error [rvandevaart@drossetti-ivy0 collective]$ ----------------------------------------------------------------------------------- This email message is for the sole use of the intended recipient(s) and may contain confidential information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message. -----------------------------------------------------------------------------------