Also using IB.I tried with options—with-ft ulfm —mca pml ob1 —mca btl_tcp_if_include ib0 —mca btl tcp,sm,selfAnd still hangs.Build was just 5.0.2 (ulfm already on by default):—prefix=xx —with-ucx=pathtoucx1.15 —with-slurmSent from my iPhoneOn Mar 24, 2024, at 2:00 PM, George Bosilca wrote:All the examples work for me on using ULFM ge87f595 compiled with minimalistic options:'--prefix=XXX --enable-picky --enable-debug --disable-heterogeneous --enable-contrib-no-build=vt --enable-mpirun-prefix-by-default --enable-mpi-ext=ftmpi --with-ft=mpi --with-pmi'.I run using ipoib, so I select the sm,self, tcp BTL and the OB1 PML. George.On Sat, Mar 23, 2024 at 6:33 PM Dean Anderson via users wrote:If someone could take a look at https://github.com/open-mpi/ompi/issues/11404
and provide some guidance or a work around, I’d appreciate it.
The SC-22 Tutorials work just fine, but only on a single node. If you arrange multiple nodes, it hangs in MPI_Finalize.
I attended the SC22 Tutorial and it was not my impression that UFLM only worked if your tasks were all on a single node.
Thanks!!
Sent from my iPhone