Also using IB.

I tried with options
—with-ft ulfm —mca pml ob1 —mca btl_tcp_if_include ib0 —mca btl tcp,sm,self

And still hangs.

Build was just 5.0.2 (ulfm already on by default):

—prefix=xx —with-ucx=pathtoucx1.15 —with-slurm




Sent from my iPhone

On Mar 24, 2024, at 2:00 PM, George Bosilca <bosi...@icl.utk.edu> wrote:


All the examples work for me on using ULFM ge87f595 compiled with minimalistic options:
'--prefix=XXX --enable-picky --enable-debug --disable-heterogeneous --enable-contrib-no-build=vt --enable-mpirun-prefix-by-default --enable-mpi-ext=ftmpi --with-ft=mpi --with-pmi'.

I run using ipoib, so I select the sm,self, tcp BTL and the OB1 PML.

  George.


On Sat, Mar 23, 2024 at 6:33 PM Dean Anderson via users <users@lists.open-mpi.org> wrote:
If someone could take a look at https://github.com/open-mpi/ompi/issues/11404
and provide some guidance or a work around, I’d appreciate it.

The SC-22 Tutorials work just fine, but only on a single node.  If you arrange multiple nodes, it hangs in MPI_Finalize.

I attended the SC22 Tutorial and it was not my impression that UFLM only worked if your tasks were all on a single node.



Thanks!!


Sent from my iPhone

Reply via email to