Re: [OMPI users] UFLM only works on a single node???

2024-03-24 Thread Dean Anderson via users
Also using IB.

I tried with options

--with-ft ulfm --mca pml ob1 --mca btl_tcp_if_include ib0 --mca btl tcp,sm,self

And still hangs.

Build was just 5.0.2 (ulfm already on by default):

--prefix=xx --with-ucx=pathtoucx1.15 --with-slurm

Sent from my iPhone

On Mar 24, 2024, at 2:00 PM, George Bosilca wrote:

> All the examples work for me on using ULFM ge87f595 compiled with
> minimalistic options:
> '--prefix=XXX --enable-picky --enable-debug --disable-heterogeneous
> --enable-contrib-no-build=vt --enable-mpirun-prefix-by-default
> --enable-mpi-ext=ftmpi --with-ft=mpi --with-pmi'.
>
> I run using ipoib, so I select the sm,self, tcp BTL and the OB1 PML.
>
>   George.
>
> On Sat, Mar 23, 2024 at 6:33 PM Dean Anderson via users wrote:
>
>> If someone could take a look at
>> https://github.com/open-mpi/ompi/issues/11404
>> and provide some guidance or a work around, I’d appreciate it.
>>
>> The SC-22 Tutorials work just fine, but only on a single node.  If you
>> arrange multiple nodes, it hangs in MPI_Finalize.
>>
>> I attended the SC22 Tutorial and it was not my impression that UFLM only
>> worked if your tasks were all on a single node.
>>
>> Thanks!!
>>
>> Sent from my iPhone
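
For reference, the run-time options listed in the reply above can be assembled into a single launch line. This is only a sketch: the process count and the binary name `./ulfm_example` are hypothetical placeholders that do not come from the thread, and it assumes the command is issued inside a Slurm allocation spanning more than one node. The script prints the command so it can be inspected before it is run.

```shell
# Hypothetical placeholders (not from the thread): process count and binary.
NP=4
APP=./ulfm_example

# Options quoted from the thread: enable ULFM at run time, force the ob1 PML,
# and restrict the BTLs to tcp, sm and self with TCP bound to the IPoIB
# interface ib0.
FT_OPTS="--with-ft ulfm"
MCA_OPTS="--mca pml ob1 --mca btl tcp,sm,self --mca btl_tcp_if_include ib0"

CMD="mpirun $FT_OPTS -np $NP $MCA_OPTS $APP"

# Print the assembled command; launch it by pasting the printed line
# into a shell inside the multi-node allocation.
echo "$CMD"
```

Replacing `ib0` with the name of the interface that is actually up on every node is the first thing to check when a multi-node run hangs, since the interface name is site-specific.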



Re: [OMPI users] UFLM only works on a single node???

2024-03-24 Thread George Bosilca via users
All the examples work for me on using ULFM ge87f595 compiled with
minimalistic options:
'--prefix=XXX --enable-picky --enable-debug --disable-heterogeneous
--enable-contrib-no-build=vt --enable-mpirun-prefix-by-default
--enable-mpi-ext=ftmpi --with-ft=mpi --with-pmi'.

I run over IPoIB, so I select the sm, self, and tcp BTLs and the ob1 PML.

  George.


On Sat, Mar 23, 2024 at 6:33 PM Dean Anderson via users <
users@lists.open-mpi.org> wrote:

> If someone could take a look at
> https://github.com/open-mpi/ompi/issues/11404
> and provide some guidance or a work around, I’d appreciate it.
>
> The SC-22 Tutorials work just fine, but only on a single node.  If you
> arrange multiple nodes, it hangs in MPI_Finalize.
>
> I attended the SC22 Tutorial and it was not my impression that UFLM only
> worked if your tasks were all on a single node.
>
>
>
> Thanks!!
>
>
> Sent from my iPhone
>