I see assorted problems with OMPI 4.1 on IB, including failing many of
the mpich tests (non-mpich-specific ones) particularly with RMA.  Now I
wonder if UCX build options could have anything to do with it, but I
haven't found any relevant information.

What configure options would be recommended with CUDA and ConnectX-5 IB?
(This is on POWER, but I presume that's irrelevant.)  I assume they
should be at least

--enable-cma --enable-mt --with-cuda --with-gdrcopy --with-verbs --with-mlx5-dv

but for a start I don't know what the relationship is between the cuda,
shared memory, and multi-threading options in OMPI and UCX.

Thanks for any enlightenment.

Reply via email to