Here is more information with higher verbosity: > mpirun -np 2 --mca pml ucx --mca osc ucx --bind-to core --map-by core > --rank-by core --mca pml_ucx_verbose 100 --mca osx_ucxv_erbose 100 --mca > bml_base_verbose 100 mpi_executable
[tin2:1137672] mca: base: components_register: registering framework bml components [tin2:1137672] mca: base: components_register: found loaded component r2 [tin2:1137672] mca: base: components_register: component r2 register function successful [tin2:1137672] mca: base: components_open: opening bml components [tin2:1137672] mca: base: components_open: found loaded component r2 [tin2:1137672] mca: base: components_open: component r2 open function successful [tin2:1137671] mca: base: components_register: registering framework bml components [tin2:1137671] mca: base: components_register: found loaded component r2 [tin2:1137671] mca: base: components_register: component r2 register function successful [tin2:1137671] mca: base: components_open: opening bml components [tin2:1137671] mca: base: components_open: found loaded component r2 [tin2:1137671] mca: base: components_open: component r2 open function successful -------------------------------------------------------------------------- WARNING: There is at least non-excluded one OpenFabrics device found, but there are no active ports detected (or Open MPI was unable to use them). This is most certainly not what you wanted. Check your cables, subnet manager configuration, etc. The openib BTL will be ignored for this job. Local host: tin2 -------------------------------------------------------------------------- [tin2:1137672] common_ucx.c:174 using OPAL memory hooks as external events [tin2:1137672] pml_ucx.c:198 mca_pml_ucx_open: UCX version 1.11.2 [tin2:1137671] common_ucx.c:174 using OPAL memory hooks as external events [tin2:1137671] pml_ucx.c:198 mca_pml_ucx_open: UCX version 1.11.2 [tin2:1137671] common_ucx.c:333 posix/memory: did not match transport list [tin2:1137671] common_ucx.c:333 sysv/memory: did not match transport list [tin2:1137671] common_ucx.c:333 self/memory0: did not match transport list [tin2:1137671] common_ucx.c:333 tcp/lo: did not match transport list [tin2:1137671] common_ucx.c:333 tcp/eth0: did not match transport list [tin2:1137671] common_ucx.c:337 support level is none -------------------------------------------------------------------------- No components were able to be opened in the pml framework. This typically means that either no components of this type were installed, or none of the installed components can be loaded. Sometimes this means that shared libraries required by these components are unable to be found/loaded. Host: tin2 Framework: pml -------------------------------------------------------------------------- [tin2:1137671] PML ucx cannot be selected