Re: [OMPI devel] [LOG_CAT_ML] component basesmuma is not available but requested in hierarchy: basesmuma, basesmuma, ucx_p2p:basesmsocket, basesmuma, p2p

2022-11-07 Thread Ben Menadue via devel
Hi, We see this on our cluster as well; we traced it to Python loading shared library extensions with RTLD_LOCAL. The Python module (mpi4py?) has a dependency on libmpi.so, which in turn has a dependency on libhcoll.so. So the Python module is being loaded with RTLD_LOCAL, anything tha
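
The mechanism described in this message can be illustrated with Python's own dlopen flag controls. What follows is a minimal sketch, assuming a Unix-like system. It only shows how to add RTLD_GLOBAL to the flags the interpreter uses for later extension-module imports. Whether doing this before importing the MPI module avoids the HCOLL message is an assumption that the truncated preview does not confirm, and the helper name `enable_global_symbols` is hypothetical.

```python
import os
import sys


def enable_global_symbols():
    """OR RTLD_GLOBAL into the flags CPython passes to dlopen() for later imports.

    Hypothetical helper for illustration; returns (previous_flags, new_flags).
    """
    previous = sys.getdlopenflags()
    updated = previous | os.RTLD_GLOBAL
    sys.setdlopenflags(updated)
    return previous, updated


previous, updated = enable_global_symbols()
# Extension modules imported from here on are opened with RTLD_GLOBAL set,
# so their symbols can be used to resolve references in libraries loaded later.
# Assumption: an import such as `from mpi4py import MPI` would follow here.
```

The flags are process-wide, so a cautious caller might restore `previous` with `sys.setdlopenflags(previous)` once the import has completed.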

Re: [OMPI devel] Fwd: --mca btl_base_verbose 30 not working in version 5.0

2022-11-07 Thread Jeff Squyres (jsquyres) via devel
Sorry; I missed that this email came in a week ago. 😕 The "btl_base_verbose" MCA param only works on the BTL components. The Linux "hostname(1)" command is not an MPI application, and therefore does not utilize any of the BTL components. Hence, you can set btl_base_verbose to whatever you wa

[OMPI devel] Open MPI v5.0.0 release timeline delay

2022-11-07 Thread Geoffrey Paulsen via devel
Open MPI developers, I’ve got some bad news about the Open MPI v5.0.0 release timeframe. IBM has asked Austen and me (and our team) to focus 100% on another project for the next two full weeks. Open MPI v5.0.x still has a few remaining blocking items including documentation, PRRTE 3.0 release, some

Re: [OMPI devel] [LOG_CAT_ML] component basesmuma is not available but requested in hierarchy: basesmuma, basesmuma, ucx_p2p:basesmsocket, basesmuma, p2p

2022-11-07 Thread Tomislav Janjusic via devel
Ugh - runtime command is literally in the e-mail. Sorry about that. -- Tomislav Janjusic Staff Eng., Mellanox, HPC SW +1 (512) 598-0386 NVIDIA From: Tomislav Janjusic Sent: Monday, November 7, 2022 8:48 AM To: 'Open MPI Developers' ; Open MPI Users Cc: mrlong Subject:

Re: [OMPI devel] [LOG_CAT_ML] component basesmuma is not available but requested in hierarchy: basesmuma, basesmuma, ucx_p2p:basesmsocket, basesmuma, p2p

2022-11-07 Thread Tomislav Janjusic via devel
What is the runtime command? It’s coming from HCOLL. If HCOLL is not needed, feel free to disable it with: -mca coll ^hcoll Tomislav Janjusic Staff Eng., Mellanox, HPC SW +1 (512) 598-0386 NVIDIA From: devel On Behalf Of mrlong via devel Sent: Monday, November 7, 2022 2:33 AM T
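
The suggestion in this message amounts to excluding the hcoll component from the coll framework. As a rough sketch, the same exclusion can be written either on the mpirun command line or through the `OMPI_MCA_coll` environment variable, which Open MPI reads as an MCA parameter. The helper names below are hypothetical, `test.py` is the script name from the original report in this thread, and nothing here launches a job.

```python
import os


def mpirun_without_hcoll(nprocs, program):
    """Build an mpirun argument list that excludes the hcoll collective component."""
    return ["mpirun", "--mca", "coll", "^hcoll", "-n", str(nprocs), *program]


def env_without_hcoll(base_env=None):
    """Return a copy of the environment carrying the equivalent MCA setting."""
    env = dict(os.environ if base_env is None else base_env)
    env["OMPI_MCA_coll"] = "^hcoll"
    return env


cmd = mpirun_without_hcoll(2, ["python", "test.py"])
print(" ".join(cmd))
```

To actually run the job, the list could be passed to `subprocess.run(cmd)`, or the original command could be run with `env=env_without_hcoll()` instead of the extra flags.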

[OMPI devel] There are not enough slots available in the system to satisfy the 2, slots that were requested by the application

2022-11-07 Thread mrlong via devel
*Two machines, each with 64 cores. The contents of the hosts file are:* 192.168.180.48 slots=1 192.168.60.203 slots=1 *Why do you get the following error when running with openmpi 5.0.0rc9?* (py3.9) [user@machine01 share]$  mpirun -n 2 --machinefile hosts hostname --

[OMPI devel] [LOG_CAT_ML] component basesmuma is not available but requested in hierarchy: basesmuma, basesmuma, ucx_p2p:basesmsocket, basesmuma, p2p

2022-11-07 Thread mrlong via devel
The execution of openmpi 5.0.0rc9 results in the following: (py3.9) [user@machine01 share]$  mpirun -n 2 python test.py [LOG_CAT_ML] component basesmuma is not available but requested in hierarchy: basesmuma,basesmuma,ucx_p2p:basesmsocket,basesmuma,p2p [LOG_CAT_ML] ml_discover_hierarchy exited