Hi Gus,

Yep, I can see the softlink is missing on the compute nodes.

Thanks!
Luis

On 19/07/2023 17:42, Gus Correa via users wrote:
If it is installed, libnuma should be in:
/usr/lib64/libnuma.so
as a softlink to the actual number-versioned library.
In general the loader is configured to search for shared libraries
in /usr/lib64 ("ldd <your_executable>" may shed some light here).
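
A minimal sketch of how one might check the softlink on a given node. The helper name check_lib is made up for illustration, and the path is simply the one mentioned above; adjust it if your distribution keeps the library elsewhere.

```shell
# Report whether a library path is a softlink, a regular file, or missing.
# check_lib is a hypothetical helper name, not part of any package.
check_lib() {
    lib_path="$1"
    if [ -L "$lib_path" ]; then
        # Softlink: show the file it finally resolves to.
        echo "softlink -> $(readlink -f "$lib_path")"
    elif [ -e "$lib_path" ]; then
        echo "regular file"
    else
        echo "missing"
    fi
}

# Path taken from the message above; run this on each compute node.
check_lib /usr/lib64/libnuma.so
```

If this prints "missing" on a compute node but not on the login node, the two probably have different package sets installed.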

You can check if the numa packages are installed with:
yum list | grep numa (CentOS 7, RHEL 7)
dnf list | grep numa (CentOS 8, RHEL 8, RockyLinux 8, Fedora, etc)
apt list | grep numa (Debian, Ubuntu)

If not, you can install them (or ask the system administrator to do it).
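
As far as I know, the unversioned libnuma.so softlink normally comes from the development package rather than the runtime one: numactl-devel on the RHEL family, libnuma-dev on Debian/Ubuntu. Here is a small sketch that only prints the likely install command and does not run it. The helper name suggest_install is hypothetical, and the package names are my assumption for your particular system.

```shell
# Print (do not run) the likely install command for the package that
# ships the unversioned libnuma.so softlink.
# suggest_install is a hypothetical helper; package names are assumptions
# based on the usual RHEL-family and Debian-family packaging.
suggest_install() {
    case "$1" in
        yum) echo "yum install numactl-devel" ;;
        dnf) echo "dnf install numactl-devel" ;;
        apt) echo "apt install libnuma-dev" ;;
        *)   echo "unknown package manager: $1" ;;
    esac
}

# Example: on a RockyLinux 8 node the package manager is dnf.
suggest_install dnf
```

The printed command needs root privileges, and it has to be run on every compute node where the softlink is missing.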

I hope this helps,
Gus Correa


On Wed, Jul 19, 2023 at 11:55 AM Jeff Squyres (jsquyres) via users <users@lists.open-mpi.org> wrote:

    It's not clear if that message is being emitted by Open MPI.

    It does say it's falling back to a different behavior if
    libnuma.so is not found, so it appears to be treating it as a
    warning, not an error.
    ------------------------------------------------------------------------
    *From:* users <users-boun...@lists.open-mpi.org> on behalf of Luis
    Cebamanos via users <users@lists.open-mpi.org>
    *Sent:* Wednesday, July 19, 2023 10:09 AM
    *To:* users@lists.open-mpi.org <users@lists.open-mpi.org>
    *Cc:* Luis Cebamanos <luic...@gmail.com>
    *Subject:* [OMPI users] libnuma.so error
    Hello,

    I was wondering if anyone has ever seen the following runtime error:

    mpirun -np 32 ./hello
    .....
    [LOG_CAT_SBGP] libnuma.so: cannot open shared object file: No such file or directory
    [LOG_CAT_SBGP] Failed to dlopen libnuma.so. Fallback to GROUP_BY_SOCKET manual.
    .....

    The funny thing is that the binary is executed despite the errors.
    What could be causing it?

    Regards,
    Luis
