Your message dated Thu, 02 Sep 2021 13:50:02 +0000
with message-id <[email protected]>
and subject line Bug#979041: fixed in openmpi 4.1.1-4
has caused the Debian Bug report #979041,
regarding libopenmpi3: aborts python code due to libfabric fork() issues
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)


-- 
979041: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=979041
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: libopenmpi3
Version: 4.1.0-3
Severity: serious

The revert of pmix has fixed some issues, but Python packages still show
autopkgtest regressions in dolfin[1], gpaw[2], gyoto[3] and mshr[4].
The error is always like this:

|A process has executed an operation involving a call
|to the fork() system call to create a child process.
|
|As a result, the libfabric EFA provider is operating in
|a condition that could result in memory corruption or
|other system errors.
|
|For the libfabric EFA provider to work safely when fork()
|is called, you will need to set the following environment
|variable:
|          RDMAV_FORK_SAFE
|
|However, setting this environment variable can result in
|signficant performance impact to your application due to
|increased cost of memory registration.
|
|You may want to check with your application vendor to see
|if an application-level alternative (of not using fork)
|exists.
|
|Your job will now abort.

If I export RDMAV_FORK_SAFE=1, the tests run fine, but (i) it seems
something in OpenMPI has changed so that those programs no longer run
without it, and (ii) the warning about the performance impact should
be taken seriously.
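
For reference, the workaround described above can be sketched in Python
so that the variable is set only for the test process and not for the
whole session. This is a minimal sketch, not what the autopkgtests
actually do; the helper names fork_safe_env and run_fork_safe are made
up for illustration.

```python
import os
import subprocess


def fork_safe_env(base=None):
    """Return a copy of the environment with RDMAV_FORK_SAFE=1 added.

    The helper name is hypothetical; only the variable name comes from
    the libfabric message quoted in this report.
    """
    env = dict(os.environ if base is None else base)
    env["RDMAV_FORK_SAFE"] = "1"
    return env


def run_fork_safe(cmd):
    """Run cmd with RDMAV_FORK_SAFE=1 set in the child process only."""
    # check=False: let the caller inspect the return code of the test run.
    return subprocess.run(cmd, env=fork_safe_env(), check=False)
```

A call such as run_fork_safe(["python3", "demo.py"]) (demo.py being a
placeholder) leaves the caller's own environment untouched, which keeps
the memory-registration cost mentioned in the warning limited to that
one run.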

Also note that these errors seem to happen only on amd64/i386; the ARM
ports run fine, maybe because of missing libfabric-related
features/packages?


Michael

[1] https://ci.debian.net/data/autopkgtest/testing/amd64/d/dolfin/9184050/log.gz
[2] https://ci.debian.net/data/autopkgtest/testing/amd64/g/gpaw/9302177/log.gz
[3] https://ci.debian.net/data/autopkgtest/testing/amd64/g/gyoto/9303088/log.gz
[4] https://ci.debian.net/data/autopkgtest/testing/amd64/m/mshr/9300183/log.gz

--- End Message ---
--- Begin Message ---
Source: openmpi
Source-Version: 4.1.1-4
Done: Alastair McKinstry <[email protected]>

We believe that the bug you reported is fixed in the latest version of
openmpi, which is due to be installed in the Debian FTP archive.

A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to [email protected],
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Alastair McKinstry <[email protected]> (supplier of updated openmpi package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing [email protected])


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

Format: 1.8
Date: Thu, 02 Sep 2021 13:31:02 +0100
Source: openmpi
Architecture: source
Version: 4.1.1-4
Distribution: unstable
Urgency: medium
Maintainer: Alastair McKinstry <[email protected]>
Changed-By: Alastair McKinstry <[email protected]>
Closes: 979041
Changes:
 openmpi (4.1.1-4) unstable; urgency=medium
 .
   * Don't ship libopen-orted-mpir.so. Patch from upstream for 4.1.2
     internal lib only.
   * Also exclude mtl ofi to deal with libfabric fork in tests.
     Closes: #979041
Checksums-Sha1:
 9d57f09686a0e63f30ddace5319af34361fc405c 2670 openmpi_4.1.1-4.dsc
 7d8a3153513b53b89d037e2140d00ec731ccb43b 68228 openmpi_4.1.1-4.debian.tar.xz
Checksums-Sha256:
 7493a2e12cb4f336e3172358bd6bae9e61871216214f97e25366a21d4d66c21c 2670 openmpi_4.1.1-4.dsc
 c7fe79181f432a7eac47cd0cd0ab7662883b755f1abd18d6649c296151de81c8 68228 openmpi_4.1.1-4.debian.tar.xz
Files:
 020b26d6112e653db3a184ed99278620 2670 net optional openmpi_4.1.1-4.dsc
 a7e96cb6f1ff140d007afef518b397e5 68228 net optional openmpi_4.1.1-4.debian.tar.xz

-----BEGIN PGP SIGNATURE-----

iQIzBAEBCAAdFiEEgjg86RZbNHx4cIGiy+a7Tl2a06UFAmEw06MACgkQy+a7Tl2a
06WNZQ/9F5HvKIhQC6Z2i+QY9y58j+LN8bWXQhObeTA3nVvKnPRXOfcF6p7OV9/5
x3yGXKZ/Y20JVmTeT5ezgsIJ/z0dQWy3t4YUYgdye0Po+0RSWOAbli83a2cXtTgl
9cmmu9xIHjs76VkfnpYWrtVZ19Z8ahg8u/al8e2s5U2k92/D3eSxA79T/Jw8nbX8
2EOaUDyFBUAF4DRycVzG54m2WFqZ3xHC/+HQzfsUxSYoscy9mVvpPnqfpQJfsx3y
S/Y1D+wOnkDVwDSKYz9gZg13089N43d7Qu9Jd9xVymnvPLvKV0cxQSC/1fV5xvF+
mtSlSo5+2ogotXYMjn4nGZb7Uwp3lbiXhNOTVvbuW1EgpgSwiH/KqDSBhkKJW0MD
7nFMlVYUiGRKtxfWfUE6hXSVW/VxpYalUz7CWY+n3jph0QJR3OqzcuYHlboiNCYH
/uXN0R5GwQWgdBBJQe8ybOyK2x5kRYkkEBvtSOiRlMtzGkdVmqoi1WLiTVHPnkcf
FwqR6NtnHWbwrfuroXhOIAtK9ynNxW6CXtP2qK9xIwil6JsBVXadvYGfV8+rmpvz
Nk/x4GSEjNE1zkv582/TjHMmw3I886hehn76N0V8R/cT9XnT4ajmGOwKwkIo1HAk
PBcwG3W+IiBXwdwYGO24wuG9cDddaPNyAW80Od4dc4/cc/T5nW0=
=njSD
-----END PGP SIGNATURE-----

--- End Message ---
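
The changelog entry "exclude mtl ofi" refers to Open MPI's MCA component
selection. On a system that does not yet have the fixed package, a
similar effect can probably be had per process through the
OMPI_MCA_mtl environment variable, where a leading "^" excludes the
named component. The sketch below assumes that mechanism; it is not
taken from the package, which may well apply the exclusion through its
own configuration instead, and the helper name exclude_mtl_ofi is
hypothetical.

```python
import os


def exclude_mtl_ofi(base=None):
    """Return a copy of the environment that asks Open MPI to skip the ofi MTL.

    Assumption: Open MPI reads MCA parameters from OMPI_MCA_<name>
    variables, and "^ofi" means "every mtl component except ofi".
    Any existing OMPI_MCA_mtl value is overwritten.
    """
    env = dict(os.environ if base is None else base)
    env["OMPI_MCA_mtl"] = "^ofi"
    return env
```

The returned dictionary can be passed as env= to subprocess.run when
launching an MPI test, so the exclusion applies to that run only.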
