Your message dated Thu, 03 Jan 2019 10:34:48 +0000
with message-id <[email protected]>
and subject line Bug#918031: fixed in pmix 3.1.0~rc2-2
has caused the Debian Bug report #918031,
regarding openmpi: mpi_init pmix error (gds_dstore.c line 1030)
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
918031: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=918031
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Source: openmpi
Version: 3.1.3-7
Severity: important
Control: affects -1 mpgrafic
Dear Maintainer,
SUMMARY: mpi_init pmix error on GNU/Hurd (gds_dstore.c line 1030)
DESCRIPTION: mpi_init gives a fatal pmix error on GNU/Hurd.
This occurred for a Debian automatic build for mpgrafic-0.3.16-1
on a machine with two processors, and on the Debian porter machine
exodar on a 1-processor machine for a minimal example (provided here).
MINIMAL EXAMPLE:
On exodar:
$ cat openmpi_hurd_bug.f90
program openmpi_hurd_bug
call MPI_INIT(ierr)
call MPI_FINALIZE(ierr)
end program openmpi_hurd_bug
$ mpifort --show
gfortran -I/usr/lib/i386-gnu/openmpi/include -pthread
-I/usr/lib/i386-gnu/openmpi/lib -Wl,--enable-new-dtags
-L/usr/lib/i386-gnu/openmpi/lib -lmpi_usempif08 -lmpi_usempi_ignore_tkr
-lmpi_mpifh -lmpi
$ mpifort openmpi_hurd_bug # compiles with no warnings or errors.
$ mpirun -n 1 a.out
[exodar:00753] PMIX ERROR: INIT in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 1030
[exodar:00753] PMIX ERROR: ERROR in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 2863
[exodar:00753] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp_component.c at line 1423
[exodar:00755] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp.c at line 790
[exodar:00755] OPAL ERROR: Unreachable in file ext2x_client.c at line 109
*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
*** and potentially your MPI job)
[exodar:00755] Local abort before MPI_INIT completed completed successfully,
but am not able to aggregate error messages, and not able to guarantee that all
other processes were killed!
--------------------------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
C example on exodar:
$ cat openmpi_hurd_bug_C.c
#include <mpi.h>
int main(int argc, char **argv){
MPI_Init(&argc,&argv);
MPI_Finalize();
return 0;
}
$ mpicc --show
gcc -I/usr/lib/i386-gnu/openmpi/include/openmpi
-I/usr/lib/i386-gnu/openmpi/include -pthread -L/usr/lib/i386-gnu/openmpi/lib
-lmpi
$ mpicc openmpi_hurd_bug_C.c # no warnings or errors
$ mpirun -n 1 ./a.out
[exodar:00962] PMIX ERROR: INIT in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 1030
[exodar:00962] PMIX ERROR: ERROR in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 2863
[exodar:00962] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp_component.c at line 1423
[exodar:00964] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp.c at line 790
[exodar:00964] OPAL ERROR: Unreachable in file ext2x_client.c at line 109
*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
*** and potentially your MPI job)
[exodar:00964] Local abort before MPI_INIT completed completed successfully,
but am not able to aggregate error messages, and not able to guarantee that all
other processes were killed!
--------------------------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
CONTEXT of minimal example:
This is on an schroot running sid on exodar:
https://db.debian.org/machines.cgi?host=exodar
$ sessionid=$(schroot -b -c sid)
$ dd-schroot-cmd -c ${sessionid} apt-get update
$ dd-schroot-cmd -c ${sessionid} apt-get upgrade
$ dd-schroot-cmd -c $sessionid apt-get install mpifort mpi-default-dev mpi-default-bin gfortran
$ schroot -e -c ${sessionid}
$ uname -a
GNU exodar 0.9 GNU-Mach 1.8+git20181103-486-dbg/Hurd-0.9 i686-AT386 GNU
$ cat /proc/hostinfo
Basic info:
max_cpus = 1 /* max number of cpus possible */
avail_cpus = 1 /* number of cpus now available */
memory_size = 3221151744 /* size of memory in bytes */
cpu_type = 19 /* cpu type */
cpu_subtype = 1 /* cpu subtype */
$ dpkg -l |egrep "openmpi|pmix|gfortran|gcc|mpifort"
ii gcc 4:8.2.0-2 hurd-i386 GNU
C compiler
ii gcc-8 8.2.0-13 hurd-i386 GNU
C compiler
ii gcc-8-base:hurd-i386 8.2.0-13 hurd-i386
GCC, the GNU Compiler Collection (base package)
ii gfortran 4:8.2.0-2 hurd-i386 GNU
Fortran 95 compiler
ii gfortran-8 8.2.0-13 hurd-i386 GNU
Fortran compiler
ii libgcc-8-dev:hurd-i386 8.2.0-13 hurd-i386 GCC
support library (development files)
ii libgcc1:hurd-i386 1:8.2.0-13 hurd-i386 GCC
support library
ii libgfortran-8-dev:hurd-i386 8.2.0-13 hurd-i386
Runtime library for GNU Fortran applications (development files)
ii libgfortran5:hurd-i386 8.2.0-13 hurd-i386
Runtime library for GNU Fortran applications
ii libopenmpi-dev:hurd-i386 3.1.3-7 hurd-i386
high performance message passing library -- header files
ii libopenmpi3:hurd-i386 3.1.3-7 hurd-i386
high performance message passing library -- shared library
ii libpmix2:hurd-i386 3.0.2-2 hurd-i386
Process Management Interface (Exascale) library
ii openmpi-bin 3.1.3-7 hurd-i386
high performance message passing library -- binaries
ii openmpi-common 3.1.3-7 all
high performance message passing library -- common files
REPRODUCIBILITY:
(1) Again on exodar, same context, compiled fortran file:
$ mpirun -n 1 --mca plm_rsh_agent /bin/false ./a.out
gives the same error messages, apart from the exodar prompt ([exodar:00808]...).
(2) This bug was originally detected on an mpgrafic build:
https://buildd.debian.org/status/fetch.php?pkg=mpgrafic&arch=hurd-i386&ver=0.3.16-1&stamp=1546275082&raw=0
678 This looks like a debian openmpi system.
679 [ironforge:13033] PMIX ERROR: INIT in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 1030
680 [ironforge:13033] PMIX ERROR: ERROR in file
../../../../../../src/mca/gds/ds12/gds_dstore.c at line 2863
681 [ironforge:13033] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp_component.c at line 1423
682 [ironforge:13035] PMIX ERROR: UNREACHABLE in file
../../../../../../src/mca/ptl/tcp/ptl_tcp.c at line 790
683 [ironforge:13035] OPAL ERROR: Unreachable in file ext2x_client.c at line
109
684 *** An error occurred in MPI_Init
685 *** on a NULL communicator
686 *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
687 *** and potentially your MPI job)
688 [ironforge:13035] Local abort before MPI_INIT completed completed
successfully, but am not able to aggregate error messages, and not able to
guarantee that all other processes were killed!
Cheers
Boud
--- End Message ---
--- Begin Message ---
Source: pmix
Source-Version: 3.1.0~rc2-2
We believe that the bug you reported is fixed in the latest version of
pmix, which is due to be installed in the Debian FTP archive.
A summary of the changes between this version and the previous one is
attached.
Thank you for reporting the bug, which will now be closed. If you
have further comments please address them to [email protected],
and the maintainer will reopen the bug report if appropriate.
Debian distribution maintenance software
pp.
Alastair McKinstry <[email protected]> (supplier of updated pmix package)
(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing [email protected])
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256
Format: 1.8
Date: Thu, 03 Jan 2019 10:09:06 +0000
Source: pmix
Binary: libpmix-dev libpmi-pmix-dev libpmi1-pmix libpmi2-pmix libpmix2
Architecture: source amd64
Version: 3.1.0~rc2-2
Distribution: unstable
Urgency: medium
Maintainer: Alastair McKinstry <[email protected]>
Changed-By: Alastair McKinstry <[email protected]>
Description:
libpmi-pmix-dev - Development files for the PMI library (OpenMPI )
libpmi1-pmix - OpenMPI implementation of the PMI v1 library
libpmi2-pmix - OpenMPI implementation of the PMI v2 library
libpmix-dev - Development files for the PMI Exascale library
libpmix2 - Process Management Interface (Exascale) library
Closes: 918031
Changes:
pmix (3.1.0~rc2-2) unstable; urgency=medium
.
* Standards-Version: 4.3.0
* Fix for handle setsockopt(SO_RCVTIMEO) not being supported.
Thanks to Samuel Thibault. Closes: #918031
Checksums-Sha1:
e375f8314a262e100a741718345511aa35964d06 2168 pmix_3.1.0~rc2-2.dsc
a3a209a22c5097f5a7a0d92b5739c9c6cf129ea5 7840 pmix_3.1.0~rc2-2.debian.tar.xz
75b4bc6bbba84ed0352b15a677b4dc0de8968824 466576
libpmi-pmix-dev_3.1.0~rc2-2_amd64.deb
e683408db3018b60714922d0696cfcf6b09e8ad2 1532548
libpmi1-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
53b77a640631da8ee617b5b3c8d8ab210857874f 280444
libpmi1-pmix_3.1.0~rc2-2_amd64.deb
1c9f13fb8042977f023aa5ea003a2ef00bc35d11 1536240
libpmi2-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
1170c049b50396b0b70bdde7a06978a935b13717 282880
libpmi2-pmix_3.1.0~rc2-2_amd64.deb
1feb69e29d2d8b4c683e102b3d48396c1d89f08f 534008
libpmix-dev_3.1.0~rc2-2_amd64.deb
63d03c30daffe50c025fa2fda90a66eb263374f0 2644456
libpmix2-dbgsym_3.1.0~rc2-2_amd64.deb
858c2908aba1111ee2e4c80a1121fc42f45e19d0 421424 libpmix2_3.1.0~rc2-2_amd64.deb
09eeb063531916a842d140d0c3ad7291e97fa4ac 7735 pmix_3.1.0~rc2-2_amd64.buildinfo
Checksums-Sha256:
e15cb00e09c92bdff0809cbd12b73a8ed445809a5a4d878dd563d3e469d4edef 2168
pmix_3.1.0~rc2-2.dsc
85639f797899ac93360089c4937a994929462d723d93fe453e39fbaf3579be72 7840
pmix_3.1.0~rc2-2.debian.tar.xz
22b275479302d18bf201537f90070604aaf70732ed34b7b3e84ad0f8b940204c 466576
libpmi-pmix-dev_3.1.0~rc2-2_amd64.deb
9e340fbfe68642a94045823c8905eb16e453abef24878954b58b621f74787930 1532548
libpmi1-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
f9b28de8acb9cc99abae77cd2029990874f560508c3ba3aeafd3eae2a576eed0 280444
libpmi1-pmix_3.1.0~rc2-2_amd64.deb
0b2b3f01b34eecc03ab21d0c53a5bf7ec97a0946df871314469c1c17982bf37d 1536240
libpmi2-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
81bccf8237cebac8238f27cf722bedf3ac88ca5675bff681f2164ced6ce5e6b0 282880
libpmi2-pmix_3.1.0~rc2-2_amd64.deb
dc5144a875c0318e1284044a85f248dd91cc2ca5e2b787f3369373a2781e150b 534008
libpmix-dev_3.1.0~rc2-2_amd64.deb
777b26ea52f7e048971f69eda23136a12823f755c99e07f705a21092de72c763 2644456
libpmix2-dbgsym_3.1.0~rc2-2_amd64.deb
985e786090474959699756d13214308f4460943f222891cae839fe8ede4355a6 421424
libpmix2_3.1.0~rc2-2_amd64.deb
c548543673e17392e1e5f1ed1222fda7e657d011e6a282d67c1e9fe90d7bdfc5 7735
pmix_3.1.0~rc2-2_amd64.buildinfo
Files:
cfec9ef10e4930eca908ad95b002a532 2168 net optional pmix_3.1.0~rc2-2.dsc
701e304838c9278a1ecc6205fdcb8e0e 7840 net optional
pmix_3.1.0~rc2-2.debian.tar.xz
45251a71eae90b83caf5d62ee4bae099 466576 libdevel optional
libpmi-pmix-dev_3.1.0~rc2-2_amd64.deb
82bd74b988b34c7a98e80af62f73d03e 1532548 debug optional
libpmi1-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
7e7caa9732398565576ba296164d670a 280444 libs optional
libpmi1-pmix_3.1.0~rc2-2_amd64.deb
e96933cd99170ac1afc4e3c2386548b0 1536240 debug optional
libpmi2-pmix-dbgsym_3.1.0~rc2-2_amd64.deb
3b5a4eafd6cbcd07c9bac788a5f4a19d 282880 libs optional
libpmi2-pmix_3.1.0~rc2-2_amd64.deb
ee29aacec22ce4ede4c35a1b74d6e465 534008 libdevel optional
libpmix-dev_3.1.0~rc2-2_amd64.deb
db0ce689667aa36fcffa3ed4558a2372 2644456 debug optional
libpmix2-dbgsym_3.1.0~rc2-2_amd64.deb
677521b08937d2bc8ce7b1695a643908 421424 libs optional
libpmix2_3.1.0~rc2-2_amd64.deb
3f2e6a8b335d0b92bff684f72b7645fc 7735 net optional
pmix_3.1.0~rc2-2_amd64.buildinfo
-----BEGIN PGP SIGNATURE-----
iQIzBAEBCAAdFiEEgjg86RZbNHx4cIGiy+a7Tl2a06UFAlwt4SQACgkQy+a7Tl2a
06WeZg//eu6tjTulbqLi3niqbDxx/XoX+2jv+fBtXQ1jX9yDAEc6ADDEvJstus6H
xNMGZR8+jYsM5xIdLtrxGQrMO2gWyMH1Bjc0dTVAl8Kt9Lemh0geB+T2WfcKMrzM
sxKkcJshNBps13BR9n5UAdfLAM4eenCv63UEbwyGKMt6HEkwXvX7FMnqsAhA9Sb8
miNYakK73Nlz7+P3HTAz4cAfaSQm/aam46vh0bl1odHO7qcFMuEhRMoBGYJ4YpmO
U7pG9Xbtap9sde9Pr/lHeAuTp52hi0CRSkc3E6ejmyVVrw0Og+Sii6+d/K6VdGbx
ZGh5KA2K2qL9GqmJqxUHMt6J17kmOJRRWaZYjA0Ssdf/UxbIywjEShGL7XuXDxI9
5U4Ci//7BczqJ88mI94EYKYyUMkJSHiv67udA7QFEHIY9JvVY68itfxXSeJb2Vsv
MPIMiuKOQFP4h7kscwfUrbV+hYLVkJY2cYc5SadCDXdzeWuQlXRfpazj7hAamRbI
QBcZiY51tDgHBiIXpZTRBvnoYcQEpZt32Dl/AzDnKyCJnUGJpaCB2nbEAIDEI6z9
HCFavDvjUWymtMLOoDXEbuZtqL+VJijKOYg7xPD6nr/zlLmRgV0NS9xwk7VTZqRu
wi1E09SXAflWUwYevwEureD4leBIQFZwIDvnCu9imbuLNzzth8E=
=+/Np
-----END PGP SIGNATURE-----
--- End Message ---