Re: [OMPI users] openmpi5.0.7 with Intel2021 can not compile simple MPI program, error #6633

2025-05-28 Thread 'Jeff Squyres (jsquyres)' via Open MPI users
Does that mean you've fixed the environment module, and now everything is working as expected / your program compiles properly with Open MPI v5.0.x? From: Miroslav Iliaš Sent: Wednesday, May 28, 2025 4:22 PM To: Open MPI users Cc: Jeff Squyres (jsq

Re: [OMPI users] openmpi5.0.7 with Intel2021 can not compile simple MPI program, error #6633

2025-05-28 Thread 'Jeff Squyres (jsquyres)' via Open MPI users
-i8. IIRC / I'm pretty sure that neither of these things should have happened unless configure was told to build Open MPI this way, though. From: users@lists.open-mpi.org on behalf of Miroslav Iliaš Sent: Wednesday, May 28, 2025 2:39 PM To: Open MPI use

Re: [OMPI users] openmpi5.0.7 with Intel2021 can not compile simple MPI program, error #6633

2025-05-28 Thread 'Jeff Squyres (jsquyres)' via Open MPI users
I am not a Fortran expert, but you did not declare ierr to be a specific type (e.g., integer). If you declare it properly, does the problem go away? From: users@lists.open-mpi.org on behalf of Miroslav Iliaš Sent: Wednesday, May 28, 2025 11:22 AM To: Open MPI u

Re: [OMPI users] Open MPI license submitted for Open Source Intitative approval

2025-05-21 Thread Jeff Squyres
t 7:06 PM, McCoy Smith wrote: > > FWIW if there is any objection to me doing this from Open MPI, let me know as > I'd want to let OSI know. I'm assuming you wouldn't but don't know for sure. > > On 5/21/2025 2:34 PM, Jeff Squyres wrote: >> Wow, cool! Many

Re: [OMPI users] Open MPI license submitted for Open Source Intitative approval

2025-05-21 Thread Jeff Squyres
e > (which is over 20 years old, AFAIK), but if you have opinions on this > submission, you may want to provide them to the OSI mailing list. > > McCoy Smith > Lex Pan Law LLC > > To unsubscribe from this group and stop receiving emails from it, send an > email to users+unsubscr.

Re: [OMPI users] OpenMPI-5.0.7 libopen-rte.so.0 missing

2025-04-10 Thread 'Jeff Squyres (jsquyres)' via Open MPI users
Gilles raises a good point: if you think you're using Open MPI v5.0.x, but you're somehow not finding a library that is from a prior version of Open MPI, then you may be inadvertently mixing using multiple different versions of Open MPI. Make sure that your environment (e.g., your PATH and LD_L

Re: [OMPI users] OpenMPI-5.0.5 & 5.0.6 build failure: error: expected expression before ‘struct’

2025-03-31 Thread Jeff Squyres
OpenMPI-5.0.7 surpasses this error, but it is not able build cuda [GPU >>>>> DIRECT] & ofi support: >>>>> >>>>> Gromacs applications complains that it is not able to detect Cuda Aware >>>>> MPI: >>>>> >>>>

Re: [OMPI users] Disable PMPI bindings?

2025-02-15 Thread Jeff Squyres
an standards than > gfortran or ifort. Is it possible to disable PMPI bindings at compile > time so they don't appear at all? I am also happy to triage the bug, > since I have a Power10 system and the Open XLF compiler. > > -JPS > > To unsubscribe from this group and sto

Re: [OMPI users] prterun: symbol lookup error: /usr/lib/libprrte.so.3: undefined symbol: PMIx_Session_control

2024-08-15 Thread Jeff Squyres (jsquyres) via users
This isn't enough information to provide a definitive answer. Can you provide more information about your setup, how you built and installed Open MPI, ... etc.? In general, the error message is the standard Linux error message when a symbol is unable to be found at run time. In particular, mp

Re: [OMPI users] Fwd: Unable to run basic mpirun command (OpenMPI v5.0.3)

2024-05-05 Thread Jeff Squyres (jsquyres) via users
.open-mpi.org/en/v5.0.x/launching-apps/ssh.html#finding-open-mpi-executables-and-libraries. From: T Brouns Sent: Sunday, May 5, 2024 4:37 PM To: users@lists.open-mpi.org Cc: Jeff Squyres (jsquyres) ; hear...@gmail.com Subject: Re: [OMPI users] Fwd: Unable to run

Re: [OMPI users] Fwd: Unable to run basic mpirun command (OpenMPI v5.0.3)

2024-05-04 Thread Jeff Squyres (jsquyres) via users
, your could prefix your LD_LIBRARY_PATH​ environment variable with the libdir from the Open MPI installation you just created. From: T Brouns Sent: Saturday, May 4, 2024 10:56 AM To: Jeff Squyres (jsquyres) ; users@lists.open-mpi.org Subject: Re: [OMPI users

Re: [OMPI users] Fwd: Unable to run basic mpirun command (OpenMPI v5.0.3)

2024-05-03 Thread Jeff Squyres (jsquyres) via users
Your config.log file shows that you are trying to build Open MPI 2.1.6 and that configure failed. I'm not sure how to square this with the information that you provided in your message... did you upload the wrong config.log? Can you provide all the information from https://docs.open-mpi.org/en

Re: [OMPI users] [EXTERNAL] Help deciphering error message

2024-03-08 Thread Jeff Squyres (jsquyres) via users
(sorry this is so long – it's a bunch of explanations followed by 2 suggestions at the bottom) One additional thing worth mentioning is that your mpirun command line does not seem to explicitly be asking for the "ucx" PML component, but the error message you're getting indicates that you specif

Re: [OMPI users] Seg error when using v5.0.1

2024-01-31 Thread Jeff Squyres (jsquyres) via users
No worries – glad you figured it out! From: users on behalf of afernandez via users Sent: Wednesday, January 31, 2024 10:56 AM To: Open MPI Users Cc: afernandez Subject: Re: [OMPI users] Seg error when using v5.0.1 Hello, I'm sorry as I totally messed up here.

Re: [OMPI users] MPI Wireshark Packet Dissector

2023-12-11 Thread Jeff Squyres (jsquyres) via users
Cool! I dimly remember this project; it was written independently of the main Open MPI project. It looks like it supports the TCP OOB and TCP BTL. The TCP OOB has since moved from Open MPI's "ORTE" sub-project to the independent PRRTE project. Regardless, TCP OOB traffic is effectively about

Re: [OMPI users] OpenMPI 5.0.0 & Intel OneAPI 2023.2.0 on MacOS 14.0:

2023-11-06 Thread Jeff Squyres (jsquyres) via users
We develop and build with clang on macOS frequently; it would be surprising if it didn't work. That being said, I was able to replicate both errors report here. One macOS Sonoma with XCode 15.x and the OneAPI compilers: * configure fails in the PMIx libevent section, complaining about how

[OMPI users] Open MPI BOF at SC'23

2023-11-06 Thread Jeff Squyres (jsquyres) via users
available after the presentation as well, but chances are: if you have a question, others have the same question. So submit your question to us<https://docs.google.com/forms/d/e/1FAIpQLSefccrJaKOjkEDLroO3Fq4fvn7o8v6N5WNSIaQ9VbSY16x_Rw/viewform> so that we can include them in the present

Re: [OMPI users] OpenMPI 5.0.0 & Intel OneAPI 2023.2.0 on MacOS 14.0:

2023-10-30 Thread Jeff Squyres (jsquyres) via users
Volker -- If that doesn't work, send all the information requested here: https://docs.open-mpi.org/en/v5.0.x/getting-help.html From: users on behalf of Volker Blum via users Sent: Saturday, October 28, 2023 8:47 PM To: Matt Thompson Cc: Volker Blum ; Open MPI

Re: [OMPI users] MPI4Py Only Using Rank 0

2023-10-25 Thread Jeff Squyres (jsquyres) via users
and/or use it. From: caitlin lamirez Sent: Wednesday, October 25, 2023 1:17 PM To: Jeff Squyres (jsquyres) Subject: Re: [OMPI users] MPI4Py Only Using Rank 0 Hi Jeff, After getting that error, I did reinstall MPI4py using conda remove mpi4py and conda install m

Re: [OMPI users] MPI4Py Only Using Rank 0

2023-10-25 Thread Jeff Squyres (jsquyres) via users
This usually​ means that you have accidentally switched to using a different MPI implementation under the covers somehow. E.g., did you somehow accidentally start using mpiexec from MPICH instead of Open MPI? Or did MPI4Py somehow get upgraded or otherwise re-build itself for MPICH, but you're

Re: [OMPI users] Binding to thread 0

2023-09-08 Thread Jeff Squyres (jsquyres) via users
In addition to what Gilles mentioned, I'm curious: is there a reason you have hardware threads enabled? You could disable them in the BIOS, and then each of your MPI processes can use the full core, not just a single hardware thread. From: users on behalf of Lui

Re: [OMPI users] Segmentation fault

2023-08-09 Thread Jeff Squyres (jsquyres) via users
simple MPI application that replicates the issue? That would be something we could dig into and investigate. From: Aziz Ogutlu Sent: Wednesday, August 9, 2023 10:31 AM To: Jeff Squyres (jsquyres) ; Open MPI Users Subject: Re: [OMPI users] Segmentation fault Hi J

Re: [OMPI users] Segmentation fault

2023-08-09 Thread Jeff Squyres (jsquyres) via users
ation. From: Aziz Ogutlu Sent: Wednesday, August 9, 2023 10:08 AM To: Jeff Squyres (jsquyres) ; Open MPI Users Subject: Re: [OMPI users] Segmentation fault Hi Jeff, I also tried with OpenMPI 4.1.5, I got same error. On 8/9/23 17:05, Jeff Squyres (jsquyres) wrote: I'

Re: [OMPI users] Segmentation fault

2023-08-09 Thread Jeff Squyres (jsquyres) via users
I'm afraid I don't know anything about the SU2 application. You are using Open MPI v4.0.3, which is fairly old. Many bug fixes have been released since that version. Can you upgrade to the latest version of Open MPI (v4.1.5)? From: users on behalf of Aziz Ogut

Re: [OMPI users] [EXT] Re: Error handling

2023-07-19 Thread Jeff Squyres (jsquyres) via users
MPI_Allreduce should work just fine, even with negative numbers. If you are seeing something different, can you provide a small reproducer program that shows the problem? We can dig deeper into if if we can reproduce the problem. mpirun's exit status can't distinguish between MPI processes who

Re: [OMPI users] libnuma.so error

2023-07-19 Thread Jeff Squyres (jsquyres) via users
It's not clear if that message is being emitted by Open MPI. It does say it's falling back to a different behavior if libnuma.so is not found, so it appears if it's treating it as a warning, not an error. From: users on behalf of Luis Cebamanos via users Sent:

Re: [OMPI users] Error build Open MPI 4.1.5 with GCC 11.3

2023-07-18 Thread Jeff Squyres (jsquyres) via users
n Sent: Tuesday, July 18, 2023 12:51 PM To: Jeff Squyres (jsquyres) Cc: Open MPI Users Subject: Re: [OMPI users] Error build Open MPI 4.1.5 with GCC 11.3 As soon as you pointed out /usr/lib/gcc/x86_64-linux-gnu/9/include/float.h that made me think of the previous build. I did "make clean"

Re: [OMPI users] Error build Open MPI 4.1.5 with GCC 11.3

2023-07-18 Thread Jeff Squyres (jsquyres) via users
if you had run make clean and then re-ran configure, it probably would have built ok. But deleting the whole source tree and re-configuring + re-building also works. 🙂 From: Jeffrey Layton Sent: Tuesday, July 18, 2023 11:38 AM To: Jeff Squyres (jsquyres) Cc: Ope

Re: [OMPI users] Error build Open MPI 4.1.5 with GCC 11.3

2023-07-17 Thread Jeff Squyres (jsquyres) via users
That's a little odd. Usually, the specific .h files that are listed as dependencies came from somewhere​ -- usually either part of the GNU Autotools dependency analysis. I'm guessing that /usr/lib/gcc/x86_64-linux-gnu/9/include/float.h doesn't actually exist on your system -- but then how did

Re: [OMPI users] OMPI compilation error in Making all datatypes

2023-07-12 Thread Jeff Squyres (jsquyres) via users
George Bosilca Sent: Wednesday, July 12, 2023 2:26 PM To: Open MPI Users Cc: Jeff Squyres (jsquyres) ; Elad Cohen Subject: Re: [OMPI users] OMPI compilation error in Making all datatypes I can't replicate this on my setting, but I am not using the tar archive from the OMPI website (

Re: [OMPI users] OMPI compilation error in Making all datatypes

2023-07-12 Thread Jeff Squyres (jsquyres) via users
The output you sent (in the attached tarball) in doesn't really make much sense: libtool: link: ar cru .libs/libdatatype_reliable.a .libs/libdatatype_reliable_la-opal_datatype_pack.o .libs/libdatatype_reliable_la-opal_datatype_unpack.o libtool: link: ranlib .libs/libdatatype_reliable.a ranlib

Re: [OMPI users] Issue with Running MPI Job on CentOS 7

2023-06-14 Thread Jeff Squyres (jsquyres) via users
ications in the "examples" directory. From: 深空探测 Sent: Tuesday, June 13, 2023 8:59 PM To: Open MPI Users Cc: John Hearns ; Jeff Squyres (jsquyres) ; gilles.gouaillar...@gmail.com ; t...@pasteur.fr Subject: Re: [OMPI users] Issue with Running MPI Job on

Re: [OMPI users] Issue with Running MPI Job on CentOS 7

2023-06-12 Thread Jeff Squyres (jsquyres) via users
Your steps are generally correct, but I cannot speak for whether your /home/wude/.bashrc file is executed for both non-interactive and interactive logins. If /home/wude is your $HOME, it probably is, but I don't know about your specific system. Also, you should be aware that MPI applications b

Re: [OMPI users] What is the best choice of pml and btl for intranode communication

2023-03-06 Thread Jeff Squyres (jsquyres) via users
the UCX PML is selected make my above comment moot. Sorry for any confusion! From: users on behalf of Jeff Squyres (jsquyres) via users Sent: Monday, March 6, 2023 10:40 AM To: Chandran, Arun ; Open MPI Users Cc: Jeff Squyres (jsquyres) Subject: Re: [OMPI use

Re: [OMPI users] What is the best choice of pml and btl for intranode communication

2023-03-06 Thread Jeff Squyres (jsquyres) via users
rfaces at all? (this is an open question to George / the UCX team) From: Chandran, Arun Sent: Monday, March 6, 2023 10:31 AM To: Jeff Squyres (jsquyres) ; Open MPI Users Subject: RE: [OMPI users] What is the best choice of pml and btl for intranode communication

Re: [OMPI users] What is the best choice of pml and btl for intranode communication

2023-03-06 Thread Jeff Squyres (jsquyres) via users
If this run was on a single node, then UCX probably disabled itself since it wouldn't be using InfiniBand or RoCE to communicate between peers. Also, I'm not sure your command line was correct: perf_benchmark $ mpirun -np 32 --map-by core --bind-to core ./perf --mca pml ucx You probably need

Re: [OMPI users] Compile options to disable Infiniband

2022-12-12 Thread Jeff Squyres (jsquyres) via users
Open MPI's libraries and from what's available in $prefix/lib/openmpi [the latter is the default]). If ompi_info doesn't show any output with "ucx", "openib", and/or "psm", then your Open MPI does not contain any IB support. -- Jeff Squyres jsquy...@

Re: [OMPI users] mpi program gets stuck

2022-12-07 Thread Jeff Squyres (jsquyres) via users
continuing to investigate. If it turns into a problem with Open MPI, we'll report back here. -- Jeff Squyres jsquy...@cisco.com From: Jeff Squyres (jsquyres) Sent: Wednesday, November 30, 2022 7:42 AM To: timesir ; Open MPI Users Subject: Re: mpi program gets stuc

Re: [OMPI users] Can't run an MPI program through mpirun command

2022-12-04 Thread Jeff Squyres (jsquyres) via users
Can you try steps 1-3 in https://docs.open-mpi.org/en/v5.0.x/validate.html#testing-your-open-mpi-installation ? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Blaze Kort via users Sent: Saturday, December 3, 2022 5:52 AM To: users@lists.open

Re: [OMPI users] mpi program gets stuck

2022-12-01 Thread Jeff Squyres (jsquyres) via users
--mca plm_base_verbose 100 --mca rmaps_base_verbose 100 --mca ras_base_verbose 100 --prtemca grpcomm_base_verbose 5 --prtemca state_base_verbose 5 ./ring_c And please send the output back here to the list. -- Jeff Squyres jsquy...@cisco.com From: timesir Sent: Tu

Re: [OMPI users] mpi program gets stuck

2022-11-29 Thread Jeff Squyres (jsquyres) via users
tead of a Python MPI program? That would just eliminate a few more variables from the troubleshooting process. In the "examples" directory in the tarball I provided are trivial "hello world" and "ring" MPI programs. A "make" should build them all. Try runn

Re: [OMPI users] CephFS and striping_factor

2022-11-29 Thread Jeff Squyres (jsquyres) via users
More specifically, Gilles created a skeleton "ceph" component in this draft pull request: https://github.com/open-mpi/ompi/pull/11122 If anyone has any cycles to work on it and develop it beyond the skeleton that is currently there, that would be great! -- Jeff Squyres jsquy...

Re: [OMPI users] Question about "mca" parameters

2022-11-29 Thread Jeff Squyres (jsquyres) via users
ironment. You might want to investigate the suggestion from the help message to set the memlock limits correctly, and see if using the qib0 interfaces would yield better performance. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Gilles Gouaillardet vi

Re: [OMPI users] users Digest, Vol 4818, Issue 1

2022-11-25 Thread Jeff Squyres (jsquyres) via users
Ok, this is a good / consistent output. That being said, I don't grok what is happening here: it says it finds 2 slots, but then it tells you it doesn't have enough slots. Let me dig deeper and get back to you... -- Jeff Squyres jsquy...@cisco.com ___

Re: [OMPI users] users Digest, Vol 4818, Issue 1

2022-11-25 Thread Jeff Squyres (jsquyres) via users
such as an "em dash", or somesuch. -- Jeff Squyres jsquy...@cisco.com From: timesir Sent: Friday, November 18, 2022 8:59 AM To: Jeff Squyres (jsquyres) ; users@lists.open-mpi.org ; gilles.gouaillar...@gmail.com Subject: Re: users Digest, Vol 4818,

Re: [OMPI users] users Digest, Vol 4818, Issue 1

2022-11-25 Thread Jeff Squyres (jsquyres) via users
"hostfile" component altogether. How did you install Open MPI? Can you send the information from "Run time problems" on https://docs.open-mpi.org/en/v5.0.x/getting-help.html#for-run-time-problems ? -- Jeff Squyres jsquy...@cisco.com From: time

Re: [OMPI users] users Digest, Vol 4818, Issue 1

2022-11-25 Thread Jeff Squyres (jsquyres) via users
I see 2 config.log files -- can you also send the other information requested on that page? I.e, the version you're using (I think​ you said in a prior email that it was 5.0rc9, but I'm not 100% sure), and the output from ompi_info --all. -- Jeff Squyres jsquy...

Re: [OMPI users] Tracing of openmpi internal functions

2022-11-14 Thread Jeff Squyres (jsquyres) via users
the way of detailed internal function call tracing inside Open MPI itself, due to performance considerations. You might want to look into flamegraphs, or something similar...? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of arun c via users Sent

Re: [OMPI users] [OMPI devel] There are not enough slots available in the system to satisfy the 2, slots that were requested by the application

2022-11-14 Thread Jeff Squyres (jsquyres) via users
Yes, somehow I'm not seeing all the output that I expect to see. Can you ensure that if you're copy-and-pasting from the email, that it's actually using "dash dash" in front of "mca" and "machinefile" (vs. a copy-and-pasted &qu

Re: [OMPI users] [OMPI devel] There are not enough slots available in the system to satisfy the 2, slots that were requested by the application

2022-11-13 Thread Jeff Squyres (jsquyres) via users
I should have asked you to run with 2​ variables last time -- can you re-run with "mpirun --mca rmaps_base_verbose 100 --mca ras_base_verbose 100 ..."? Turning on the RAS verbosity should show us what the hostfile component is doing. -- Jeff Squyres jsquy...@cisco.com

Re: [OMPI users] --mca btl_base_verbose 30 not working in version 5.0

2022-11-07 Thread Jeff Squyres (jsquyres) via users
Sorry for the delay in replying. To tie up this thread for the web mail archives: this same question was cross-posted over in the devel list; I replied there. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of mrlong via users Sent: Sunday, October

Re: [OMPI users] [OMPI devel] There are not enough slots available in the system to satisfy the 2, slots that were requested by the application

2022-11-07 Thread Jeff Squyres (jsquyres) via users
" so that we can get some debugging output and see why the slots aren't working for you? Show the full output, like I did above (e.g., cat the hostfile, and then mpirun with the MCA param and all the output). Thanks! -- Jeff Squyres jsquy...@cisco.com F

Re: [OMPI users] [EXTERNAL] Beginner Troubleshooting OpenMPI Installation - pmi.h Error

2022-10-06 Thread Jeff Squyres (jsquyres) via users
uot; is in this file? If that's the case, then that's where Open MPI is getting these CLI arguments. -- Jeff Squyres jsquy...@cisco.com From: Jeffrey D. (JD) Tamucci Sent: Wednesday, October 5, 2022 5:16 PM To: Jeff Squyres (jsquyres) Cc: Open MPI Users ;

Re: [OMPI users] [EXTERNAL] Beginner Troubleshooting OpenMPI Installation - pmi.h Error

2022-10-05 Thread Jeff Squyres (jsquyres) via users
configure+build with just one of those two options, does it work? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Pritchard Jr., Howard via users Sent: Wednesday, October 5, 2022 11:47 AM To: Jeffrey D. (JD) Tamucci Cc: Pritchard Jr., Howard ; Open MPI

Re: [OMPI users] openmpi compile failure

2022-09-28 Thread Jeff Squyres (jsquyres) via users
-MD -MP -MF \ .deps/signal.Tpo -c \ ../../../../../../opal/mca/event/libevent2022/libevent/signal.c -fPIC \ -DPIC -E > signal-preprocessed.c -- Jeff Squyres jsquy...@cisco.com From: Zilore Mumba Sent: Wednesday, September 28, 2022 1:50 AM To: Jeff Squyres (jsquyres) Cc:

Re: [OMPI users] openmpi compile failure

2022-09-27 Thread Jeff Squyres (jsquyres) via users
&& ./foo NSIG is 65 You can see that NSIG is definitely defined for me. It's likely that until the above trivial program can compile properly, Open MPI won't compile properly, either. -- Jeff Squyres jsquy...@cisco.com From: Zilore Mumba Sent:

Re: [OMPI users] openmpi compile failure

2022-09-27 Thread Jeff Squyres (jsquyres) via users
Can you re-try with the latest Open MPI v4.1.x release (v4.1.4)? There have been many bug fixes since v4.1.0. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Zilore Mumba via users Sent: Tuesday, September 27, 2022 5:10 AM To: users@lists.open

Re: [OMPI users] --mca parameter explainer; mpirun WARNING: There was an error initializing an OpenFabrics device

2022-09-26 Thread Jeff Squyres (jsquyres) via users
Just to follow up for the email web archives: this issue was followed up in https://github.com/open-mpi/ompi/issues/10841. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Rob Kudyba via users Sent: Thursday, September 22, 2022 2:15 PM To: users

Re: [OMPI users] Hardware topology influence

2022-09-14 Thread Jeff Squyres (jsquyres) via users
VM x). This will allow the MPI processes to use shared memory for on-node communication. -- Jeff Squyres jsquy...@cisco.com From: Jeff Squyres (jsquyres) Sent: Tuesday, September 13, 2022 10:08 AM To: Open MPI Users Cc: Gilles Gouaillardet Subject: Re: [OMPI

Re: [OMPI users] Hardware topology influence

2022-09-13 Thread Jeff Squyres (jsquyres) via users
of HPC applications. It's a surprisingly complicated topic. In the v4.x series, note that you can use "mpirun --report-bindings ..." to see exactly where Open MPI thinks it has bound each process. Note that this binding occurs before each MPI process starts; it's nothing

Re: [OMPI users] Disabling barrier in MPI_Finalize

2022-09-09 Thread Jeff Squyres (jsquyres) via users
No, it does not, sorry. What are you trying to do? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Mccall, Kurt E. (MSFC-EV41) via users Sent: Friday, September 9, 2022 2:30 PM To: OpenMpi User List (users@lists.open-mpi.org) Cc: Mccall, Kurt E

Re: [OMPI users] MPI with RoCE

2022-09-06 Thread Jeff Squyres (jsquyres) via users
protocols for MPI message passing (and ignore the "normal" Ethernet interfaces). -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Harutyun Umrshatyan via users Sent: Tuesday, September 6, 2022 2:58 AM To: Open MPI Users Cc: Harutyun Umrsha

Re: [OMPI users] ucx problems

2022-08-31 Thread Jeff Squyres (jsquyres) via users
s MCA param defaults to: rc_verbs,ud_verbs,rc_mlx5,dc_mlx5,ud_mlx5,cuda_ipc,rocm_ipc (you'll need to ask the UCX community what each of those do/are) -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Bernstein, Noam CIV USN NRL (6393) Washington DC

Re: [OMPI users] Oldest version of SLURM in use?

2022-08-17 Thread Jeff Squyres (jsquyres) via users
Fair point. If there's anyone out there who's unwilling to reply publicly, please feel free to reply directly to me. Specifically: we want to know if Open MPI v5.0.0 stops supporting < SLURM 2017.11 is going to be a problem. -- Jeff Squyres jsquy

Re: [OMPI users] Oldest version of SLURM in use?

2022-08-17 Thread Jeff Squyres (jsquyres) via users
These are great data points! I'd love to hear from others, too. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Andrew Reid via users Sent: Tuesday, August 16, 2022 10:21 AM To: Open MPI Users Cc: Andrew Reid Subject: Re: [OMPI users] O

[OMPI users] Oldest version of SLURM in use?

2022-08-16 Thread Jeff Squyres (jsquyres) via users
e a lifetime of 2-4 years before they are refreshed, which fits nicely within that 5-year window. But in less well-funded institutions, HPC clusters could have lifetimes longer than 5 years. Do any of you run versions of SLURM that are more than 5 years old? -- Jeff Squyres jsquy...@cisco.com

Re: [OMPI users] RUNPATH vs. RPATH

2022-08-11 Thread Jeff Squyres (jsquyres) via users
Thanks for the feedback! I made a follow-up PR https://github.com/open-mpi/ompi/pull/10652 incorporating your feedback and feedback from Harmen Stoppels. I would have @mentioned you in the PR, but it doesn't appear that you have a Github ID (or, I couldn't find it, at least). -- Je

Re: [OMPI users] RUNPATH vs. RPATH

2022-08-10 Thread Jeff Squyres (jsquyres) via users
Reuti -- thanks for the comments+fix about missing "-Wl," (oops!). In addition to yours, some more came in on https://github.com/open-mpi/ompi/pull/10624 after it was merged. I'll make a follow-on PR with these suggestions. -- Jeff Squyres jsq

[OMPI users] Open MPI Java MPI bindings

2022-08-09 Thread Jeff Squyres (jsquyres) via users
a bindings, or know of anyone who is using them. Thank you! -- Jeff Squyres jsquy...@cisco.com

Re: [OMPI users] RUNPATH vs. RPATH

2022-08-09 Thread Jeff Squyres (jsquyres) via users
the Right Thing for your environment, but you might want to check the output of "readelf -d ..." to be sure. Does that additional text help explain things? -- Jeff Squyres jsquy...@cisco.com ________ From: Jeff Squyres (jsquyres) Sent: Saturday, August 6, 2022 9

Re: [OMPI users] Problem with OpenMPI as Third pary library

2022-08-09 Thread Jeff Squyres (jsquyres) via users
I can't see the image that you sent; it seems to be broken. But I think you're asking about this: https://www.open-mpi.org/faq/?category=building#installdirs -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Sebastian Gutierrez via u

Re: [OMPI users] RUNPATH vs. RPATH

2022-08-06 Thread Jeff Squyres (jsquyres) via users
Reuti -- See my disclaimers on other posts about apologies for taking so long to reply! This code was written forever ago; I had to dig through it a bit, read the comments and commit messages, and try to remember why it was done this way. What I thought would be a 5-minute search turned into a

Re: [OMPI users] Multiple IPs on network interface

2022-07-07 Thread Jeff Squyres (jsquyres) via users
Can you send the full output of "ifconfig" (or "ip addr") from one of your compute nodes? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of George Johnson via users Sent: Monday, July 4, 2022 11:06 AM To: users@lists.ope

Re: [OMPI users] Intercommunicator issue (any standard about communicator?)

2022-06-24 Thread Jeff Squyres (jsquyres) via users
pt of an MPI communicator. Hence: the handle is meaningless to the application -- it's just an opaque value that the user program can pass around. User applications *can* compare it to the value for MPI_COMM_NULL, but that's about it. -- Jeff Squyres jsquy...@cisco.com __

Re: [OMPI users] Intercommunicator issue (any standard about communicator?)

2022-06-24 Thread Jeff Squyres (jsquyres) via users
ctionality (connect/accept/etc.) has always been a bit shaky; they have been tested to work in very, very specific conditions, and not made super robust to work in many different / generalized cases. Is there a chance you can orient your app to not use the MPI dynamic APIs? -- Jeff Squ

Re: [OMPI users] OpenMPI and names of the nodes in a cluster

2022-06-24 Thread Jeff Squyres (jsquyres) via users
the format of the hostname). I'm surprised that using the naive module (instead of the fwd module) doesn't solve your problem. ...oh shoot, I see why. It's because I had a typo in what I suggested to you. Please try: mpirun --mca regx naive ... (i.e., "regx", not &quo

Re: [OMPI users] OpenMPI and names of the nodes in a cluster

2022-06-16 Thread Jeff Squyres (jsquyres) via users
"fwd" regex component is selected by default, but it has certain expectations about the format of hostnames. Try using the "naive" regex component, instead. --  Jeff Squyres jsquy...@cisco.com From: Patrick Begou Sent: Thursday, June

Re: [OMPI users] OpenMPI and names of the nodes in a cluster

2022-06-16 Thread Jeff Squyres (jsquyres) via users
What exactly is the error that is occurring? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Patrick Begou via users Sent: Thursday, June 16, 2022 3:21 AM To: Open MPI Users Cc: Patrick Begou Subject: [OMPI users] OpenMPI and names of the

[OMPI users] Passing of an MPI luminary: Rusty Lusk

2022-05-23 Thread Jeff Squyres (jsquyres) via users
obituaries/downers-grove-il/ewing-lusk-10754811/amp --  Jeff Squyres jsquy...@cisco.com

Re: [OMPI users] Network traffic packets documentation

2022-05-17 Thread Jeff Squyres (jsquyres) via users
aders in its source code repo: https://github.com/openpmix/openpmix/. It's a different project than Open MPI, but you can certainly ask questions on their mailing lists, too. -- Jeff Squyres jsquy...@cisco.com From: victor sv Sent: Tuesday, May 17, 2022

Re: [OMPI users] Network traffic packets documentation

2022-05-16 Thread Jeff Squyres (jsquyres) via users
om the network itself (because, by definition, the OS won't have visibility of the packets). Regardless, all of those structs are defined in their BTL / MTL / PML / etc. components. We don't have formal documentation of any of them, sorry! -- Jeff Squyres jsquy...@cisco.com _

Re: [OMPI users] Network traffic packets documentation

2022-05-16 Thread Jeff Squyres (jsquyres) via users
ich network and Open MPI transport are you looking to sniff? -- Jeff Squyres jsquy...@cisco.com From: users on behalf of victor sv via users Sent: Sunday, May 15, 2022 3:55 PM To: users@lists.open-mpi.org Cc: victor sv Subject: [OMPI users] Net

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-05 Thread Jeff Squyres (jsquyres) via users
s to be necessary on some M1s (e.g., Scott's) but not others (e.g., George's). We'll put a guard in against the "unlimited" case in future releases. See https://github.com/open-mpi/ompi/issues/10358 for more details, but I figured I'd put the workaround ou

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-05 Thread Jeff Squyres (jsquyres) via users
Scott and I conversed a bit off list, and I got more data. I posted everything in https://github.com/open-mpi/ompi/issues/10358 -- let's follow up on this issue there. -- Jeff Squyres jsquy...@cisco.com From: George Bosilca Sent: Thursday, May 5,

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-05 Thread Jeff Squyres (jsquyres) via users
You can use "lldb -p PID" to attach to a running process. -- Jeff Squyres jsquy...@cisco.com From: Scott Sayres Sent: Thursday, May 5, 2022 11:22 AM To: Jeff Squyres (jsquyres) Cc: Open MPI Users Subject: Re: [OMPI users] mpirun hangs on m1 mac

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-05 Thread Jeff Squyres (jsquyres) via users
re is that the backtrace you posted earlier implies that the parent mpirun hadn't even finished its fork/exec sequence (i.e., mpirun itself is still in the "do_parent()" function, which implies that it didn't complete the pipe handshake that happens immediately after forking the ch

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-04 Thread Jeff Squyres (jsquyres) via users
start seeing output, good!If it completes, better! If it hangs, and/or if you don't see any output at all, do this: ps auxwww | egrep 'mpirun|foo.sh' It should show mpirun and 2 copies of foo.sh (and probably a grep). Does it? -- Jeff Squyres jsq

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-04 Thread Jeff Squyres (jsquyres) via users
code. E.g.: ./configure CFLAGS=-g ... make -j 8 all [sudo] make install (put whatever other configure flags you want in there, such as a custom prefix, ... etc.) -- Jeff Squyres jsquy...@cisco.com From: users on behalf of George Bosilca via users

Re: [OMPI users] mpirun hangs on m1 mac w openmpi-4.1.3

2022-05-04 Thread Jeff Squyres (jsquyres) via users
ples make mpirun -np 4 hello_c mpirun -np 4 ring_c -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Scott Sayres via users Sent: Tuesday, May 3, 2022 1:07 PM To: users@lists.open-mpi.org Cc: Scott Sayres Subject: [OMPI users] mpirun hangs on m1 ma

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-22 Thread Jeff Squyres (jsquyres) via users
Can you send all the information listed under "For compile problems" (please compress!): https://www.open-mpi.org/community/help/ -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Cici Feng via users Sent: Friday, April 22, 20

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Jeff Squyres (jsquyres) via users
else? It might be useful to compile Open MPI (and/or other libraries that you're using) with -g so that you can get more meaningful stack traces upon error -- that might give some insight into where / why the failure is occurring. -- Jeff Squyres jsquy...@cisc

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread Jeff Squyres (jsquyres) via users
A little more color on Gilles' answer: I believe that we had some Open MPI community members work on adding M1 support to Open MPI, but Gilles is absolutely correct: the underlying compiler has to support the M1, or you won't get anywhere. -- Jeff Squyres jsquy...

Re: [OMPI users] mixed OpenMP/MPI

2022-03-15 Thread Jeff Squyres (jsquyres) via users
Thanks for the poke! Sorry we missed replying to your github issue. Josh replied to it this morning. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Bernstein, Noam CIV USN NRL (6393) Washington DC (USA) via users Sent: Tuesday, March 15

Re: [OMPI users] handle_wc() in openib and IBV_WC_DRIVER2/MLX5DV_WC_RAW_WQE completion code

2022-02-23 Thread Jeff Squyres (jsquyres) via users
ards compatibility issues like this can occur. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Crni Gorac via users Sent: Tuesday, February 22, 2022 7:37 AM To: users@lists.open-mpi.org Cc: Crni Gorac Subject: [OMPI users] handle_wc() in open

Re: [OMPI users] Unknown breakdown (Transport retry count exceeded on mlx5_0:1/IB)

2022-02-23 Thread Jeff Squyres (jsquyres) via users
ccessfully got the message. From back in my IB days, the typical first place to look for errors like this is to check the layer 0 and layer 1 networking with Nvidia-level diagnostics to ensure that the network itself is healthy. -- Jeff Squyres jsquy...

Re: [OMPI users] Trouble compiling OpenMPI with Infiniband support

2022-02-23 Thread Jeff Squyres (jsquyres) via users
e you configured Open MPI with --without-mxm). If you can upgrade to Open MPI v4.1.2 and the latest UCX, see if you are still getting those MXM error messages. -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Angel de Vicente via users Sent:

Re: [OMPI users] Building Open MPI without zlib: what might go wrong/different?

2022-01-31 Thread Jeff Squyres (jsquyres) via users
It's used for compressing the startup time messages in PMIx. I.e., the traffic for when you "mpirun ...". It's mostly beneficial when launching very large MPI jobs. If you're only launching across several nodes, the performance improvement isn't really noti

Re: [OMPI users] RES: OpenMPI - Intel MPI

2022-01-27 Thread Jeff Squyres (jsquyres) via users
For a beginner, parts 1 and 2 are probably the most relevant, and you can probably skip the parts about PMIx (circle back to that later for more advanced knowledge). -- Jeff Squyres jsquy...@cisco.com From: users on behalf of Diego Zuccato via users

Re: [OMPI users] Gadget2 error 818 when using more than 1 process?

2022-01-27 Thread Jeff Squyres (jsquyres) via users
I'm afraid that without any further details, it's hard to help. I don't know why Gadget2 would complain about its parameters file. From what you've stated, it could be a problem with the application itself. Have you talked to the Gadget2 authors? -- Jeff Squyre

Re: [OMPI users] Gadget2 error 818 when using more than 1 process?

2022-01-25 Thread Jeff Squyres (jsquyres) via users
I'm afraid I don't know anything about Gadget, so I can't comment there. How exactly does the application fail? Can you try upgrading to Open MPI v4.1.2? What networking are you using? -- Jeff Squyres jsquy...@cisco.com From: users on

  1   2   3   4   5   6   7   8   9   10   >