Re: [OMPI users] Question about virtual interface

2020-06-23 Thread Kulshrestha, Vipul via users
still hanged (with no output) when I specified btl_base_verbose 100. I will try using the CIDR for the below hosts as an experiment. Regards, Vipul From: Jeff Squyres (jsquyres) [mailto:jsquy...@cisco.com] Sent: Tuesday, June 23, 2020 1:36 PM To: Open MPI User's List Cc: Kulshrestha,

[OMPI users] Question about virtual interface

2020-06-18 Thread Kulshrestha, Vipul via users
Hi, I have read conflicting statements about OMPI support for virtual interfaces. The Open MPI FAQ mentions that virtual IP interfaces are not supported and this will not be solved by using either btl_tcp_if_include or btl_tcp_if_exclude. (https://www.open-mpi.org/faq/?category=tcp#ip-virtual-

Re: [OMPI users] Running mpirun with grid

2020-06-02 Thread Kulshrestha, Vipul via users
"1.has6" >> Server daemon successfully started with task id "1.hpb12" >> Server daemon successfully started with task id "1.has4" > >> Unmatched ". >> Unmatched ". >> Unmatched ". > Could be a clue as to what is actual

Re: [OMPI users] Running mpirun with grid

2020-06-01 Thread Kulshrestha, Vipul via users
m:base:receive update proc state command from [[26562,0],3] [sox3:24416] [[26562,0],0] plm:base:receive got update_proc_state for job [26562,1] -Original Message- From: Jeff Squyres (jsquyres) [mailto:jsquy...@cisco.com] Sent: Monday, June 1, 2020 4:15 PM To: Open MPI User's List Cc: K

Re: [OMPI users] Running mpirun with grid

2020-05-31 Thread Kulshrestha, Vipul via users
icate and fails. I think the root of the problem lies in the plm output that shows the qrsh it will use to start the job. For some reason, mpirun is still trying to "tree spawn", which (IIRC) isn't allowed on grid (all the daemons have to be launched in one shot by mpirun using qrsh).

Re: [OMPI users] Running mpirun with grid

2020-05-31 Thread Kulshrestha, Vipul via users
consider switching to using the Slurm batch queuing system? On Sat, 30 May 2020 at 00:41, Kulshrestha, Vipul via users mailto:users@lists.open-mpi.org>> wrote: Hi, I need to launch my openmpi application on grid. My application is designed to run N processes, where each process woul

[OMPI users] Running mpirun with grid

2020-05-29 Thread Kulshrestha, Vipul via users
Hi, I need to launch my openmpi application on grid. My application is designed to run N processes, where each process would have M threads. I am using open MPI version 4.0.1 % /build/openmpi/openmpi-4.0.1/rhel6/bin/ompi_info | grep grid MCA ras: gridengine (MCA v2.1.0, API v2.

Re: [OMPI users] Warnings

2020-05-05 Thread Kulshrestha, Vipul via users
runs just fine? fwiw, the message says that rank 3 received an unexpected connection from rank 4 Cheers, Gilles On Tue, May 5, 2020 at 9:08 AM Kulshrestha, Vipul via users wrote: > > Hi, > > > > Could somebody explain what does these warning imply? Is this caused if 2 >

[OMPI users] Warnings

2020-05-04 Thread Kulshrestha, Vipul via users
Hi, Could somebody explain what does these warning imply? Is this caused if 2 distinct openmpi application end up running on same machine? I am using 4.0.1 version. Thanks, Vipul Message in the stdout of the application [orw-med-fenway1][[61362,1],3][btl_tcp_endpoint.c:626:mca_btl_tcp_endpoin

Re: [OMPI users] running mpirun with grid

2020-03-24 Thread Kulshrestha, Vipul via users
I can get on this. Thanks, Vipul -Original Message- From: Kulshrestha, Vipul Sent: Thursday, March 19, 2020 5:31 PM To: 'Reuti' ; Open MPI Users Subject: RE: [OMPI users] running mpirun with grid Hi Reuti, Finally I was able to understand how to set this up and I have so

Re: [OMPI users] running mpirun with grid

2020-03-19 Thread Kulshrestha, Vipul via users
message to the specified daemon located on the indicated node: my node: bos2 target node: hpb11 This is usually an internal programming error that should be reported to the developers. In the meantime, a workaround may be to set the MCA param routed=direct on the command line or in your environment. We apologi

Re: [OMPI users] running mpirun with grid

2020-02-06 Thread Kulshrestha, Vipul via users
: Thursday, February 6, 2020 4:35 PM To: Open MPI Users Cc: Kulshrestha, Vipul Subject: Re: [OMPI users] running mpirun with grid Hi, > Am 06.02.2020 um 21:47 schrieb Kulshrestha, Vipul via users > : > > Hi, > > I need to launch my openmpi application on grid. > > My ap

[OMPI users] running mpirun with grid

2020-02-06 Thread Kulshrestha, Vipul via users
Hi, I need to launch my openmpi application on grid. My application is designed to run N processes, where each process would have M threads. To run it without grid, I run it as (say N = 7, M = 2) % mpirun -np 7 The above works well and runs N processes. I am also able to submit it on grid us

Re: [OMPI users] mpirun --output-filename behavior

2019-10-31 Thread Kulshrestha, Vipul via users
Thanks Jeff. “:nojobid” worked well for me and helps me remove 1 extra level of hierarchy for log files. Regards Vipul From: Jeff Squyres (jsquyres) [mailto:jsquy...@cisco.com] Sent: Thursday, October 31, 2019 6:21 PM To: Kulshrestha, Vipul Cc: Open MPI User's List Subject: Re: [OMPI

Re: [OMPI users] mpirun --output-filename behavior

2019-10-30 Thread Kulshrestha, Vipul via users
] Sent: Tuesday, October 29, 2019 9:07 PM To: Open MPI User's List Cc: Kulshrestha, Vipul Subject: Re: [OMPI users] mpirun --output-filename behavior On Oct 29, 2019, at 7:30 PM, Kulshrestha, Vipul via users mailto:users@lists.open-mpi.org>> wrote: Hi, We recently shifted from openM

[OMPI users] mpirun --output-filename behavior

2019-10-29 Thread Kulshrestha, Vipul via users
Hi, We recently shifted from openMPI 2.0.1 to 4.0.1 and are seeing an important behavior change with respect to above option. We invoke mpirun as % mpirun -output-filename /app.log -np With 2.0.1, the above produced /app.log. file for stdout of the application, where is the rank of the pro

Re: [OMPI users] TMPDIR for running openMPI job under grid

2019-07-29 Thread Kulshrestha, Vipul via users
OMPI v4 or at least something in the v3 series. If you continue to have a problem, then set PMIX_MCA_ptl=tcp in your environment. On Jul 26, 2019, at 12:12 PM, Kulshrestha, Vipul via users mailto:users@lists.open-mpi.org>> wrote: Hi, I am trying to setup my open-mpi application to run

[OMPI users] TMPDIR for running openMPI job under grid

2019-07-26 Thread Kulshrestha, Vipul via users
Hi, I am trying to setup my open-mpi application to run under grid. It works sometimes, but sometimes I get the below error. I have contacted my grid site administrator and the message from them is that they cannot change the TMPDIR path used in the grid configuration. I have tried setting TNP

[OMPI users] signal handling with mpirun

2017-11-21 Thread Kulshrestha, Vipul
Hi, I am finding that on Ctrl-C, mpirun immediately stops and does not sends SIGTERM to the child processes. I am using openmpi 1.10.6. The child processes are able to handle SIGINT. I verified that by a printf in my signal handler and then issuing SIGINT to my process directly. However, when

Re: [OMPI users] Questions about integration with resource distribution systems

2017-07-26 Thread Kulshrestha, Vipul
uses the nodes that SGE provides. > > What your cmd line does is restrict the entire operation on each node (daemon > + 8 procs) to 40GB of memory. OMPI does not support per-process restrictions > other than binding to cpus. > > >> On Jul 26, 2017, at 6:03 AM, Kulshres

Re: [OMPI users] Questions about integration with resource distribution systems

2017-07-26 Thread Kulshrestha, Vipul
-mpi.org] On Behalf Of r...@open-mpi.org Sent: Tuesday, July 25, 2017 8:16 PM To: Open MPI Users Subject: Re: [OMPI users] Questions about integration with resource distribution systems On Jul 25, 2017, at 3:48 PM, Kulshrestha, Vipul mailto:vipul_kulshres...@mentor.com>> wrote: I have s

[OMPI users] Questions about integration with resource distribution systems

2017-07-25 Thread Kulshrestha, Vipul
I have several questions about integration of openmpi with resource queuing systems. 1. I understand that openmpi supports integration with various resource distribution systems such as SGE, LSF, torque etc. I need to build an openmpi application that can interact with variety of different res