Re: [OMPI users] SGE and openmpi

2011-04-12 Thread Kevin . Buckley
>> #!/bin/bash >> #$ -cwd >> #$ -j y >> #$ -S /bin/bash >> #$ -q all.q >> #$ -pe orte 18 >> MPI_DIR=/home/jason/openmpi-1.4.3-install/bin >> /home/jason/openmpi-1.4.3-install/bin/mpirun -np $NSLOTS myprog > If you have SGE integration, you should not specify the number > of slots requested on t

Re: [OMPI users] SGE and openmpi

2011-04-12 Thread Ralph Castain
On Apr 11, 2011, at 11:33 PM, kevin.buck...@ecs.vuw.ac.nz wrote: > >>> #!/bin/bash >>> #$ -cwd >>> #$ -j y >>> #$ -S /bin/bash >>> #$ -q all.q >>> #$ -pe orte 18 >>> MPI_DIR=/home/jason/openmpi-1.4.3-install/bin >>> /home/jason/openmpi-1.4.3-install/bin/mpirun -np $NSLOTS myprog > > >> If you

Re: [OMPI users] "Value out of bounds in file" error

2011-04-12 Thread hi
Hi, Any idea on this ??? Thank you. -Hiral On Mon, Apr 11, 2011 at 6:34 PM, hi wrote: > Hi, > > For the attached test program (reference: > http://www.netlib.org/blacs/BLACS/Examples.html#HELLO), I am observing > following message and binary halts... > > [:03864] [[16991,0],0] ORTE_ERROR_LOG: V

Re: [OMPI users] SGE and openmpi

2011-04-12 Thread Jason Palmer
Thanks for your resply. My problem was actually caused by having include mpif.h in the code still, rather than use mpi. But the info about NSLOTS, etc is good to know. Cheers, Jason P.S. I had originally used $MPI_DIR in mpirun call, but changed it to the explicit directory in the course of tryi

Re: [OMPI users] "Value out of bounds in file" error

2011-04-12 Thread Jeff Squyres
Can you provide more information? Where / when exactly does this error occur? E.g., does it happen during MPI_INIT, or is it later in the application? Are you able to run any MPI codes at all, such as the example "hello world" or "ring" programs? Are you able to run non-MPI applications at al

Re: [OMPI users] OpenMPI 1.4.2 Hangs When Using More Than 1 Proc

2011-04-12 Thread Jeff Squyres
On Apr 11, 2011, at 9:53 AM, Stergiou, Jonathan C CIV NSWCCD West Bethesda, 6640 wrote: > + ERROR MESSAGE + > FILE MISSING (Input): name = gemini.inp This seems like a gemini error, not an Open MPI error. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://

Re: [OMPI users] "Value out of bounds in file" error

2011-04-12 Thread hi
Hi Jeff, I tried "hello_f77", "hello_c", "hello_cxx" examples and all are halting at call to "MPI_INIT()" with below message on console... """ [mymachine:10108] [[27195,0],0] ORTE_ERROR_LOG: Value out of bounds in file ..\..\orte\mca\oob\tcp\oob_tcp.c at line 1193 """ Do you have any idea??? T

Re: [OMPI users] "Value out of bounds in file" error

2011-04-12 Thread Jeff Squyres
Can you provide the other information? http://www.open-mpi.org/community/help/ On Apr 12, 2011, at 6:28 AM, hi wrote: > Hi Jeff, > > I tried "hello_f77", "hello_c", "hello_cxx" examples and all are halting at > call to "MPI_INIT()" with below message on console... > > """ > [mymachine

Re: [OMPI users] OpenMPI 1.4.2 Hangs When Using More Than 1 Proc

2011-04-12 Thread Stergiou, Jonathan C CIV NSWCCD West Bethesda, 6640
Apologies for not clarifying. The behavior below is expected, I am just checking to see that Gemini will start-up and look for its input file. When Gemini+OpenMPI is working correctly, I expect to see the behavior below. When Gemini+OpenMPI is not working correctly (current behavior), I see t

Re: [OMPI users] OpenMPI 1.4.2 Hangs When Using More Than 1 Proc

2011-04-12 Thread Ralph Castain
Let's simplify the issue as we have no idea what your codes are doing. Can you run two copies of hostname, for example? What about multiple copies of an MPI version of "hello" - see the examples directory in the OMPI tarball. On Apr 12, 2011, at 8:43 AM, Stergiou, Jonathan C CIV NSWCCD West Be

Re: [OMPI users] OpenMPI 1.4.2 Hangs When Using More Than 1 Proc

2011-04-12 Thread Stergiou, Jonathan C CIV NSWCCD West Bethesda, 6640
Ralph, Thanks for the reply and guidance. I ran the following: $> mpirun -np 1 hostname XXX_TUX01 $> mpirun -np 2 hostname XXX_TUX01 XXX_TUX01 $> mpirun -np 1 ./hello_c Hello, world, I am 0 of 1. $> mpirun -np 2 ./hello_c (no result, terminal does not respond until ctrl+c) > Let's simpli

Re: [OMPI users] OpenMPI 1.4.2 Hangs When Using More Than 1 Proc

2011-04-12 Thread Ralph Castain
Okay, that says that mpirun is working correctly - the problem appears to be in MPI_Init. How was OMPI configured? On Apr 12, 2011, at 9:24 AM, Stergiou, Jonathan C CIV NSWCCD West Bethesda, 6640 wrote: > Ralph, > > Thanks for the reply and guidance. > > I ran the following: > > $> mpirun

Re: [OMPI users] "Value out of bounds in file" error

2011-04-12 Thread Rainer Keller
Hiral, On Monday 11 April 2011 15:04:19 hi wrote: > Is it the case that I am combining mkl_intelmpi with locally built openmpi > and hence this problem??? Yes, as You mention yourselve here ,-] > ... > Files (x86)\Intel\Compiler\11.1\065\mkl/ia32/lib" mkl_blacs_intelmpi.lib > mkl_core.lib mkl_cor

[OMPI users] Problem with setting up openmpi-1.4.3

2011-04-12 Thread amosl...@gmail.com
Hi, I am embarrassed! I submitted a note to the users on setting up openmpi-1.4.3 using SUSE-11.3 under Linux and received several replies. I wanted to transfer them but they disappeared for no apparent reason. I hope that those that sent me messages will be kind enough to repeat them

Re: [OMPI users] Problem with setting up openmpi-1.4.3

2011-04-12 Thread Jeff Squyres
It looks like you have a broken C++ compiler installation. The code that is failing to compile is a system-level C++ header file, not an Open MPI file. The only part of Open MPI that is C++ is the optional VampirTrace package; you can disable building it and the rest of Open MPI will work fine.

[OMPI users] OMPI monitor each process behavior

2011-04-12 Thread Jack Bryan
Hi , All: I need to monitor the memory usage of each parallel process on a linux Open MPI cluster. But, top, ps command cannot help here because they only show the head node information. I need to follow the behavior of each process on each cluster node. I cannot use ssh to access each node.