Re: [OMPI users] users Digest, Vol 1052, Issue 10

2008-10-31 Thread Allan Menezes
--- next part -- HTML attachment scrubbed and removed -- next part -- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: <http://www.open-mpi.org/MailArchives/users/attachments/20081031/2d67

Re: [OMPI users] problem running Open MPI on Cells

2008-10-31 Thread Mi Yan
Where did you put the environment variable related to MCF licence file and MCF share libraries? What is your default shell? Did you test indicate the following? Suppose you have 4 nodes, on node 1, " mpirun -np 4 --host node1,node2,node3,node4 hostname" works, but "mpirun -np4 --host

Re: [OMPI users] Problem with openmpi version 1.3b1 beta1

2008-10-31 Thread Ralph Castain
I see you are using IPv6. From what I can tell, we do enable that support by default if the underlying system supports it. My best guess is that either that support is broken (we never test it since none of us use IPv6), or our configure system isn't properly detecting that it exists.

[OMPI users] Problem with openmpi version 1.3b1 beta1

2008-10-31 Thread Allan Menezes
List-Post: users@lists.open-mpi.org Date: Fri, 31 Oct 2008 09:34:52 -0600 From: Ralph Castain Subject: Re: [OMPI users] users Digest, Vol 1052, Issue 1 To: Open MPI Users Message-ID: <0cf28492-b13e-4f82-ac43-c1580f079...@lanl.gov> Content-Type: text/plain;

Re: [OMPI users] Working with a CellBlade cluster

2008-10-31 Thread Gilbert Grosdidier
OK, thanks to Mi and Jeff for their useful replies anyway. Gilbert. On Fri, 31 Oct 2008, Jeff Squyres wrote: > AFAIK, there are no parameters available to monitor IB message passing. The > majority of it is processed in hardware, and Linux is unaware of it. We have > not added any extra

Re: [OMPI users] problem running Open MPI on Cells

2008-10-31 Thread Gilbert Grosdidier
Hi, To monitor the environment from inside the application, it could be useful to issue a 'system("printenv")' call at the very beginning of the main program, even before (and after, btw) the MPI_Init call, when running in serial job mode with a single CAB, using mpirun. HTH,

Re: [OMPI users] Fwd: Problems installing in Cygwin

2008-10-31 Thread Jeff Squyres
On Oct 31, 2008, at 3:20 PM, Gustavo Seabra wrote: As Jeff mentioned this component is not required on Windows. You can disable it completely in Open MPI and everything will continue to work correctly. Please add --enable-mca-no-build=memory_mallopt o maybe the more generic (as there is

Re: [OMPI users] Fwd: Problems installing in Cygwin

2008-10-31 Thread George Bosilca
Gustavo, I guess that if you disable the vt contrib package, this is take you one step further :) Hopefully at the end of the compile stage ... and at the beginning of troubles with running the cygwin parallel applications ... Meanwhile, there is a special option to disable contrib

Re: [OMPI users] Fwd: Problems installing in Cygwin

2008-10-31 Thread Jeff Squyres
Ok, I'll CC the VT guys on the ticket and let them know. They'll likely slurp in whatever fix we do for OMPI into VT. FWIW: you can disable the VT package with: --enable-contrib-no-build=vt On Oct 31, 2008, at 3:02 PM, Gustavo Seabra wrote: As I keep trying to install OpenMPI in

Re: [OMPI users] Working with a CellBlade cluster

2008-10-31 Thread Jeff Squyres
AFAIK, there are no parameters available to monitor IB message passing. The majority of it is processed in hardware, and Linux is unaware of it. We have not added any extra instrumentation into the openib BTL to provide auditing information, because, among other reasons, that is the

[OMPI users] problem running Open MPI on Cells

2008-10-31 Thread Hahn Kim
Hello, I'm having problems using Open MPI on a cluster of Mercury Computer's Cell Accelerator Boards (CABs). We have an MPI application that is running on multiple CABs. The application uses Mercury's MultiCore Framework (MCF) to use the Cell's SPEs. Here's the basic problem. I can

Re: [OMPI users] Working with a CellBlade cluster

2008-10-31 Thread Mi Yan
Gilbert, I did not know the MCA parameters that can monitor the message passing. I have tried a few MCA verbose parameters and did not identify anyone helpful. One way to check if the message goes via IB or SM maybe to check the counters in /sys/class/infiniband. Regards, Mi

Re: [OMPI users] MPI + Mixed language coding(Fortran90 + C++)

2008-10-31 Thread Gustavo Seabra
On Fri, Oct 31, 2008 at 3:07 PM, Rajesh Ramaya wrote: > Actually I am > not writing any MPI code inside? It's the executable (third party software) > who does that part. What are you using for this? We too use Fortran and C routines combined with no problem at all. I would think that whatever

Re: [OMPI users] Fwd: Problems installing in Cygwin

2008-10-31 Thread Gustavo Seabra
On Thu, Oct 30, 2008 at 9:04 AM, George Bosilca wrote: Hi George, I'm sorry for taking too long to respond. As you mentioned, config takes a veeery long time in cygwin, and then the install itself takes many ties that :-( > As Jeff mentioned this component is not required on Windows. You

Re: [OMPI users] MPI + Mixed language coding(Fortran90 + C++)

2008-10-31 Thread Rajesh Ramaya
Hello Jeff Squyres, Thank you very much for the immediate reply. I am able to successfully access the data from the common block but the values are zero. In my algorithm I even update a common block but the update made by the shared library is not taken in to account by the executable. Can you

Re: [OMPI users] Fwd: Problems installing in Cygwin

2008-10-31 Thread Gustavo Seabra
As I keep trying to install OpenMPI in Cygwin, I found another instance where RTFD_NEXT is assumed to be present. Will keep trying... Gustavo. = Making all in vtlib make[5]: Entering directory `/home/seabra/local/openmpi-1.3b1/ompi/contrib/vt/vt/vtlib' gcc

Re: [OMPI users] MPI + Mixed language coding(Fortran90 + C++)

2008-10-31 Thread Jeff Squyres
On Oct 31, 2008, at 11:57 AM, Rajesh Ramaya wrote: I am completely new to MPI. I have a basic question concerning MPI and mixed language coding. I hope any of you could help me out. Is it possible to access FORTRAN common blocks in C++ in a MPI compiled code. It works without MPI but

Re: [OMPI users] ompi-checkpoint is hanging

2008-10-31 Thread Josh Hursey
After some additional testing I believe that I have been able to reproduce the problem. I suspect that there is a bug in the coordination protocol that is causing an occasional hang in the system. Since it only happens occasionally (though slightly more often on a fully loaded machine)

[OMPI users] MPI + Mixed language coding(Fortran90 + C++)

2008-10-31 Thread Rajesh Ramaya
Hello MPI Users, I am completely new to MPI. I have a basic question concerning MPI and mixed language coding. I hope any of you could help me out. Is it possible to access FORTRAN common blocks in C++ in a MPI compiled code. It works without MPI but as soon I switch to MPI the access of

Re: [OMPI users] users Digest, Vol 1052, Issue 1

2008-10-31 Thread Ralph Castain
It looks like the daemon isn't seeing the other interface address on host x2. Can you ssh to x2 and send the contents of ifconfig -a? Ralph On Oct 31, 2008, at 9:18 AM, Allan Menezes wrote: users-requ...@open-mpi.org wrote: Send users mailing list submissions to us...@open-mpi.org

[OMPI users] MPI_Type_create_darray causes MPI_File_set_view to crash when ndims=2, array_of_gsizes[0]>array_of_gsizes[1]

2008-10-31 Thread Antonio Molins
Hi again, The problem in a nutshell: it looks like, when I use MPI_Type_create_darray with an argument array_of_gsizes where array_of_gsizes[0]>array_of_gsizes[1], the datatype returned goes through MPI_Type_commit() just fine, but then it causes MPI_File_set_view to crash!! Any idea as

Re: [OMPI users] users Digest, Vol 1052, Issue 1

2008-10-31 Thread Allan Menezes
users-requ...@open-mpi.org wrote: Send users mailing list submissions to us...@open-mpi.org To subscribe or unsubscribe via the World Wide Web, visit http://www.open-mpi.org/mailman/listinfo.cgi/users or, via email, send a message with subject or body 'help' to

Re: [OMPI users] ompi-checkpoint is hanging

2008-10-31 Thread Matthias Hovestadt
Hi Tim! First of all: thanks a lot for answering! :-) Could you try running your two MPI jobs with fewer procs each, say 2 or 3 each instead of 4, so that there are a few extra cores available. This problem occurrs with any number of procs. Also, what happens to the checkpointing of one

Re: [OMPI users] Issues with MPI_Type_create_darray

2008-10-31 Thread Antonio Molins
Hi again, Using MPI_Type_get_true_extent(), I changed the way of reporting type size and extent to: int typesize; long typeextent, typelb; MPI_Type_size(this->datatype,); MPI_Type_get_true_extent(this->datatype,,); //MPI_Type_lb(this->datatype,);

Re: [OMPI users] ompi-checkpoint is hanging

2008-10-31 Thread Tim Mattox
Hello Matthias, Hopefully Josh will chime in shortly. But I have one suggestion to help diagnose this. Could you try running your two MPI jobs with fewer procs each, say 2 or 3 each instead of 4, so that there are a few extra cores available. I know that isn't a solution, but it may help us

[OMPI users] unsubscibe

2008-10-31 Thread Bertrand P. S. Russell
-- There is much pleasure to be gained from useless knowledge. Bertrand. P. S. Russell TROSY-NMR Lab, Singapore.

[OMPI users] ompi-checkpoint is hanging

2008-10-31 Thread Matthias Hovestadt
Hi! I'm using the development version of OMPI from SVN (rev. 19857) for executing MPI jobs on my cluster system. I'm particularly using the checkpoint and restart feature, basing on the currentmost version of BLCR. The checkpointing is working pretty fine as long as I only execute a single job

Re: [OMPI users] Equivalent .h files

2008-10-31 Thread Jeff Squyres
The Open MPI that ships with Leopard does not include Fortran support because OS X does not ship with a Fortran compiler (this was Apple's decision, not ours). If you have Fortran MPI applications, you'll need to a) download and install your own Fortran compiler (e.g., http://hpc.sf.net/)

Re: [OMPI users] Equivalent .h files

2008-10-31 Thread Benjamin Lamptey
Hello again, I have to be more specific with my problem. 1) I am using the Mac OS X (Leopard) operating system. When I do uname -a, I get Darwin Kernel Version 9.5.0 2) My code if fortran 90 3) I tried using the mpif90 wrapper and I got the following message x mpif90 -c -O3

Re: [OMPI users] Openmpi ver1.3beta1

2008-10-31 Thread Ralph Castain
When you typed the --host x1 command, were you sitting on x1? Likewise, when you typed the --host x2 command, were you not on host x2? If the answer to both questions is "yes", then my guess is that something is preventing you from launching a daemon on host x2. Try adding

[OMPI users] Openmpi ver1.3beta1

2008-10-31 Thread Allan Menezes
Hi, I built open mpi version 1.3b1 withe following cofigure command: ./configure --prefix=/opt/openmpi13b1 --enable-mpi-threads --with-threads=posix --disable-ipv6 I have six nodes x1..6 I distributed the /opt/openmpi13b1 with scp to all other nodes from the head node When i run the