Re: [OMPI users] Problem with openmpi-default-hostfile

2011-11-07 Thread Lukas Razik
Hello Ralph and thanks for your answer!



> Where did you install OMPI? If you check "which mpirun", does it point 
> to the same installation where you edited the default hostfile?

It was installed in the default path chosen by OFED.
And yes, I've edited the right openmpi-default-hostfile:

$ which mpirun
/usr/mpi/gcc/openmpi-1.4.3/bin/mpirun

$ cat /usr/mpi/gcc/openmpi-1.4.3/etc/openmpi-default-hostfile
cluster1
cluster2
cluster3
cluster4

$ mpirun -np 4 B4A1
Hello World! I am rank 003 of 4 on hostname=cluster1.
Hello World! I am rank 000 of 4 on hostname=cluster1.
Hello World! I am rank 002 of 4 on hostname=cluster1.
Hello World! I am rank 001 of 4 on hostname=cluster1.

$ mpirun -hostfile /usr/mpi/gcc/openmpi-1.4.3/etc/openmpi-default-hostfile -np 4 B4A1
Hello World! I am rank 000 of 4 on hostname=cluster1.
Hello World! I am rank 001 of 4 on hostname=cluster2.
Hello World! I am rank 002 of 4 on hostname=cluster3.
Hello World! I am rank 003 of 4 on hostname=cluster4.

Regards,
Lukas
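
The check Ralph asked about (does the mpirun on PATH belong to the installation whose default hostfile was edited?) can be scripted. This is a minimal sketch, not part of the original message. The function name default_hostfile_for is made up for illustration, and it assumes the configuration directory is <prefix>/etc, as in the paths shown above. It only confirms which file to look at and prints the explicit -hostfile form that worked above; it does not explain why the default file is ignored.

```shell
#!/bin/sh
# Print the default hostfile path that belongs to a given mpirun binary.
# Assumption: the install keeps its config under <prefix>/etc, as the
# /usr/mpi/gcc/openmpi-1.4.3 install in this message does.
default_hostfile_for() {
    prefix=$(dirname "$(dirname "$1")")
    printf '%s/etc/openmpi-default-hostfile\n' "$prefix"
}

if command -v mpirun >/dev/null 2>&1; then
    hostfile=$(default_hostfile_for "$(command -v mpirun)")
    echo "mpirun on PATH: $(command -v mpirun)"
    echo "expected default hostfile: $hostfile"
    # Passing the file explicitly is the form that spread the ranks above.
    echo "workaround: mpirun -hostfile $hostfile -np 4 B4A1"
else
    echo "mpirun not found on PATH"
fi
```

If the printed path differs from the file that was edited, the edit went into a different installation than the one mpirun belongs to.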



Re: [OMPI users] EXTERNAL: Re: How to set up state-less node /tmp for OpenMPI usage

2011-11-07 Thread Blosch, Edwin L
Thanks for the valuable input. I'll change to a wait-and-watch approach.

The FAQ on tuning sm says "If the session directory is located on a network 
filesystem, the shared memory BTL latency will be extremely high."  And the 
title is 'Why am I seeing incredibly poor performance...'.  So I made the leap 
that this configuration must be avoided at all costs...

-----Original Message-----
From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf 
Of David Singleton
Sent: Sunday, November 06, 2011 4:15 PM
To: us...@open-mpi.org
Subject: Re: [OMPI users] EXTERNAL: Re: How to set up state-less node /tmp for 
OpenMPI usage


On 11/05/2011 09:11 AM, Blosch, Edwin L wrote:
..
>
> I know where you're coming from, and I probably didn't title the post 
> correctly because I wasn't sure what to ask.  But I definitely saw it, and 
> still see it, as an OpenMPI issue.  Having /tmp mounted over NFS on a 
> stateless cluster is not a broken configuration, broadly speaking. The 
> vendors made those decisions and presumably that's how they do it for other 
> customers as well. There are two other (Platform/HP) MPI applications that 
> apparently work normally. But OpenMPI doesn't work normally. So it's 
> deficient.
>

I'm also concerned that there is a bit of an over-reaction to network
filesystems.  Stores to mmap'd files do not instantly turn into filesystem
writes - there are dirty_writeback parameters to control how often
writes occur, and it's typically every 5-20 seconds.  Ideally, memory or a
local disk is used for session directories but, in many cases, you just won't
notice a performance hit from network filesystems - we didn't when we
tested session directories on Lustre.  If your app is one of the handful
that are slowed by OS jitter at megascale, then you may well notice.
Obviously, it's something to test.

For our 1.5 install, I removed Lustre from the list of filesystem types
that generate the warning message about network filesystems.  It would be
nice if it was a site choice whether or not to produce that message and
when.

David
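
The writeback parameters mentioned above can be inspected on a Linux node. This is a minimal sketch, not part of the original message. It assumes the parameters meant are the kernel's vm.dirty_writeback_centisecs and vm.dirty_expire_centisecs, exposed under /proc/sys/vm, and the helper name centisecs_to_secs is made up for illustration. Network filesystem clients may apply their own flushing policy on top of these values, so treat the output as a starting point for testing rather than a prediction.

```shell
#!/bin/sh
# Convert a value in centiseconds (the unit these sysctls use) to seconds.
centisecs_to_secs() {
    printf '%d.%02d\n' "$(( $1 / 100 ))" "$(( $1 % 100 ))"
}

# Assumed parameter names; both are standard Linux VM sysctls.
for name in dirty_writeback_centisecs dirty_expire_centisecs; do
    file=/proc/sys/vm/$name
    if [ -r "$file" ]; then
        value=$(cat "$file")
        echo "$name = $value ($(centisecs_to_secs "$value") s)"
    else
        echo "$name: not readable on this system"
    fi
done
```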

___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


[OMPI users] Problems compiling and running openmpi-1.4.4

2011-11-07 Thread amosl...@gmail.com
Hi all,
I have been trying to compile and run openmpi-1.4.4 using SUSE
Linux 11.4 on VMware 7.0.1 and have run into difficulties. I used the Intel
compilers (composerXE-2011.5.220), both C and Fortran.  Both compilers
and openmpi-1.4.4 appeared to build properly, since I did
not see any errors in the config.out, make.out and install.out files.
However, when I tried to run the mpicc programs in the examples directory, it
gave the output shown in the attached file.  I have tried to locate the files
that are listed as not found in the attached file, but could not find them. I
have checked the file permissions and all seems to be in order.
At this point I do not know what to do and would appreciate any
advice.

Amos Leffler
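
The attachment's contents are not shown in this archive, so the following is only a general diagnostic sketch and not part of the original message. It assumes the files reported as not found are shared libraries (for example, compiler runtime libraries not on the loader's search path), which the message does not confirm. The helper name missing_libs is made up for illustration; it filters ldd output for unresolved entries.

```shell
#!/bin/sh
# Read `ldd` output on stdin and print the names of unresolved libraries.
missing_libs() {
    awk '/not found/ { print $1 }'
}

# Hypothetical usage: check the mpicc wrapper if it is on PATH.
if command -v mpicc >/dev/null 2>&1; then
    ldd "$(command -v mpicc)" | missing_libs
else
    echo "mpicc not found on PATH"
fi
```

The same filter can be applied to a compiled example binary. If it prints any names, the directory containing those libraries would need to be made visible to the runtime loader before launching the program.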


out_test
Description: Binary data