My $0.02: there is not much useful that you can learn from system time
vs. user time. The only meaningful metric is total wall clock
execution time.
Open MPI's progression engine is designed to poll aggressively; this
approach does not work well in oversubscribed environments. You can
s
I try a couple of things including your suggestion. I also find out this has
been reported before,
http://www.open-mpi.org/community/lists/users/2007/03/2904.php
but there seems to be no clear solution so far:
Here is what I observe:
I keep the problem size fixed with 24 processes. I use two nodes
If you are running fewer processes on your nodes than they have
processors, then you can improve performance by adding
-mca mpi_paffinity_alone 1
to your cmd line. This will bind your processes to individual cores,
which helps with latency. If your program involves collectives, then
you ca
Hi all,
I am new to OpenMPI, and have an urgent run-time question. I
have openmpi-1.3.2 compiled with Intel Fortran compiler v.11 simply by
./configure --prefix= F77=ifort FC=ifort
then I set my LD_LIBRARY_PATH to include and
and compile my Fortran program properly. No compilation error.
I run