Hi,

I am trying to run with Open MPI 1.3 on a cluster using PBS Pro:

pbs_version = PBSPro_9.2.0.81361


However, after compiling with these options:

../configure
--prefix=/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads
 CC=/opt/intel/cce/10.1.015/bin/icc CXX=/opt/intel/cce/10.1.015/bin/icpc 
CPP="/opt/intel/cce/10.1.015/bin/icc -E" FC=/opt/intel/fce/10.1.015/bin/ifort 
F90=/opt/intel/fce/10.1.015/bin/ifort F77=/opt/intel/fce/10.1.015/bin/ifort 
--enable-mpi-f90 --with-tm=/usr/pbs/ --enable-mpi-threads=yes 
--enable-contrib-no-build=vt

I get runtime errors when running on more than one reserved node
even /bin/hostname:

/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/bin/mpirun
  -np 5  /bin/hostname 
/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/bin/mpirun:
 symbol lookup error: 
/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/lib/openmpi/mca_plm_tm.so:
 undefined symbol: tm_init

When running on one node only, I don't get this error.

Now, I see that I only have static PBS libraries so I tried to compile
this component statically. I added to the above configure:
"--enable-mca-static=ras-tm,pls-tm"

However, nothing changed. The same errors occurr.


But if I compile Open MPI only with static libraries ("--enable-static
--disable-shared"), the MPI (or non-MPI) programs run OK.

Can you help me here ?

Thanks,
Kiril



-- 
Dipl.-Inf. Kiril Dichev
Tel.: +49 711 685 60492
E-mail: dic...@hlrs.de
High Performance Computing Center Stuttgart (HLRS)
Universität Stuttgart
70550 Stuttgart
Germany


Reply via email to