I'm having a problem running OpenMPI under Torque. It complains like there is a command syntax problem, but the three variations below are all correct, best I can tell using mpirun -help. The environment in which the command executes, i.e. PATH and LD_LIBRARY_PATH, is correct. Torque is 2.3.x. OpenMPI is 1.2.8. OFED is 1.4.
Somewhere in the FAQ I had read that you must not give -machinefile under Torque with OpenMPI 1.2.8 and you did not need to give -np. That's why I tried variation 3 below without either of these options, but it still fails. Thanks for any help /usr/mpi/intel/openmpi-1.2.8/bin/mpirun -np 28 /tmp/43.fwnaeglingio/falconv4_ibm_openmpi -cycles 100 -ri restart.0 -ro /tmp/43.fwnaeglingio/restart.0 -------------------------------------------------------------------------- Failed to find the following executable: Host: n8n26 Executable: -p Cannot continue. mpirun --prefix /usr/mpi/intel/openmpi-1.2.8 --machinefile /var/spool/torque/aux/45.fwnaeglingio -np 28 --mca btl ^tcp --mca mpi_leave_pinned 1 --mca mpool_base_use_mem_hooks 1 -x LD_LIBRARY_PATH -x MPI_ENVIRONMENT /tmp/45.fwnaeglingio/falconv4_ibm_openmpi -cycles 100 -ri restart.0 -ro /tmp/45.fwnaeglingio/restart.0 -------------------------------------------------------------------------- Failed to find or execute the following executable: Host: n8n27 Executable: --prefix /usr/mpi/intel/openmpi-1.2.8 Cannot continue. /usr/mpi/intel/openmpi-1.2.8/bin/mpirun -x LD_LIBRARY_PATH -x MPI_ENVIRONMENT=1 /tmp/47.fwnaeglingio/falconv4_ibm_openmpi -cycles 100 -ri restart.0 -ro /tmp/47.fwnaeglingio/restart.0 -------------------------------------------------------------------------- Failed to find the following executable: Host: n8n27 Executable: - Cannot continue.