Pak Lui wrote:
Orion Poplawski wrote:
In our setup (which I don't believe is very unique) the nodes are
connected by two networks: an "admin" network which allows for
connections from outside the cluster and an "MPI" network that is a
private GigE network connecting the nodes for MPI traff
Hi Orion,
I wonder if you can try this:
use the PE_HOSTFILE that is generated by the qsh -pe my_pe for tight
integration, so the host names may be the ones without the "x". But then
in your mpirun command, specify the interfaces you would like to
exclude, e.g. all except for the gigE interfac
Pak Lui wrote:
Hi Orion and Reuti,
Let me see if I can understand the issue by breaking them down first:
(1) First, I am curious to know why you would need to create a
PE_HOSTFILE yourself, because that file is generated by SGE/N1GE when
you specify you are running a parallel job under SGE/N1
Hi Orion and Reuti,
Let me see if I can understand the issue by breaking them down first:
(1) First, I am curious to know why you would need to create a
PE_HOSTFILE yourself, because that file is generated by SGE/N1GE when
you specify you are running a parallel job under SGE/N1GE, by doing
so
Reuti wrote:
Hi,
Am 20.10.2006 um 01:08 schrieb Orion Poplawski:
I'm starting to test out OpenMPI 1.2 tight integration with SGE and
have run into the following issue. Currently, my startmpi script
massages the hostnames in the machines file created from the SGE
pe_hostfile add an "x" suffi