Hi all,
I have a question about set timeout limit for MPI data transmissions. Our
users run their parallel jobs (with openmpi) on our HPC cluster. Sometimes the
job may hang due to unknown reason. In such case a job is still in "RUN"
status, all processes of this job are running. But not outpu
nodes. You
need to list each node once for each proc - something like this:
nodeA
nodeB
nodeB
nodeC
nodeA
nodeC
...
would produce your described pattern
On Apr 10, 2014, at 7:25 AM, Gan, Qi PW wrote:
> We have OMPI 1.4.0, 1.4.5 and 1.6.5 installed on our system.
> >>What version of
We have OMPI 1.4.0, 1.4.5 and 1.6.5 installed on our system.
>>What version of OMPI are you using? We have a "seq" mapper that does what you
>>want, but the precise cmd line option for directing to use it depends a bit
>>on the version.
>>On Apr 9, 2014, at
Hi,
I have a problem when setting the processes of a parallel job with specified
order. Suppose a job with 6 processes (rank0 to rank5) needs to run on 3 hosts
(A, B, C) with following order:
Rank0 -- A
Rank1 -- B
Rank2 -- B
Rank3 -- C
Rank4 -- A