Re: [OMPI users] Startup limited to 128 remote hosts in some situations?

2017-01-17 Thread r...@open-mpi.org
As I recall, the problem was that qrsh isn’t available on the backend compute nodes, and so we can’t use a tree for launch. If that isn’t true, then we can certainly adjust it. > On Jan 17, 2017, at 9:37 AM, Mark Dixon wrote: > > Hi, > > While commissioning a new cluster, I wanted to run HPL

[OMPI users] Startup limited to 128 remote hosts in some situations?

2017-01-17 Thread Mark Dixon
Hi, While commissioning a new cluster, I wanted to run HPL across the whole thing using openmpi 2.0.1. I couldn't get it to start on more than 129 hosts under Son of Gridengine (128 remote plus the localhost running the mpirun command). openmpi would sit there, waiting for all the orted's to

Re: [OMPI users] KNEM errors when running OMPI 2.0.1

2017-01-17 Thread Joshua Ladd
Can you please attach your configure log. It looks like both MXM and the Vader BTL (used for OSC) are complaining because they can't find your KNEM installation. Josh On Tue, Jan 17, 2017 at 7:16 AM, Juan A. Cordero Varelaq < bioinformatica-i...@us.es> wrote: > Hi, I am running on my SCG cluster

[OMPI users] KNEM errors when running OMPI 2.0.1

2017-01-17 Thread Juan A. Cordero Varelaq
Hi, I am running on my SCG cluster the following script (using qsub): #!/bin/bash #$-cwd #$ -S /bin/bash #$ -V #$ -q normal #$ -pe mpi 40 #$ -P Lab219 #$ -o output #$ -e error module load PhyML/3.3 mpirun --mca pml yalla -np 40 phyml-mpi -i proteic -b 10 -d aa where phyml-mpi is the parallel ver