Am 12.08.2016 um 16:52 schrieb r...@open-mpi.org: > IIRC, the rationale behind adding the check was that someone using SGE wanted > to specify a custom launch agent, and we were overriding it with qrsh. > However, the check is incorrect as that MCA param cannot be NULL. > > I have updated this on master - can you see if this fixes the problem for you? > > https://github.com/open-mpi/ompi/pull/1957
As written initially, I get now this verbose output with " --mca plm_base_verbose 10": [node22:02220] mca: base: close: component isolated closed [node22:02220] mca: base: close: unloading component isolated [node22:02220] mca: base: close: component slurm closed [node22:02220] mca: base: close: unloading component slurm [node22:02220] [[28119,0],0] plm:rsh: final template argv: qrsh <template> orted --hnp-topo-sig 2N:2S:2L3:8L2:8L1:8C:8H:x86_64 -mca ess "env" -mca ess_base_jobid "1842806784" -mca es s_base_vpid "<template>" -mca ess_base_num_procs "9" -mca orte_hnp_uri "1842806784.0;usock;tcp://192.168.154.22,192.168.154.92:46186 " --mca plm_base_verbose "10" -mca plm "rsh" -mca pmix "^s1,s2,cray" bash: node13: command not found bash: node20: command not found bash: node12: command not found bash: node16: command not found bash: node17: command not found bash: node14: command not found bash: node15: command not found Your "qrsh" request could not be scheduled, try again later. Sure, the name of the machine is allowed only after the additional "-inherit" to `qrsh`. Please see below for the complete <template> in 1.10.3, hence the assembly seems also not to be done in the correct way. -- Reuti > On Aug 11, 2016, at 4:28 AM, Reuti <re...@staff.uni-marburg.de> wrote: > ... > instead of the former: > > /usr/sge/bin/lx24-amd64/qrsh -inherit -nostdin -V -verbose <template> orted > --hnp-topo-sig ... _______________________________________________ users mailing list users@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/users