Am 12.08.2016 um 21:44 schrieb r...@open-mpi.org: > Don’t know about the toolchain issue - I use those same versions, and don’t > have a problem. I’m on CentOS-7, so that might be the difference? > > Anyway, I found the missing code to assemble the cmd line for qrsh - not sure > how/why it got deleted. > > https://github.com/open-mpi/ompi/pull/1960
Yep, it's working again - thx. But for sure there was a reason behind the removal, which may be elaborated in the Open MPI team to avoid any side effects by fixing this issue. -- Reuti PS: The other items I'll investigate on Monday. >> On Aug 12, 2016, at 12:15 PM, Reuti <re...@staff.uni-marburg.de> wrote: >> >>> >>> Am 12.08.2016 um 16:52 schrieb r...@open-mpi.org: >>> >>> IIRC, the rationale behind adding the check was that someone using SGE >>> wanted to specify a custom launch agent, and we were overriding it with >>> qrsh. However, the check is incorrect as that MCA param cannot be NULL. >>> >>> I have updated this on master - can you see if this fixes the problem for >>> you? >>> >>> https://github.com/open-mpi/ompi/pull/1957 >> >> I updated my tools to: >> >> autoconf-2.69 >> automake-1.15 >> libtool-2.4.6 >> >> but I face with Open MPI's ./autogen.pl: >> >> configure.ac:152: error: possibly undefined macro: AC_PROG_LIBTOOL >> >> I recall seeing in already before, how to get rid of it? For now I fixed the >> single source file just by hand. >> >> -- Reuti >> >> >>> As for the blank in the cmd line - that is likely due to a space reserved >>> for some entry that you aren’t using (e.g., when someone manually specifies >>> the prefix). It shouldn’t cause any harm as the cmd line parser is required >>> to ignore spaces >>> >>> The -ldl problem sounds like a configuration issue - you might want to file >>> a separate issue about it >>> >>>> On Aug 11, 2016, at 4:28 AM, Reuti <re...@staff.uni-marburg.de> wrote: >>>> >>>> Hi, >>>> >>>> In the file orte/mca/plm/rsh/plm_rsh_component I see an if-statement, >>>> which seems to prevent the tight integration with SGE to start: >>>> >>>> if (NULL == mca_plm_rsh_component.agent) { >>>> >>>> Why is it there (it wasn't in 1.10.3)? >>>> >>>> If I just remove it I get: >>>> >>>> [node17:25001] [[27678,0],0] plm:rsh: final template argv: >>>> qrsh <template> orted --hnp-topo-sig ... >>>> >>>> instead of the former: >>>> >>>> /usr/sge/bin/lx24-amd64/qrsh -inherit -nostdin -V -verbose <template> >>>> orted --hnp-topo-sig ... >>>> >>>> So, just removing the if-statement is not a perfect cure as the >>>> $SGE_ROOT/$ARC does not prefix `qrsh`. >>>> >>>> == >>>> >>>> BTW: why is there blank before " orted" in the assembled command line - >>>> and it's really in the argument when I check this on the slave nodes what >>>> should be started by the `qrsh_starter`? As long as there is a wrapping >>>> shell, it will be removed anyway. But in a special setup we noticed this >>>> additional blank. >>>> >>>> == >>>> >>>> I also notice, that I have to supply "-ldl" to `mpicc` to allow the >>>> compilation of an application to succeed in 2.0.0. >>>> >>>> -- Reuti >>>> _______________________________________________ >>>> users mailing list >>>> users@lists.open-mpi.org >>>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users >>> >>> _______________________________________________ >>> users mailing list >>> users@lists.open-mpi.org >>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users >>> >> >> _______________________________________________ >> users mailing list >> users@lists.open-mpi.org >> https://rfd.newmexicoconsortium.org/mailman/listinfo/users > > _______________________________________________ > users mailing list > users@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/users _______________________________________________ users mailing list users@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/users