On 12.08.2016 at 21:44, r...@open-mpi.org wrote:

> Don’t know about the toolchain issue - I use those same versions, and don’t 
> have a problem. I’m on CentOS-7, so that might be the difference?
> 
> Anyway, I found the missing code to assemble the cmd line for qrsh - not sure 
> how/why it got deleted.
> 
> https://github.com/open-mpi/ompi/pull/1960

Yep, it's working again - thx.

But surely there was a reason behind the removal, which should be clarified within 
the Open MPI team to avoid any side effects from fixing this issue.

-- Reuti

PS: The other items I'll investigate on Monday.


>> On Aug 12, 2016, at 12:15 PM, Reuti <re...@staff.uni-marburg.de> wrote:
>> 
>>> 
>>> On 12.08.2016 at 16:52, r...@open-mpi.org wrote:
>>> 
>>> IIRC, the rationale behind adding the check was that someone using SGE 
>>> wanted to specify a custom launch agent, and we were overriding it with 
>>> qrsh. However, the check is incorrect as that MCA param cannot be NULL.
>>> 
>>> I have updated this on master - can you see if this fixes the problem for 
>>> you?
>>> 
>>> https://github.com/open-mpi/ompi/pull/1957
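
To illustrate why a NULL test can never fire on such a parameter, here is a minimal sketch in C. It is not Open MPI's actual code: the names `agent_param`, `DEFAULT_AGENT` and `user_set_custom_agent` are hypothetical, and the default value "ssh : rsh" is an assumption. The point is only that a string parameter registered with a default always holds some value, so a custom agent has to be detected some other way, here by comparing against the default.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Assumed default for the launch agent parameter (hypothetical). */
#define DEFAULT_AGENT "ssh : rsh"

/* Return the effective parameter value: the user's setting if one was
 * given, otherwise the default. The result is never NULL. */
static const char *agent_param(const char *user_value)
{
    return (user_value != NULL) ? user_value : DEFAULT_AGENT;
}

/* True only if the effective value differs from the default, i.e. the
 * user asked for a custom launch agent. */
static bool user_set_custom_agent(const char *effective)
{
    return strcmp(effective, DEFAULT_AGENT) != 0;
}
```

With this sketch, `agent_param(NULL)` yields the default and `user_set_custom_agent()` returns false for it, so qrsh could still be selected. One limitation of comparing against the default: a user who explicitly sets the default string cannot be told apart from one who set nothing.
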
>> 
>> I updated my tools to:
>> 
>> autoconf-2.69
>> automake-1.15
>> libtool-2.4.6
>> 
>> but with Open MPI's ./autogen.pl I get:
>> 
>> configure.ac:152: error: possibly undefined macro: AC_PROG_LIBTOOL
>> 
>> I recall seeing it before; how do I get rid of it? For now I fixed the 
>> single source file by hand.
>> 
>> -- Reuti
>> 
>> 
>>> As for the blank in the cmd line - that is likely due to a space reserved 
>>> for some entry that you aren’t using (e.g., when someone manually specifies 
>>> the prefix). It shouldn’t cause any harm as the cmd line parser is required 
>>> to ignore spaces.
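
As a rough illustration of what "ignoring spaces" means for a parser that splits on blanks, here is a small C sketch. It is not the parser used by Open MPI or SGE; `count_tokens` is a hypothetical helper that treats any run of blanks as a single separator.

```c
#include <stddef.h>

/* Count the blank-separated tokens in cmd. A run of blanks counts as one
 * separator, and leading or trailing blanks are ignored. */
static size_t count_tokens(const char *cmd)
{
    size_t count = 0;
    int in_token = 0;

    for (; *cmd != '\0'; cmd++) {
        if (*cmd == ' ' || *cmd == '\t') {
            in_token = 0;          /* blank: the current token has ended */
        } else if (!in_token) {
            in_token = 1;          /* first character of a new token */
            count++;
        }
    }
    return count;
}
```

Under this scheme a command line with one blank before "orted" and the same line with two blanks produce the same number of tokens. This only holds when something actually splits the string; if the whole string is passed on as one argument, the extra blank stays inside it.
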
>>> 
>>> The -ldl problem sounds like a configuration issue - you might want to file 
>>> a separate issue about it.
>>> 
>>>> On Aug 11, 2016, at 4:28 AM, Reuti <re...@staff.uni-marburg.de> wrote:
>>>> 
>>>> Hi,
>>>> 
>>>> In the file orte/mca/plm/rsh/plm_rsh_component I see an if-statement, 
>>>> which seems to prevent the tight integration with SGE to start:
>>>> 
>>>>  if (NULL == mca_plm_rsh_component.agent) {
>>>> 
>>>> Why is it there (it wasn't in 1.10.3)?
>>>> 
>>>> If I just remove it I get:
>>>> 
>>>> [node17:25001] [[27678,0],0] plm:rsh: final template argv:
>>>>      qrsh <template>  orted --hnp-topo-sig ...
>>>> 
>>>> instead of the former:
>>>> 
>>>> /usr/sge/bin/lx24-amd64/qrsh -inherit -nostdin -V -verbose <template>  
>>>> orted --hnp-topo-sig ...
>>>> 
>>>> So, just removing the if-statement is not a perfect cure as the 
>>>> $SGE_ROOT/$ARC does not prefix `qrsh`.
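
The expected prefix can be sketched in C as follows. This is an illustration, not the code from Open MPI: `build_qrsh_path` is a hypothetical helper, and the "bin" path component is inferred from the former output shown above (/usr/sge/bin/lx24-amd64/qrsh).

```c
#include <stddef.h>
#include <stdio.h>

/* Write "<sge_root>/bin/<arc>/qrsh" into buf.
 * Returns 0 on success, -1 if an input is NULL or buf is too small. */
static int build_qrsh_path(char *buf, size_t len,
                           const char *sge_root, const char *arc)
{
    if (buf == NULL || sge_root == NULL || arc == NULL) {
        return -1;
    }
    int n = snprintf(buf, len, "%s/bin/%s/qrsh", sge_root, arc);
    /* snprintf reports the length it wanted; reject truncated results. */
    return (n < 0 || (size_t)n >= len) ? -1 : 0;
}
```

A caller would typically pass `getenv("SGE_ROOT")` and `getenv("ARC")`, then append the options seen in the former command line (-inherit -nostdin -V -verbose). Returning -1 for an unset variable lets the caller fall back to a plain `qrsh` lookup via PATH.
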
>>>> 
>>>> ==
>>>> 
>>>> BTW: why is there a blank before " orted" in the assembled command line? 
>>>> It really is in the argument when I check on the slave nodes what is to 
>>>> be started by the `qrsh_starter`. As long as there is a wrapping shell, 
>>>> it will be removed anyway. But in a special setup we noticed this 
>>>> additional blank.
>>>> 
>>>> ==
>>>> 
>>>> I also notice that I have to supply "-ldl" to `mpicc` to allow the 
>>>> compilation of an application to succeed in 2.0.0.
>>>> 
>>>> -- Reuti
>>>> _______________________________________________
>>>> users mailing list
>>>> users@lists.open-mpi.org
>>>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users