On Tue, Jan 6, 2015 at 12:38 PM, William Hay <[email protected]> wrote:

> I don't know of a fix, but the first thing I would check is
> whether your job is tightly integrated (that is, starting slave processes
> via Grid Engine).  To check this, log into a node running slave processes
> and check whether they are descended from an sge_shepherd.
>
Dear William,

Yes, it is tightly integrated. This was working fine before the upgrade
and is still working fine with Open MPI. The pstree output on a slave node
looks like this:
     |-sge_execd-+-load-sensor
     |           |-sge_shepherd-+-mycoshepherd
     |           |              |-qrsh_starter---hydra_pmi_proxy---8*[mpitests-IMB-MP---{mpitests-IMB-M}]
     |           |              `-6*[{sge_shepherd}]
     |           `-4*[{sge_execd}]

So the MPI processes are all descendants of an sge_shepherd, which is
itself a child of the execd.
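
For anyone who would rather script this check than read pstree by eye,
here is a minimal sketch. It assumes a Linux node with /proc mounted; the
function names read_proc_table and has_ancestor are my own illustration
and not part of Grid Engine. Note that the kernel truncates process names
to 15 characters, which is why the MPI binary shows up as mpitests-IMB-MP.

```python
import os


def read_proc_table(proc_root="/proc"):
    """Return {pid: (ppid, comm)} for each process under proc_root (Linux only)."""
    table = {}
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a process directory
        try:
            with open(os.path.join(proc_root, entry, "stat"), errors="replace") as fh:
                stat = fh.read()
        except OSError:
            continue  # process exited while we were scanning
        # Layout is "pid (comm) state ppid ..."; comm may itself contain
        # spaces or parentheses, so split on the outermost parentheses.
        lpar = stat.index("(")
        rpar = stat.rindex(")")
        comm = stat[lpar + 1:rpar]
        fields = stat[rpar + 2:].split()  # fields[0] = state, fields[1] = ppid
        table[int(entry)] = (int(fields[1]), comm)
    return table


def has_ancestor(table, pid, ancestor_name="sge_shepherd"):
    """True if any ancestor of pid (excluding pid itself) is named ancestor_name."""
    seen = set()
    current = table.get(pid, (0, ""))[0]  # start from the parent
    while current in table and current not in seen:
        seen.add(current)  # guard against a malformed, cyclic table
        ppid, comm = table[current]
        if comm == ancestor_name:
            return True
        current = ppid
    return False
```

To use it on a slave node, call read_proc_table() once, then call
has_ancestor(table, pid) for every pid whose name is mpitests-IMB-MP.
Any process for which it returns False was started outside Grid
Engine's control.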

Regards, Götz
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
