Hi,

On 14 May 2013 09:05, Reuti <[email protected]> wrote:
>
> Am 14.05.2013 um 01:26 schrieb Riccardo Murri:
>
>> is there a prolog-like script that gets executed before a task spawned
>> by qrsh (as part of a parallel job) is run?  The usual prolog is only
>> run on the master node at the start of the job, but I'm trying to
>> intercept instances of `qrsh -V` spawned by OpenMPI's `mpiexec`.
>
> Why?

To workaround the multi-line environment variables bug that's still
present in OGE 6.2u8.
(But I ended up using QRSH_WRAPPER for that purpose.)


> In the prolog of the PE you could program a loop across all granted nodes by 
> using a `qrsh -inherit -V ...` to all nodes therein to make some preparations.

I'm now doing this:

        HOSTS=$( cut -d' ' -f1 $PE_HOSTFILE | fgrep -v $(hostname -s) )
        for host in $HOSTS; do
            qrsh -inherit -V $host $RUNSCRIPT | grep -q "failed"
            # ... react on failure
        done

but apparently `execd` does not allow me to qrsh from the prolog:

    error: executing task of job 2648607 failed: execution daemon on
host "r01c04b04n02" didn't accept task

anything wrong with my use of `qrsh`?

Thanks,
Riccardo

--
Riccardo Murri
http://www.gc3.uzh.ch/people/rm

Grid Computing Competence Centre
University of Zurich
Winterthurerstrasse 190, CH-8057 Zürich (Switzerland)
Tel: +41 44 635 4222
Fax: +41 44 635 6888

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to