Hi everybody,

I need to run a single driver program that only require one proc with the
command *mpirun -np 1 ./app *or *./app*. But it will schedule the launch of
other executable files including parallel and sequential computing. So I
require more than one proc to run it. It can be run smoothly as an
interactive job with the command below.
*qrsh -cwd -pe "ompi*" 6 -l h_rt=00:30:00,test=true ./app*

But after I submitted the job, a strange error occurred and it stopped...
Please find the job script and error message below:

*job submission script:*
#$ -S /bin/bash
#$ -N couple
#$ -cwd
#$ -j y
#$ -l h_rt=05:00:00
#$ -l h_vmem=2G
#$ -o couple.out
#$ -pe ompi*  6
    ./app

*error message:*
error: executing task of job 6777095 failed:
[cl231:23777] ERROR: A daemon on node cl231 failed to start as expected.
[cl231:23777] ERROR: There may be more information available from
[cl231:23777] ERROR: the 'qstat -t' command on the Grid Engine tasks.
[cl231:23777] ERROR: If the problem persists, please restart the
[cl231:23777] ERROR: Grid Engine PE job
[cl231:23777] ERROR: The daemon exited unexpectedly with status 1.

Thanks for any help!

Pengcheng

Reply via email to