Dear Wien2k Experts,
I am trying to submit a job but the chance of job submission failure are
more than 90%.

It is SGE resource manager system and job file is take from FAQs as such.

In job.out file, I am getting this message:

qrsh_starter: cannot change to directory /mnt/oss/hod/wien_case/TCO_1: No
such file or directory
qrsh_starter: cannot change to directory /mnt/oss/hod/wien_case/TCO_1: No
such file or directory

In job.err:

qrsh_starter: cannot change to directory /mnt/oss/hod/wien_case/TCO_1: No
such file or directory
qrsh_starter: cannot change to directory /mnt/oss/hod/wien_case/TCO_1: No
such file or directory
[mpiexec@compute-0] control_cb (./pm/pmiserv/pmiserv_cb.c:717): assert
(!closed) failed
[mpiexec@compute-1] HYDT_dmxu_poll_wait_for_event
(./tools/demux/demux_poll.c:77): callback returned error status
[mpiexec@compute-2] HYD_pmci_wait_for_completion
(./pm/pmiserv/pmiserv_pmci.c:435): error waiting for event
[mpiexec@compute-3] main (./ui/mpich/mpiexec.c:901): process manager error
waiting for completion


I tried with varying number of cores (16/32/48/64) but the problem
persists.

System specifications:

sge cluster (linuxifc) with 5 nodes with each node having 16 core and each
core has 4GB RAM (~2GB/ processor), 40 Gbps Infiniband interconnect. I used
"mpiifort" and "mpiicc"   compiler with scalapck, blas, fftd3 and blacs
library (without ELPA and LIBXC-3.0.0).


Parallel options:

setenv TASKSET "no"
if ( ! $?USE_REMOTE ) setenv USE_REMOTE 0
if ( ! $?MPI_REMOTE ) setenv MPI_REMOTE 0
setenv WIEN_GRANULARITY 1
setenv DELAY 0.1
setenv SLEEPY 1
setenv WIEN_MPIRUN "mpirun -np _NP_ -machinefile _HOSTS_ _EXEC_"
setenv CORES_PER_NODE 16
# if ( ! $?PINNING_COMMAND) setenv PINNING_COMMAND "--cpu_bin=map_cpu:"
# if ( ! $?PINNING_LIST ) setenv PINNING_LIST
"0,8,1,9,2,10,3,11,4,12,5,13,6,14,7,15"

Sincerely
Bhamu
_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at:  
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

Reply via email to