Hello,
I have installed slurm-14.11.1 on Red Hat Enterprise Linux Server release
6.5 (Santiago);
I am trying to run a simple script such as:
!/bin/bash -l
#SBATCH -p debug
#SBATCH -n 32
#SBATCH -o %j.out
#SBATCH -e %j.err
date
###########################
the job error file shows:
slurmstepd: couldn't do a strtol on str 1(1): Numerical result out of range
slurmstepd: couldn't do a strtol on str 2(2): Numerical result out of range
slurmstepd: couldn't do a strtol on str 3(3): Numerical result out of range
slurmstepd: couldn't do a strtol on str 4(4): Numerical result out of range
....
while the job output file shows:
Thu Dec 11 09:27:53 AST 2014
The job is still running...
19 debug job.sh R 4:56 1 ca098
It looks lke this error comes either from:
slurm-14.11.1/src/plugins/proctrack/linuxproc/kill_tree.c: ret_l =
strtol(num, &endptr, 10);
slurm-14.11.1/src/plugins/proctrack/linuxproc/kill_tree.c:
error("couldn't do a strtol on str %s(%ld): %m",
or slurm-14.11.1/src/plugins/proctrack/pgid/proctrack_pgid.c
Do you have any suggestion?
Cheers,
Gianluca