Thanks for confirming the issue.
I found the source of the problem with the help of SchedMD support.
18.08.4 has this bugfix to prevent commands in the cwd from taking
precedence over commands in your PATH:
https://github.com/SchedMD/slurm/commit/ccafaf7b60090155639edcbdbf4a3ab5e36967c6
The
Hi;
We upgraded from 18.08.3 to 18.08.4 and there is a job_submit.lua script
also. And nearly same issue at our cluster:
$ sbatch batch
sbatch: error: Batch job submission failed: Unspecified error
$ mv batch nobatchy
$ sbatch nobatchy
Submitted batch job 172174
I hope this helps.
Ahmet M.
Yesterday I upgraded from 18.08.3 to 18.08.4. After the upgrade, I found
that batch scripts named "batch" are being rejected. Simply changing the
script name fixes the problem. For example:
$ sbatch batch
sbatch: error: ERROR: A time limit must be specified
sbatch: error: Batch job submission f