Look at the log files named by SlurmctldLogFile (on the head node) and SlurmdLogFile (on the allocated node) in your slurm.conf. They should show why the batch job failed.
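
If you are not sure where those two files live, something like the following may help. It is only a rough sketch: it assumes the text it is given looks like the "Name = value" lines printed by `scontrol show config`. The helper name `find_log_files` and the sample paths are made up for illustration and are not taken from your cluster.

```python
def find_log_files(config_text):
    """Return {parameter: value} for the two Slurm log-file parameters.

    Assumes each line has the form "Name = value".
    """
    wanted = ("SlurmctldLogFile", "SlurmdLogFile")
    found = {}
    for line in config_text.splitlines():
        key, sep, value = line.partition("=")
        # Skip lines without "=" and parameters we do not care about.
        if sep and key.strip() in wanted:
            found[key.strip()] = value.strip()
    return found


# Hypothetical sample text; the real paths on your system will differ.
sample = """\
SlurmctldLogFile        = /var/log/slurm/slurmctld.log
SlurmdLogFile           = /var/log/slurm/slurmd.log
SlurmdPort              = 6818
"""

for name, path in find_log_files(sample).items():
    print(name, "->", path)
```

On a live cluster you could pass in the output of `subprocess.run(["scontrol", "show", "config"], capture_output=True, text=True).stdout` instead of the sample string, then read the end of each file on the node indicated above right after a failed sbatch submission.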

Quoting Adrian Reich <[email protected]>:

Hello,

I have set up a small SLURM cluster using the SLURM roll within Rocks.
Every time I try to submit an sbatch job, it fails immediately and the job
quits. However, I can request resources using salloc, and everything works.
How can I go about diagnosing where the issue is and what information can I
provide to help in the diagnosis? Thank you.

Sincerely,
Adrian Reich


--
Morris "Moe" Jette
CTO, SchedMD LLC
