Re: [slurm-users] Random "sbatch" failure: "Socket timed out on send/recv operation"

2019-06-13 Thread Christopher W. Harrop
> ...
>> One way I'm using to work around this is to inject a long random string
>> into the --comment option. Then, if I see the socket timeout, I use squeue
>> to look for that job and retrieve its ID. It's not ideal, but it can work.
>
> I would have expected a different approach: use a
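The workaround quoted above (tag each submission with a random comment, then search the queue for that tag if sbatch times out) might look roughly like the following sketch. The thread does not show the actual commands used, so the function names here are hypothetical. The sketch assumes squeue's `%i` (job ID) and `%k` (comment) format fields and sbatch's `--comment` and `--parsable` options.

```python
import secrets
import subprocess
from typing import List, Optional


def make_tag(nbytes: int = 16) -> str:
    """Return a random hex string to embed in the job comment (hypothetical helper)."""
    return secrets.token_hex(nbytes)


def build_sbatch_cmd(script: str, tag: str) -> List[str]:
    """Build an sbatch command line that carries the tag in --comment."""
    return ["sbatch", "--parsable", f"--comment={tag}", script]


def find_job_id(squeue_output: str, tag: str) -> Optional[str]:
    """Scan 'jobid|comment' lines and return the ID of the job whose comment is the tag."""
    for line in squeue_output.splitlines():
        job_id, _, comment = line.partition("|")
        if comment.strip() == tag:
            return job_id.strip()
    return None


def lookup_job(tag: str, user: str) -> Optional[str]:
    """Ask squeue for the user's jobs and look for the tagged one.

    Assumes squeue is on PATH; %i is the job ID and %k is the job comment.
    """
    result = subprocess.run(
        ["squeue", "--noheader", "--user", user, "--format=%i|%k"],
        capture_output=True,
        text=True,
        check=True,
    )
    return find_job_id(result.stdout, tag)
```

A caller would run the command from `build_sbatch_cmd` and, only if it fails with the socket timeout, call `lookup_job` with the same tag to see whether the job was queued anyway. As the original poster notes, this is not ideal: the job may not yet be visible in squeue at the moment of the lookup, so a short retry loop may be needed.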

[slurm-users] heterogeneous launch inside a non-heterogeneous job

2018-10-15 Thread Christopher W. Harrop
srun --nodes=1-1 --tasks-per-node=1 : --nodes=2-2 --tasks-per-node=4 ./hello.exe
Chris
---
Christopher W. Harrop email

Re: [slurm-users] Complex resource requests for a single job

2018-07-25 Thread Christopher W. Harrop
ff White
>
> On 07/24/2018 09:59 AM, Christopher W. Harrop wrote:
>> Hi,
>>
>> I am sorry if this basic question has been asked before. I’ve searched the
>> documentation and lists but can’t seem to find the answer.
>>
>> How does one submit a job that r

[slurm-users] Complex resource requests for a single job

2018-07-24 Thread Christopher W. Harrop
Chris
---
Christopher W. Harrop    email: christopher.w.har...@noaa.gov
Global Systems Division  voice: (303) 497-6808