Re: [slurm-users] ReqGRES value is not valid
Thank you for your reply I'm sorry there were some mistakes. # srun --version slurm 18.08.6 # $ cat gresgpu.sh #!/bin/bash #SBATCH -J gresgpu# Job name #SBATCH --gres=gpu:2 #SBATCH -o job.%j.out # Name of stdout output file (%j expands to jobId) srun sleep 60 $ o gres.conf Name=gpu File=/dev/tty0 CPUs=0,1 Name=gpu File=/dev/tty1 CPUs=25,26 o slurm.conf TaskPlugin=task/affinity,task/cgroup GresTypes=gpu NodeName=sl02-c00[1-2] Gres=gpu:2 o cgroup.conf CgroupAutomount=yes ConstrainCores=yes ConstrainDevices=yes $ sbatch gresgpu.sh Submitted batch job 76 $ squeue -l Fri Oct 4 14:43:15 2019 JOBID PARTITION NAME USERSTATE TIME TIME_LIMI NODES NODELIST(REASON) 76normal gresgpu test RUNNING 0:01 1-00:00:00 1 c001 $ sacct -j 76 -a -X -o jobid,partition,ReqGRES,AllocGRES,stat JobID Partition ReqGRESAllocGRES State -- -- 76 normalgpu:0gpu:2RUNNING $ Regards, Tomo -Original Message- From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of Chris Samuel Sent: Friday, October 04, 2019 2:38 PM To: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] ReqGRES value is not valid On 3/10/19 10:23 pm, Uemoto, Tomoki wrote: > I don't know why it return value of ReqGres is 0. Which version of Slurm are you on? Also there looks to be a typo, you've got "prun" not "srun" in your batch script. All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
Re: [slurm-users] ReqGRES value is not valid
On 3/10/19 10:23 pm, Uemoto, Tomoki wrote: I don't know why it return value of ReqGres is 0. Which version of Slurm are you on? Also there looks to be a typo, you've got "prun" not "srun" in your batch script. All the best, Chris -- Chris Samuel : http://www.csamuel.org/ : Berkeley, CA, USA
[slurm-users] ReqGRES value is not valid
Hi, all I want to configure generic consumable resources(gpu) and confirm that the resources are assigned to jobs on each node. I executed the following settings. o gres.conf Name=gpu File=/dev/tty[0-3] CPUs=[0-24] Name=gpu File=/dev/tty[4-7] CPUs=[25-47] o slurm.conf TaskPlugin=task/affinity,task/cgroup GresTypes=gpu NodeName=sl02-c00[1-2] Gres=gpu:2 o cgroup.conf CgroupAutomount=yes ConstrainCores=yes ConstrainDevices=yes Then I submitted a sleep job with the gpu resource specified. $ cat gresgpu.sh #!/bin/bash #SBATCH -J gresgpu# Job name #SBATCH --gres=gpu:2 #SBATCH -o job.%j.out # Name of stdout output file (%j expands to jobId) prun sleep 60 $ Return value of ReqGres was 0. I don't know why it return value of ReqGres is 0. $ sbatch gresgpu.sh Submitted batch job 72 $ squeue -l Fri Oct 4 14:14:36 2019 JOBID PARTITION NAME USERSTATE TIME TIME_LIMI NODES NODELIST(REASON) 72normal gresgpu test RUNNING 0:01 1-00:00:00 1 c001 $ sacct -j 72 -a -X -o jobid,partition,ReqGRES,AllocGRES,stat JobID Partition ReqGRESAllocGRES State -- -- 72 normalgpu:0gpu:2RUNNING $ Regards, Tomo