Re: [slurm-users] ReqGRES value is not valid

2019-10-03 Thread Uemoto, Tomoki

Thank you for your reply.

I'm sorry, my previous mail contained some mistakes. The corrected details are below.

# srun --version
slurm 18.08.6
#

$ cat gresgpu.sh
#!/bin/bash

#SBATCH -J gresgpu    # Job name
#SBATCH --gres=gpu:2
#SBATCH -o job.%j.out # Name of stdout output file (%j expands to jobId)

srun sleep 60
$

o gres.conf
  Name=gpu File=/dev/tty0 CPUs=0,1
  Name=gpu File=/dev/tty1 CPUs=25,26

o slurm.conf
  TaskPlugin=task/affinity,task/cgroup
  GresTypes=gpu
  NodeName=sl02-c00[1-2] Gres=gpu:2

o cgroup.conf
  CgroupAutomount=yes
  ConstrainCores=yes
  ConstrainDevices=yes

$ sbatch gresgpu.sh
Submitted batch job 76

$ squeue -l
Fri Oct  4 14:43:15 2019
 JOBID PARTITION     NAME     USER    STATE       TIME TIME_LIMI  NODES NODELIST(REASON)
    76    normal  gresgpu     test  RUNNING       0:01 1-00:00:00      1 c001

$ sacct -j 76 -a -X -o jobid,partition,ReqGRES,AllocGRES,stat
       JobID  Partition      ReqGRES    AllocGRES      State
------------ ---------- ------------ ------------ ----------
76               normal        gpu:0        gpu:2    RUNNING
$
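
The aligned sacct columns are easy to misread once a mail client rewraps them, so a pipe-delimited comparison may be more reliable. The following is a minimal sketch, assuming sacct is asked for exactly the three fields jobid, ReqGRES, and AllocGRES with `--parsable2 --noheader`; `gres_mismatch` is a hypothetical helper name, not a Slurm command.

```shell
# Read "JobID|ReqGRES|AllocGRES" rows from stdin and print only the jobs
# whose requested and allocated GRES strings differ.
# gres_mismatch is a hypothetical helper, not part of Slurm.
gres_mismatch() {
    awk -F'|' '$2 != $3 { printf "%s requested=%s allocated=%s\n", $1, $2, $3 }'
}

# Intended use on a cluster with accounting enabled:
#   sacct -j 76 -X --noheader --parsable2 -o jobid,ReqGRES,AllocGRES | gres_mismatch
```

Fed the row `76|gpu:0|gpu:2`, which corresponds to the output above, the helper prints `76 requested=gpu:0 allocated=gpu:2`; rows where both fields agree are not printed.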

Regards, 
Tomo



-----Original Message-----
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of Chris Samuel
Sent: Friday, October 04, 2019 2:38 PM
To: slurm-users@lists.schedmd.com
Subject: Re: [slurm-users] ReqGRES value is not valid

On 3/10/19 10:23 pm, Uemoto, Tomoki wrote:

> I don't know why the value returned for ReqGRES is 0.

Which version of Slurm are you on?

Also there looks to be a typo, you've got "prun" not "srun" in your batch 
script.

All the best,
Chris
--
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA

Re: [slurm-users] ReqGRES value is not valid

2019-10-03 Thread Chris Samuel


On 3/10/19 10:23 pm, Uemoto, Tomoki wrote:

> I don't know why the value returned for ReqGRES is 0.


Which version of Slurm are you on?

Also there looks to be a typo, you've got "prun" not "srun" in your 
batch script.


All the best,
Chris
--
 Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA

[slurm-users] ReqGRES value is not valid

2019-10-03 Thread Uemoto, Tomoki

Hi, all

I want to configure generic consumable resources (GPUs) and confirm that the
resources are assigned to jobs on each node.

I applied the following settings.

o gres.conf
  Name=gpu File=/dev/tty[0-3] CPUs=[0-24]
  Name=gpu File=/dev/tty[4-7] CPUs=[25-47]

o slurm.conf
  TaskPlugin=task/affinity,task/cgroup
  GresTypes=gpu
  NodeName=sl02-c00[1-2] Gres=gpu:2

o cgroup.conf
  CgroupAutomount=yes
  ConstrainCores=yes
  ConstrainDevices=yes
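
As a side check on the gres.conf lines above, the number of device files they name can be compared with the Gres= count on the node line in slurm.conf. The following is a minimal sketch in POSIX shell, assuming each File= value is either a plain path or ends in a single `[a-b]` range without leading zeros; `count_gres_files` and `count_gpu_devices` are hypothetical helper names, not Slurm tools, and the sketch does not claim that a mismatch explains the ReqGRES value.

```shell
# Count the device files named by one gres.conf File= value.
# Handles a plain path (/dev/tty0) or one trailing range (/dev/tty[0-3]).
# Hypothetical helper; the body runs in a subshell so no variables leak.
count_gres_files() (
    case $1 in
        *\[*-*\])
            range=${1##*\[}     # e.g. "0-3]"
            range=${range%\]}   # e.g. "0-3"
            first=${range%-*}
            last=${range#*-}
            echo $(( last - first + 1 ))
            ;;
        *)
            echo 1
            ;;
    esac
)

# Sum the device files over every Name=gpu line read from stdin.
count_gpu_devices() (
    total=0
    set -f    # keep the [a-b] ranges from being glob-expanded while splitting
    while read -r line || [ -n "$line" ]; do
        case $line in
            *Name=gpu*) ;;
            *) continue ;;
        esac
        for field in $line; do
            case $field in
                File=*) total=$(( total + $(count_gres_files "${field#File=}") )) ;;
            esac
        done
    done
    echo "$total"
)

# Intended use on a node (the path is an assumption about the install):
#   count_gpu_devices < /etc/slurm/gres.conf
```

For the two gres.conf lines above the sketch prints 8, while the node line in slurm.conf declares Gres=gpu:2.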

Then I submitted a sleep job with the gpu resource specified.

$ cat gresgpu.sh
#!/bin/bash

#SBATCH -J gresgpu    # Job name
#SBATCH --gres=gpu:2
#SBATCH -o job.%j.out # Name of stdout output file (%j expands to jobId)

prun sleep 60
$

The value returned for ReqGRES was 0, although the job requested gpu:2 and AllocGRES shows gpu:2.
I don't know why the value returned for ReqGRES is 0.

$ sbatch gresgpu.sh
Submitted batch job 72

$ squeue -l
Fri Oct  4 14:14:36 2019
 JOBID PARTITION     NAME     USER    STATE       TIME TIME_LIMI  NODES NODELIST(REASON)
    72    normal  gresgpu     test  RUNNING       0:01 1-00:00:00      1 c001


$ sacct -j 72 -a -X -o jobid,partition,ReqGRES,AllocGRES,stat
       JobID  Partition      ReqGRES    AllocGRES      State
------------ ---------- ------------ ------------ ----------
72               normal        gpu:0        gpu:2    RUNNING
$

Regards,
Tomo
