subject:"\[slurm\-users\] Limit concurrent gpu resources"

Re: [slurm-users] Limit concurrent gpu resources

2019-04-24 Thread Renfro, Michael

We put a ‘gpu’ QOS on all our GPU partitions, and limit jobs per user to 8 (our GPU capacity) via MaxJobsPerUser. Extra jobs get blocked, allowing other users to queue jobs ahead of the extras. # sacctmgr show qos gpu format=name,maxjobspu Name MaxJobsPU -- - gpu

[slurm-users] Limit concurrent gpu resources

2019-04-24 Thread Mike Cammilleri

Hi everyone, We have a single node with 8 gpus. Users often pile up lots of pending jobs and are using all 8 at the same time, but for a user who just wants to do a short run debug job and needs one of the gpus, they are having to wait too long for a gpu to free up. Is there a way with